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Introduction 


This book is the first of two volumes on differential geometry and mathematical 
physics. It is the result of our teaching these subjects at the University of Leipzig 
over the last few decades to students of physics and of mathematics. The present 
volume is devoted to manifolds, Lie groups and the theory of Hamiltonian systems. 
The second volume will deal with fibre bundles, topology and gauge field theory, 
including aspects of the theory of gravity. 

While the aim and scope of differential geometry are somewhat well defined, it 
is, perhaps, less clear what we possibly mean by mathematical physics. Historically, 
this term was rather imprecise and so is still nowadays. Indeed, its interpretation de- 
pends on culture and context. In our understanding, mathematical physics is the area 
where theoretical physics and pure mathematics meet, stimulate and fertilize each 
other. On the one hand, this interplay leads to a deeper structural understanding of 
theoretical physics and to new results obtained with new mathematical methods. On 
the other hand, it stimulates the development of old and new branches in mathemat- 
ics. Thus, in our understanding, it is impossible to draw a precise borderline between 
theoretical physics and pure mathematics. Over the last decades it sometimes hap- 
pened that the solution of a problem posed by physicists had an even larger impact 
on the development of mathematics than on the field of physics from where it arose. 
There is a number of texts where the status and the role of mathematical physics is 
discussed, see e.g. the papers of Greenberg [112], Faddeev [87] and Jaffe and Quinn 
[151],! as well as the classical contributions of Poincaré [241] and Hilbert [128]. 

There is no doubt that, over the last few centuries, the interrelation between 
physics and geometry has been especially tight and fruitful. In particular, this in- 
teraction has stimulated the development of modern differential geometry. In this 
complex process, which we cannot describe here,” the development of the notion 
of manifold was of great importance. The conceptual definition of this notion was 


'This very interesting and somewhat provocative article stimulated a lot of responses by leading 
scientists, see [27]. 


2See [264-266] and [281, 282] for a detailed discussion of these historical aspects. 
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presented by Riemann in his famous Habilitationsvortrag in Gottingen in the year 
1854, see [251].° Riemann developed a general geometry (nowadays called Rie- 
mannian geometry), which included Euclidean as well as non-Euclidean geometry 
as originated by Bolyai, Gau8 and Lobachevsky. From Riemann’s presentation it is 
clear that a deeper understanding of the nature of physical space was one of the main 
motivations for his studies. In his understanding, space works on bodies and bodies 
have an influence on space. This idea made Riemann depart from the metaphysical 
attitude towards space as a given unchangeable entity and pass to a modern field the- 
oretical point of view. He even suggested that the metric might be determined by the 
physical masses. Thus, on a rather philosophical level, he made a step towards the 
conceptual foundations of Einstein’s theory of gravity, which came 60 years later. 
At the same time, he created the mathematical framework for this theory. 

In the following years, a number of great mathematicians contributed to the field, 
but it was Poincaré who brought the concept of manifold to its modern form.* As he 
said himself, he was led to this concept by his previous investigations on the theory 
of differential equations and its applications to dynamics, in particular, of the n-body 
problem. On the one hand, on the basis of this abstract manifold concept, he laid 
the foundations of modern algebraic topology. On the other hand, he continued his 
studies on dynamical systems with emphasis on their global, qualitative behaviour, 
on the way creating a lot of tools which nowadays still play an important role. At the 
same time, he provided a geometrization of the formalism of analytical mechanics 
as developed by Lagrange,> Hamilton,° Jacobi, Liouville and Poisson. Instead of 
formulating dynamics in terms of local coordinates in Euclidean space, he viewed 
it as a global system described by a Hamiltonian vector field on the phase space 
manifold. Thus, modern symplectic geometry was born.’ 

In the twentieth century, the interaction between physics and geometry continued 
to be strong and successful. Of course, first of all, we should mention Einstein’s the- 
ory of gravity, which is based on the discovery that gravity is a geometric property of 
spacetime and that spacetime is curved by matter. Starting from the nineteen-fifties, 
all other fundamental forces were geometrized in a similar spirit leading to modern 
gauge theory. The necessary mathematical foundations, including the general theory 


3There were, of course, precursors. In particular, Cauchy and Gauf should be mentioned. Gau8 
even used the term manifold, but in his understanding it was restricted to affine subspaces of an 
n-dimensional vector space. 


4See the famous Analysis situs [239] from the year 1895, together with five subsequent comple- 
ments within the following nine years. Actually, Poincaré gave two definitions of a manifold: a 
manifold as a level set and a manifold as given by an atlas of local charts. 


>The origin of symplectic geometry dates back to Lagrange’s early work on celestial mechanics, 
see [177] and [308] for a detailed discussion by Weinstein. 


See [120-122]. Hamilton was led to the formulation of dynamics in terms of a system of first 
order differential equations for a general mechanical system through his studies in optics. 


7The word symplectic was invented by Weyl [312] to give a name to the group of linear trans- 
formations, preserving a non-degenerate, skew-symmetric bilinear form, and the term symplectic 
geometry was proposed by Siegel, see [272]. 
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of fibre bundles and connections, were developed by E. Cartan, Koszul, Ehresmann 
and Chern. Once these geometrical formulations of the fundamental forces had been 
found, another fascinating interaction of geometry and physics took place. In the 
mid-seventies physicists insisted on classifying the solutions (of a certain type) of 
the Yang-Mills field equations of classical gauge theory. This problem was finally 
solved by leading mathematicians and the techniques developed on the way, in turn, 
led to fundamentally new and deep insights in topology. In particular, exotic topo- 
logical structures on Euclidean four-space were found. We should also add that, 
starting from the work of Kaluza, Klein, Einstein and Wey] in the nineteen-twenties 
up until the present, there has been much effort devoted to searching for an ultimate 
geometrical model unifying all fundamental forces. This gave another strong impe- 
tus to the development of modern differential geometry and related fields. Some of 
the aspects just mentioned will be discussed in volume 2 of this book. 

To finish this brief historical introduction, we should make two further remarks. 
Firstly, it should be mentioned that in the process described above the concept of 
symmetry played a fundamental role. The creation of the mathematical foundations 
of this concept dates back to Lie, who in the eighteen-seventies developed a general 
theory of transformations.® Lie was influenced by the work of Galois on symmetries 
of polynomial equations, by the work of Jacobi on partial differential equations, and 
by Klein, whose aim was to unify geometry and group theory. The theory was es- 
sentially pushed forward by Killing and E. Cartan, who classified semisimple Lie 
algebras and developed their representation theory. The early period closes with 
the contributions by Weyl], who created the representation theory of semisimple Lie 
groups. It was also Weyl who first applied concepts of group theory to quantum 
mechanics. It goes without saying that the general theory of fibre bundles and con- 
nections and, consequently, also the theory of gauge fields, heavily rests on Lie 
group theory. 

Secondly, over the last few decades, it has become more and more clear that sym- 
plectic geometry plays a special role. This is not only due to the fact that there is 
merely a lot of applications of symplectic techniques in many areas of mathemat- 
ics and in physics. There is something more: a phenomenon which Arnold called 
symplectization, see [22], [111] and also [308]. Indeed, there seems to be growing 
evidence that many concepts, constructions and results from different branches of 
mathematics and mathematical physics (like the theory of partial differential equa- 
tions, the calculus of variations or the theory of group representations) can be recast 
in symplectic terms, finding in this way their ultimate ground. The theory of Hamil- 
tonian systems in its modern form is of course still one of the most prominent exam- 
ples. Via Hamilton-Jacobi theory there is a close link to the theory of linear partial 
differential equations. Here, representing a differential operator on a manifold by 
its symbol on the cotangent bundle and seeking solutions in terms of Lagrangian 
immersions and geometrical objects living on them, one arrives at a symplectized 


8 See [182-184]. For a historical overview on Lie group theory we refer to [50] and [126]. 


Vili Introduction 


theory of first order partial differential equations.’ In a similar spirit and in close 
relation, large parts of the theory of singularities have been symplectized. Another 
beautiful example is provided by the orbit method of Kirillov, Kostant and Souriau, 
which constitutes one of the cornerstones of geometric quantization. For a large 
class of Lie groups, this method yields a bijective correspondence between irre- 
ducible unitary representations of a Lie group and transitive symplectic actions of 
this group. 

We conclude with a few remarks on the structure and the contents of this vol- 
ume. It contains three building blocks, each consisting of four chapters. In the first 
four chapters, we present the calculus on manifolds. The next four chapters are de- 
voted to the theory of Lie groups and Lie group actions and to an introduction to 
linear symplectic algebra and symplectic geometry. These chapters constitute the 
link between the abstract calculus and the theory of finite-dimensional Hamiltonian 
systems, which we develop in the final four chapters. There, we had to make a rea- 
sonable selection of the topics to be presented. It is probably fair to say that our 
choice of material was made more from a physicist’s point of view,!° thus, putting 
emphasis on the concepts of symmetry and integrability and on Hamilton-Jacobi 
theory. At the same time, this means that we had to exclude a lot of interesting 
topics like, for instance, equivariant Hamiltonian dynamics or variational methods. 
Since each chapter has its own introduction, here we omit a detailed description of 
the contents. 

We assume that the reader is familiar with elementary algebra and calculus, as 
well as with the basics of classical mechanics. Some knowledge in classical elec- 
trodynamics and in thermodynamics as well as in elementary set topology will be 
helpful. The book is self-contained, that is, starting with the theory of differentiable 
manifolds, it guides the reader to a number of advanced topics in the theory of 
Hamiltonian systems. At some points, we touch on current research. It is our strong 
belief that without detailed case studies a deep understanding of the abstract mate- 
rial can be hardly achieved. Thus, we have included many worked examples, some 
of them are taken up repeatedly. Moreover, at the end of almost every section the 
reader will find a number of exercises. 


°This is due to Maslov [197] and Hérmander [141], see Chap. 12 of this volume. In this context, 
the symplectization of Morse theory plays a basic role, see Sects. 8.9 and 12.4. 


!0Tp particular, this means that we do not go into advanced topics related to symplectic topology. 
However, at some points we touch on it. For a thorough presentation of symplectic topology we 
refer to the textbooks of Hofer and Zehnder [139] and of McDuff and Salamon [206]. 
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Chapter 1 
Differentiable Manifolds 


In this chapter, we introduce the basic notions of the theory of manifolds. We start 
with the very notion of a manifold, illustrate it by a number of examples and discuss 
the class of examples provided by level sets in some detail. Next, we carry over the 
notions of differentiability, the concept of tangent space and the notion of deriva- 
tive of a mapping from classical calculus to the theory of manifolds. In this context, 
we also generalize the basic theorems of classical calculus to the case of mani- 
folds. Finally, we discuss some more advanced topics needed later on: the notion 
of submanifold and the concept of transversality. Since the notion of a submanifold 
is quite subtle, we treat this subject in detail. In our terminology, a submanifold is 
defined by an injective immersion. There are two important special classes of sub- 
manifolds showing up in various applications. They are called embedded and initial, 
respectively. Throughout the text, the reader will find a large number of illustrative 
examples. 


1.1 Basic Notions and Examples 


Manifolds are topological spaces which locally look like R”. Therefore, they allow 
for an extension of the notions of classical calculus. For the topological notions used 
in this section we refer the reader to the standard literature, e.g., [53], [55], [199] or 
[267]. 

In the sequel, we will use the following notation. An element x € R” is an n-tuple 


written as x = (x1,...,X,). The Euclidean scalar product on R” is denoted by 
n 
x y=) xy 
i=1 
and the corresponding norm is denoted by || - ||. For the standard basis in R” we 
write {e],...,€,}. The dual basis {e*!,..., e*”} in R’™ is defined by e*! (ej) =5';. 
For x € R”, its coefficients in the standard basis are given by 
x! =e (x), 
G. Rudolph, M. Schmidt, Differential Geometry and Mathematical Physics, 1 
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Thus, we have x = ei x'e;. Of course, numerically, the numbers x’ and x; coin- 
cide. If there is no danger of confusion, we will use the Einstein summation conven- 
tion, that is, we will also write x = x'e;. 


Definition 1.1.1 (Topological manifold) A topological space M is called a topo- 
logical manifold if it is Hausdorff, second countable and locally homeomorphic to 
R” for some fixed n € N. This means that for every m € M, there exists an open 
neighbourhood U of m in M and a mapping «: U > R” such that «(U) is open in 
R” and k is ahomeomorphism onto its image. The pair (U, «) is called a local chart 
for M. A family <& = {(Uy, Ky) : a € A} of local charts satisfying a Uy = M 
is called an atlas for M. The number n is called the dimension of M. 


Let (U,«) be a local chart on M. If m € U, we say that (U, «) is a chart at m. 
The functions 


Ks=eokx:U>R, 1l<ix<n, 


define a system of local coordinates on U. The numbers «'(m) are called the local 
coordinates of m in the chart (U, «). In particular, (R”, idg-) is a global chart for 
IR”, that is, the identical mapping endows R” with the structure of a topological 
manifold. The corresponding coordinates are x! (x) = x!. 


Remark 1.1.2 


1. Since R” is not homeomorphic to R” for n ¢ m, the dimension of a topological 
manifold is unique. 

2. As a consequence of being second countable, topological manifolds can have at 
most countably many connected components. Moreover, every point possesses a 
countable neighbourhood basis, so that one can use sequences to test topological 
properties like continuity of mappings or closedness of subsets. 

3. As a consequence of being locally homeomorphic to R”, topological manifolds 
inherit all the local properties of IR”. That is, they are locally compact (every 
point has a compact neighbourhood), locally connected (every point admits a 
neighbourhood basis of connected open sets), etc. 


Let (Uj, «1) and (U2, k2) be local charts on M. If U; MN U2 4 ©, the mapping 
kok, |: R™ D K1(Uy N U2) > ko(U, N U2) CR", 


pictured in Fig. 1.1, and its inverse «1 0 Ky ' are called the transition mappings 
of (U;,«,) and (U2, k2). The transition mappings are homeomorphisms. Since 
kK1(U, MN U2) and K2(U; M U2) are open subsets of R” it makes sense to ask whether 
the transition mappings are differentiable. 


Definition 1.1.3 Let M be a topological manifold and let k be a nonnegative integer 
or Oo. 
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Fig. 1.1 Two local charts and one of their transition mappings 


1. Two local charts (U;,«,) and (U2,2) are compatible of class C* if either 
U; M U2 is empty or their transition mappings x, 0 Ky ' and x20 Ky ' are of 
class C*. 

2. Anatlas of for M is of class C* if any two charts are compatible of class C e 

3. Two atlases and &% of class C* are equivalent, if any chart of .< is compat- 
ible of cles C* with any chart of .4; equivalently, if their union is an atlas of 
class C*. 


Point 3 defines an equivalence relation on the class of all C*-atlases for M. 


Definition 1.1.4 (Differentiable manifold) An equivalence class of atlases of class 
C* for a topological manifold M is called a differentiable structure of class C¥, 
or just a C*-structure, on M. A topological manifold together with a differentiable 
structure of class C# is called a differentiable manifold of class C*, or just a C*- 
manifold. 


By definition, a C°-manifold is just a topological manifold. The property to be 
of class C® will be referred to as smooth. While in Chaps. | and 2 the general 
C*-case is treated, starting from Chap. 3, everything will be assumed to be smooth, 
with occasional exceptions if necessary. 


Remark 1.1.5 


1. Let / <k. It is obvious that if two local charts are compatible of class C*, they 
are also compatible of class C’. Consequently, an atlas of class C* is also an atlas 
of class C!. The reader may convince himself that a Ck-manifold can be viewed 
as a C!-manifold if necessary. 

2. By analogy, one defines the notion of real and complex analytic structure and 
manifold. For a real analytic structure, the transition mappings are required to 
be real analytic. For a complex analytic structure, the local charts are assumed 
to take values in C” for some n and the transition mappings are assumed to be 
holomorphic. 


4 1 Differentiable Manifolds 


To define a differentiable structure it suffices to construct an atlas. Since the union 
of arbitrarily many equivalent C‘-atlases is a C‘-atlas, in each equivalence class 
there is a unique maximal atlas. Since for any chart (U,«) and any open subset 
V CU, (V,«yyv) is a local chart and since all these induced local charts are com- 
patible of class C* with one another, all of them are contained in the maximal atlas. 


Example 1.1.6 (Linear spaces) For R” itself, the single chart (R”, idg:) provides 
an atlas of class C®. The differentiable structure defined by this atlas is called the 
standard smooth structure of R” and the corresponding coordinates are called the 
standard coordinates on R”. Similarly, for an arbitrary open subset U C R”, the 
natural inclusion mapping ty : U > R” provides a single chart and hence defines a 
smooth structure. It is clear that all of this carries over to arbitrary finite-dimensional 
real vector spaces. 


Example 1.1.7 (Chart domains) Let M be a manifold of class C K and let (U, «) bea 
local chart on M. Then (U, «) is a global chart on U, thus providing a smooth atlas 
on U. Hence, independently of the differentiability class of M, U together with the 
atlas {(U, «)} is asmooth manifold. It has the same dimension as M. 


Example 1.1.8 (Open subsets) Let M be a manifold of class C* with atlas ./ and 
let W Cc M be an open subset. If (U, «) is a local chart on M with UMW 4 @, then 
U1 W is open in W, «(UN W) is open in R” and kpynw: UN W > K(UN W) is 
a homeomorphism. Therefore, 


dy = {(UNW, kunw): (U.K) € #,UNW 49} 


is an atlas for W. Since the transition mappings of this atlas are obtained from those 
of / by restriction to open subsets, they are of class C* again. Thus, the atlas Ay 
defines a differentiable structure of class C* on W and hence W is a differentiable 
manifold of the same class and the same dimension as M. 


Example 1.1.9 (Spheres) The sphere S” is the set of solutions of the equation 
||x||7 = 1 in the variable x € R’*!. In the relative topology induced from R”*!, 
S” is Hausdorff and second countable. We construct an atlas by means of stereo- 
graphic projection. Fix an arbitrary point eg € S”, put Uz := S” \ {eo} and define 
mappings x4 : Ui —> R” by 


xX — (X-e@9)e 
es eer. 
1+=x-e9 


a 


where we have identified IR” with the hyperplane of R”*+! orthogonal to eo, see 
Fig. 1.2. The mappings «+ are obviously bijective, continuous and open, hence 
homeomorphisms. Since Ui U U_ =S", the sphere S” is a topological manifold. 
The transition mapping k_ o Co maps K+ (U4 1U_) = R” \ {0} to«_ (U4 NU_) = 
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Fig. 1.2. Stereographic 
projection 
eo x 
R"” 
y = k(x) 
en 

R” \ {0}. There holds 

—xX-@9 Ky (x) 


1 
a fal gc = Te OI2” 


and hence 


—1 y 
Kok, (y)=—\. 
. lly|l? 


This mapping is of class C®. Since K+ 0 kK! =K_o Ky the local charts (U1, K+) 
and (U_,«_) define a smooth structure on S”. 


Remark 1.1.10 (Differentiable structure induced by a family of mappings) For the 
topology of S” one may also choose the initial topology! induced by the mappings 
k4 :S”" — R". This has the advantage that «, and x_ are automatically homeo- 
morphisms then, because the transition mappings K+ 0 «_! and Ko Ky. are of 
class C® and hence in particular continuous. It is not hard to show that the initial 
topology induced by «* coincides with the relative topology~ induced from R"+! 
(Exercise 1.1.1). 

More generally, let M be a set and let n € N. Assume that we are given a count- 
able covering {U,} of M such that for every @ there exists an injective mapping 
Kq : Uy — IR" whose image is open. Assume further that the transition mappings are 
differentiable of class C*. (This makes sense, because the assumptions imply that 
their domains are open subsets of R”.) Equip M with the initial topology defined by 
the family of mappings {(Uq, Kq)}. This topology is second countable, because so 
is R” and the family is countable. This topology is also Hausdorff: let mg,mp € M, 
Mg ~ mp. If there is an a such that mg,mp € Ug, the assertion is obvious. Other- 
wise, let mq € Uy and mp € Ug. Since Ug is open, there exists a neighbourhood V, 
of mg in M whose closure V, in M is contained in Uy. Since mp ¢ Uy, Up \ Ve is 
a neighbourhood of mp, and it does not intersect V,. This yields the assertion. 


'The coarsest topology such that both «; and «_ are continuous. 


>The coarsest topology such that the natural inclusion mapping S” + R”*! is continuous. 


6 1 Differentiable Manifolds 


Continuity of the transition mappings ensures that the (Uy, Kq) are local charts 
on M. Since the U, cover M and since the transition mappings are of class C*, 
these charts establish a C‘-atlas on M. The corresponding C*-structure is said to be 
induced by the family of mappings {(Uq, Kq)}. 


Example 1.1.11 (Polar and spherical coordinates) The polar coordinates r, @ in the 
plane and the spherical coordinates r, 7, @ in R? are curvilinear coordinates which 
are often used in physical problems with rotational symmetry in order to simplify 
calculations. The polar coordinate chart is defined as the inverse of the mapping 


Ry x (—2, 1) > R’, (r,@)  (rcos¢,rsing). 


Hence, the chart domain is U = R? \ {Ae; : A < 0}. This chart is smoothly com- 
patible with the standard smooth structure on R?. By restriction to S! Cc R?, the 
coordinate function ¢ yields a local chart which is smoothly compatible with the 
smooth structure discussed in Example 1.1.9. The spherical coordinate chart is de- 
fined as the inverse of the mapping 


Ri x (0, 7) x (-2,m7) > R?, (r, 3, od) (rsin’ cos¢,r sind sing, rcos?). 


Hence, the chart domain is U = R? \ {Ae, + we3 : A <0, € R}. This chart is 
smoothly compatible with the standard smooth structure on R?. By restriction to 
S* C R?, the two coordinate functions ¢ and # yield a local chart on the sphere S* 
which is smoothly compatible with the smooth structure discussed in Example 1.1.9. 


Example 1.1.12 (Mobius strip) Let M be the topological quotient of the open subset 
R x (-1, 1) C R* by the equivalence relation 


(51,1) ~ (82, fa) iff (82, 2) = (1 + 27k, (—D*n) for some k € Z. 


M is called the Mobius strip. Let p : Rx (—1, 1) > M denote the natural projection. 
As the quotient of a second countable space, M is second countable. It is Hausdorff: 
for m,,m2 € M, define 


d(my,mz) = inf{ V5 — 51)? + ( — 1)? = (s;,.1) € pom), = 1,2} 1) 


and show that d is a metric on M, compatible with the quotient topology (Exer- 
cise 1.1.3). To construct an atlas, we show that p is open. For every k € Z, the 


mapping 
go: Rx (-1,1I) > Rx (-1,)), oK(S, t) = (s + 27k, (—1)*t), 


is ahomeomorphism. If O C R x (—1, 1) is open, then g,(O) and hence 


p'(p(O)) = (J (0) 


keZ 
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is open. Therefore, p(O) is open. In particular, the open subsets 
V, := (—7, 7) x (-1, 1), V2 := (0, 277) x (—1, 1) 


of R x (—1, 1) project to open subsets U; := p(V;) of M. Since the restrictions of 
p to V; and V2 are injective, the mappings pry, : Vi > U; are homeomorphisms. 
Then, (U;, «;) with Kj := (piv)! are local charts on M. Due to M = U; UU, they 
form an atlas, thus turning M into a topological manifold. 

Finally, we check differentiability of the transition mapping K2 0 kK, it maps 
a point (s,f) in K}(U; N U2) = {(s, t) € Vi : s £0} to the unique representative of 
the class of (s,t) in K2(U; MN U2) = {(s,t) € Vo: s Ar}. Thus, on the connected 
component s < 0 of «;(U; M U2), the transition mapping k2 o Ki maps (s,¢) to 
(s + 27, —t), whereas on the connected component s > 0 it is given by the identical 
mapping. On both components it is of class C™. Therefore, the atlas just constructed 
turns M into a smooth manifold. 

In the course of this book, the Mobius strip will turn up again at several places, 
notably as an example of a vector bundle (Chap. 2) and as an example of the quotient 
of a group action (Chap. 6). 


Remark 1.1.13 (Quaternions) As a preparation for the examples to follow, let us 
recall the definition of quaternions. Let H be the real vector space spanned by the 
basis elements 1, i, j and k. H carries an associative multiplication which is defined 
by the relations 


rv =j =k =-1, ij=—ji=k, jk=—kj=i, ki = —ik =j. 


This way, H becomes a real associative unital algebra with unit element 1. As such, 
it is generated by any two out of the three quaternionic units i, j and k. Since any 
nonzero element has an inverse, HI is a division algebra (or, when viewed as a ring, 
a division ring or skew field). The assignment 


1 0 . i 0 F 0 1 0 i 
tae il in| el in [9 ab k-|\ i (1.1.2) 


extends to an injective homomorphism of real algebras from H to M2(C), the alge- 
bra of complex 2 x 2-matrices. The mapping 


X= X01 4+ x1i4+ x2j +. 23k bh X:= x01 — xi — x2j — x3k, 
where the x; are real, is an algebra involution of H, called quaternionic conjugation. 


Example 1.1.14 (General linear group) Let M, (IK) denote the algebra of n x n- 
matrices with entries from K = R, C or H and let GL(n, K) denote the subset of 
invertible matrices. GL(n, K) is a group and M,,(K) is a real vector space of di- 
mension n” for K = R, 2n? for K= C and 4n? for K = H. Using the fact that the 
operator norm on M,, (K) satisfies || A B|| < || A]|||B|| for all A, B ¢ M, (K), one can 
show that GL(7, K) is open in M,,(K), see Exercise 1.1.5. In case K= R or C, 
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openness also follows from the fact that, here, GL(n, KX) is the preimage of the open 
subset K \ {0} C K under the determinant mapping 


det : M, (K) > K, 


because the determinant is n-linear and hence continuous. As a consequence of 
Examples 1.1.6 and 1.1.8, GL(m, K) is a differentiable manifold of dimension 
n> - dime K. According to Example 1.1.8, global charts are given by the mappings 


GL(n, R) > R”, ar (a';) 
GL(n, C) > R””, ay + ia ((a1)';, (a2)'s) 
GL(n, H) > R*”, ay1+ anit azj+agk ((ay)! ;, (a2)! j, (a3) j, (a4) ;), 


where a; € GL(n, R) and a! j are the matrix entries of a.> The mapping 
GL(n, K) x GL(n, K) > GL(n, K), (a,b) ab}, (1.1.3) 


is smooth, because with respect to the above global chart, it is given by a system of 
rational functions with nonvanishing denominators. This means that GL(n, K) is a 
Lie group, cf. Chap. 5. 

Let us finish this example with a remark on the topological structure of GL(7, R). 
Since det : GL(n, R) > R is continuous and R \ {0} consists of the connected com- 
ponents x > 0 and x < 0, GL(n, R) decomposes into the open subsets 


GL(n, R)s = {a € GL(n, R) : + det(a) > 0}. (1.1.4) 


Both are manifolds of dimension n2. Due to 


-1 0 


GL(n, R)_ = 0 1 


-GL(@, R)+, 

1 

they are homeomorphic. One can show that GL(n, R)+, and hence also GL(n, R)_, 
is connected. Thus, GL(n, R); and GL(m, R)_ are the connected components of 
GL(n, R). In contrast to that, GL(n, C) is connected. For proofs, see Exercise 5.1.9 
in Chap. 5 or, for example, [129]. 


Example 1.1.15 (Projective space) Let K = R, C or H. We use the notation 
Ki = {xe K":x #0}, Kj := {xe K": ||xll = 1}, 


where || - || denotes the norm of the natural scalar product x'y = a1 wiyi- Here 
denotes the natural involution of K, that is, the identical mapping for K = R, com- 
plex conjugation for IK = C and quaternionic conjugation for K = H. The projective 


3Usually, we will denote elements of an algebra by capital A, B, C,... and elements of a group by 
small a,b,c,.... 
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space KP” is defined as the topological quotient of Ket by the equivalence rela- 
tion x ~ y iff y = x for some 4 € K,.* Its elements correspond to one-dimensional 
K-subspaces of K’+!. Let p: K"*+! + KP” denote the natural projection to equiv- 
alence classes. First, we show that KP” is Hausdorff. Let m, 4 mz be elements of 
KP”. Choose representatives x, y € kr with ||x|| = 1 = |ly||, denote 


I(x, y) = min ||xa — y|| 
AeEKy 
and consider the subsets 


Z 
—h—x 


Kx:= {2 € K"*! : max zi 
Z 


AEK; 


1 
< 5lx, »} ck 


and Ky, defined analogously. Since K; is compact, the maxima and the minimum 
exist. Due tom, # m2, the minimum is nonzero. The subsets Ky and Ky are disjoint 
and open unions of equivalence classes. Therefore, p(Kx) and p(Ky) are disjoint 
neighbourhoods of m, and mz in KP”, respectively. Next, we construct local charts. 
To this end, we observe that if an equivalence class possesses a representative with 
x; #0 for a given i, then it possesses a representative with x; = 1 and the latter is 
unique. Hence, if we put 


Vj = {xe K*!: x; = 1}, Uj := p(Vi), 


then each element of U; has a unique representative in V; and, by restriction, p 
induces bijections V; — U;. By inverting these mappings and by identifying V; in 
the obvious way with K”, we obtain bijective mappings x; : U; > K”". We show 
that (U;,«;) are local charts. Since their inverses are given by p and hence are 
continuous, we only have to show that U; is open and that «; is continuous. To this 
end we put W; := {x € K?*+! : x; 40} and consider the mapping 
x; 2 W; > K" x Ky, Xi(X) = (Gaia's i inact ma) 

Let A C K” be open. There holds pik (A) = x) (A x K,.). Since x; is con- 
tinuous, Pe (A)) is open in W; and hence open in K7+1, Then, Kk! (A) is open 
in KP”. This implies, first, that U; is open and, second, that «; is continuous. Thus, 
for each i, (Uj, kj) is a local chart on KP”, indeed. The corresponding coordinates 
are called homogeneous. Since the U; cover KP”, the local charts (Uj, «;) provide 
an atlas of KP” and thus equip KP” with the structure of a topological manifold. 
We leave it to the reader to check that the transition mappings are smooth (Exer- 
cise 1.1.6). 


Remark 1.1.16 The groups GL(n, K) are examples of Lie groups. The projective 
spaces IKP” are examples of homogeneous spaces. Both Lie groups and homoge- 
neous spaces will be treated in detail in Chap. 5. 


4We adopt the convention that scalars multiply from the right; this is of course relevant for K = H 
only. 
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To conclude this section, we discuss the operations of direct product and di- 
rect sum. Let M, and M2 be differentiable manifolds of class C¥ and dimen- 
sions n; and nz. Let % be atlases for M;. The direct product of topological spaces 
M, x Mz is Hausdorff and second countable. Any pair of local charts (U1, 1) € 4 
and (U2, k2) € & defines a local chart on M, x M2 by (U, x U2, Kk, X k2), where 


1 X k2: Uy x U2 > RT”, (1 x K2)(m1, m2) := («1 (m1), K2(m2)). 


This way, the direct product of atlases <4 x & provides an atlas for My x Mp. 
Since for the transition mappings there holds 


(k1 X kz) 0 (p1 X p2) | = (K1 op; ') x (k2 0 p;'), 


the local charts are compatible of class C kK Hence, M x Mo, together with the atlas 
I, x Gy, isa differentiable manifold of class Cé and dimension nj +n. It is called 
the direct product of M; and M2. 

Now let n} =n2 =n. The direct sum of topological spaces Mj Li M2 is Hausdorff 
and second countable. The disjoint union of atlases <4 Ui .& yields an atlas for 
M, U Mo. Since the domains of charts from 4 do not intersect the domains of 
charts from —&, this atlas is of class C*. Hence, Mj U M2 together with the atlas 
I, \1 By is a differentiable manifold of class C kK and dimension n. It is called the 
direct sum of M, and Mo. 


Example 1.1.17 


1. The n-torus T” = §! x -”. x S! is a smooth manifold of dimension n. The cylin- 
der S! x R is a smooth manifold of dimension 2. 

2. As a manifold, the general linear group GL(n, R) is the direct sum of the mani- 
folds GL(n, R)+ and GL(n, R)_, see Example 1.1.14. 


Exercises 

1.1.1 Show that the initial topology of S” induced by stereographic projection with 
respect to an arbitrary point and its antipode coincides with the relative topol- 
ogy induced from R”*!, 

1.1.2 Prove the statements of Example 1.1.11. 

1.1.3. Show that (1.1.1) defines a metric on the Mobius strip which is compatible 
with the quotient topology, see Example 1.1.12. 

1.1.4 A model of a MGbius strip can be produced by gluing the ends of a long 
narrow paper strip, with one end twisted by an angle of 180 degrees. Consider 
the strip one obtains when one end is twisted by 360 degrees instead. A strip 
with this twisting is also obtained by cutting a Mobius strip along the middle 
line. Show that this strip is homeomorphic to the untwisted strip. Why can’t 
the strip be untwisted though? 

1.1.5 Prove that GL(n, K) is open in M, (KK), see Example 1.1.14. 

Hint. For A € GL(n,K) define U4, := {B € M,(R) : ||A — Bll < 
|| A~!||-!}. Show that U, is a neighbourhood of A in M,(K) and that for 
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Fig. 1.3, Level Set Theorem 


every B € Us, the series Yeo — A~!B)*A~! converges absolutely and 
the limit is inverse to B. 
1.1.6 Verify that the local charts of Example 1.1.15 are smoothly compatible. 
1.1.7 Show the following. 
(a) KP! is homeomorphic to S#™® ®, 
(b) RP? is homeomorphic to the space obtained by contracting the boundary 
of a Mobius strip to a point. 
Hint. Study how the points of the subset {x € R} : a7 + < = 1, |x3| <a} 
of R3 get identified via the defining equivalence relation of RP?. Then, 
leta—> ow. 


1.2 Level Sets 


Level sets at regular values of differentiable mappings of IR” provide a great variety 
of examples for manifolds. Let us start this section with recalling some terminology. 
Let f : R” > R” bea differentiable mapping of class C, k > 1. Anelement x € R” 
is called a regular point of f if f’(x) has rank m; otherwise x is called a critical 
or singular point. An element c € R” is called a regular value of f if all points 
x € f—'(e) are regular; otherwise c is called a critical or singular value. If ¢ is a 
regular value, then necessarily n > m. 


Theorem 1.2.1 (Level Set Theorem) Let U C R” be open, let f : U > R” bea 
differentiable mapping of class C¥ and let ¢ € R be a regular value of f such that 
the level set M := f—'(e) is nonempty. Then, M is a differentiable manifold of class 
C* and dimension n — m. 


Proof Let xo € M. We define 


Xo :=ker f’(xo) = {x € R": f’(xo) (x) =0} CR’. 
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Since f’(xg) has rank m, Xo is an (n — m)-dimensional subspace of R”. Let x 
denote the orthogonal complement of Xo with respect to the Euclidean metric and 
let po: R” — Xo be the orthogonal projection to Xo, see Fig. 1.3. We show that po 
defines a local chart on M. The mapping 


h:Xox Xi > RR", h(y, Zz) := f(%o +y+zZ) —¢, 
is of class C¥ and satisfies h(0, 0) = 0. Since for any WE Xa 
h'(0, 0)(0, w) = f’ (xo) (w), 


the equality 4’(0, 0)(0, w) = 0 implies w € Xo and hence w = 0. According to the 
Implicit Function Theorem, there exists an open neighbourhood V of 0 in Xo and 
a mapping F: V > <T of class C* such that h(y, F(y)) = 0 for all y € V. Then, 
xo + y+ F(y) € M for all y € V, so that we obtain a continuous mapping 


A:V>M, — Ky) :=xXo+y+ Fy), 
see Fig. 1.3. Let U := A(V) and define the mapping 
K:U>V, K(X) := po(K — Xo). 


Check that VU = MN (x9 + Po (V)), kK oA=idy and dA ox = idy. It follows that 
U is open in M and k is a homeomorphism, that is, (U,«) is a local chart on M. 
Application of this construction to each x € M yields an atlas for M. We check that 
the local charts are compatible of class C K let (U ,K) be the chart associated with 
Xy € M and assume UU 4 @. The transition mapping is given by 


KOK 


7 1 ee te 
Xo DK(UNU) —> KUNU)CXo, yk> po(xo +y + Fy) — Xo). 


As a composition of F with affine mappings it is of class C*. 


Remark 1.2.2 


1. The subspace Xo = ker f’(xo) is spanned by the tangent vectors of differentiable 
curves in M passing through xo. It is, therefore, called the tangent space of M 
at xq. By shifting this subspace to xo one obtains an affine subspace, the tangent 
hyperplane, which is tangent to M in xp and whose vector space of translations 
is given by the tangent space Xo, see Fig. 1.3. 

2. In the case m= 1, f : U > R is a C*-function and f’(x) is the gradient of f 
at x. Since the rank can be at most 1, a point x € U is regular iff f’(x) 4 0. Thus, 
c € R is a regular value of f and hence the level set f~!(c) is a Cé-manifold iff 
the gradient does not vanish on f~!(c). 

3. Theorem 1.2.1 and the above remarks generalize in an obvious way to the situa- 
tion where U is an open subset of a finite-dimensional real vector space W and 
f takes values in a finite-dimensional real vector space V. We also note that, as 
a consequence of the proof, there exists an atlas of the level set M C W whose 
local charts (U, «) fulfil the following conditions: 
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c>0 c=0 c<0 


Fig. 1.4 The level sets of Example 1.2.4 


(a) x is the restriction to U of a linear mapping W > R&™”, 

(b) the mapping 10 x~!:«(U) > W, where: M — W is the natural inclusion 
mapping, is of class C*. 

It follows that, given level sets Mj C W;, i = 1,2, of C*-mappings at regular 

values and a C‘-mapping ® : W, + W2 satisfying ®(M)) C Mp, the induced 

mapping M; — M) is of class C* (Exercise 1.2.1). 


Example 1.2.3 (Spheres) The sphere S” is the level set of the function 
fi: R SR, fo = Ix? 


at the value c = 1, cf. Example 1.1.9. Since x = 0 is the only singular point of f, 
the assumptions of Theorem 1.2.1 are satisfied. Therefore, this theorem yields a 
smooth structure on S”. One can show that this structure coincides with the one 
constructed by means of stereographic projection in Example 1.1.9 (Exercise 1.2.4). 
For x € S”, the subspace ker f’(x) consists of those vectors which are orthogonal 
to x. By shifting this subspace to x, one obtains the tangent plane of the sphere at 
this point, indeed. 


Example 1.2.4 (Hyperboloid) We consider the function 
f: RR, F(x) = x2 +23 — x2. 


The only singular point of f is x = 0. Hence, the only singular value of f is c= 
f (0) = 0; all c #0 are regular. We determine the level sets M = f—!(c). Since f 
is invariant under rotations about the x3-axis, M is the surface of revolution of the 
curve x, a a =c in the x1—x3-plane. This is a hyperboloid which is one-sheeted 
for c > 0 and two-sheeted for c < 0, see Fig. 1.4. In both cases, Theorem 1.2.1 
yields a smooth structure. As a link between these two cases, for the singular value 
c =0, M is a double cone. In this case, Theorem 1.2.1 does not apply. In fact, the 
double cone is not a topological manifold (Exercise 1.2.2). In the case c < 0, using 
xo = (0, +./c, 0), one obtains a global chart for each sheet. Since the domains of 
these two charts do not intersect, this atlas is automatically smooth. In the case 
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c > 0, one needs at least three charts of the type used in the proof of the Level Set 
Theorem. We leave this case to the reader (Exercise 1.2.3). 


Remark 1.2.5 In contrast to the double cone {x € R?: ae + a _ ae = 0} of Ex- 
ample 1.2.4, the single cone {x € R?: a + x = Ea = 0, x3 > 0} is a topological 
manifold, and even a manifold of class C°. Indeed, orthogonal projection to the 
x —x2-plane yields a global chart. 


Example 1.2.6 (Classical groups) Let K= R, C, H. Forn,m=0, 1, 2,..., define 


[tn 0 0 4% 
Liam — 0 ea ’ Sn — ce ‘l 


and recall the following inner products on K”: 
n n 
Ky=)ouy, Sy=)oay, xi, EK. 
i=l i=l 


As before, x; denotes the natural involution on K. By a classical group one means 
the general linear group GL(n, K) or one of the following subgroups. 


1. The unimodular group SL(n, K), K=R, C, is the group of linear mappings of 
unit determinant, 


SL(n, K) = {a € GL(n, K) : det(a) = 1}. 


2. The real orthogonal group O(n,m) C GL(n + m, R) is the group of isometries 
of the symmetric bilinear form x 1h, my on R'", 


O(n, m) = {a €GL(n+m,R): Dy wel = leis bs 


In case m = 0 one writes O(n) = O(n, 0); this is the group of isometries of the 
Euclidean scalar product. 

3. The special real orthogonal group SO(n,m) is the subgroup of O(n, m) of 
isometries with unit determinant, 


SO(n, m) = O(n, m) 1SL(n + m,R). 


In case m = 0, one writes SO(7). 
4. The complex orthogonal group O(n, C) C GL(n, C) is the group of isometries 
of the symmetric bilinear form x'y on C”, 


O(n, C) = {a € GL, C):a'a=1}. 


5. The special complex orthogonal group is the subgroup of O(n, C) of isometries 
of unit determinant, 


SO(n, C) = O(n, C) NSL(@, C). 
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6. The unitary group U(n,m) C GL(n + m, C) is the group of isometries of the 
Hermitian form x"1y,my on C”*", 


U(n, m) = {a e€GLn+m,C): a’1n ma = Lion (3 


In case m = 0 one writes U(n) = U(n, 0); this is the subgroup of isometries of 
the natural scalar product on C”. 

7. The special unitary group SU(n, m) is the subgroup of U(n, m) of isometries of 
unit determinant, 


SU(n, m) = U(n,m) NSL(n +m, C). 


In case m = 0 one writes SU(n) = SU(m, 0). 
8. The symplectic group Sp(n, K) C GL(2n, K), K=R, C, is the group of isome- 
tries of the antisymmetric bilinear form w(x, y) = x' J,y on K", 


Sp(n, K) = {a € GL(2n, K) : a" Jna = Jn}. 


9. The quaternionic symplectic group Sp(n,m) C GL(n + m, H) is the group of 
isometries of the Hermitian form x‘1,,y on H’"*”, 


Sp(2,m) = {a €GL(n+m,H): a'Inma = Hh oa te 


In case m = 0 one writes Sp(n) = Sp(n, 0); this is the group of isometries of the 
natural scalar product on H!”. 


By writing down defining relations and applying the Level Set Theorem one 
can show that all the classical groups are smooth manifolds (Exercise 1.2.6). As an 
example, we carry out the proof for the groups O(n) and SO(n). 

Let S,,(R) denote the linear subspace of M,,(R) of symmetric matrices. O(n) is 
the level set at the value c = 1 of the mapping f: M,(R) > S,(R), f(a) :=a'a. 
Hence, for O(7) it suffices to show that 1 is a regular value of f. To do so, we 
calculate the derivative f’ of f ina € M,(R): 


f'(a(X) =a'X + XTa, XEM,(R). 


If f(a) = 1, then for any B € S,,(R) there holds f’(a)(aB) = 2B. This shows that 
f' (a) is surjective for alla € f~!(4), that is, 1 is a regular value of f, indeed. Then, 
for SO(n) it suffices to show that it is open in O(7). Since for any a € O(n) there 
holds det(a”) = det(a'a) = 1, we have det(a) = +1. Hence, 


SO(2) = O(2) N GL(m, R)+, 


cf. Example 1.1.14. Now, the assertion follows from the fact that GL(7, R)+ is open 
in M,(R). 

Finally, the Level Set Theorem also yields the dimensions of the classical groups. 
For example, since S, (IR) has dimension 5n(n +1), O(m) has dimension 


Sh 
nh ——-—n\n = -n\n — F 
2 2 
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The same is true for the open subset SO(7). It turns out that the groups O(n, m) 
and SO(n, m) have the same dimension as O(n + m). The dimensions of the other 
classical groups are 


SL(7,R): n?—1, SL(n,C): 2n? —2, 
O(n, C), SO(n, C): n(n 1), U(n,m):  (n+m)’, 

SU(n,m): (n+m)? -1, Sp(n,R): nQn+1), 

Sp(n,C): 2n(2n +1), Sp(n,m):  (n+m)(2(n +m) + 1). 
Remark 1.2.7 


1. 


Let G C GL(n, K) denote one of the above classical groups. With respect to the 
smooth structures provided by the Level Set Theorem, the mapping G x G > G, 
given by (a,b) +> ab™!, is smooth. This follows from Remark 1.2.2/3 and from 
the smoothness of the mapping (1.1.3). We conclude that the classical groups are 
Lie groups and, in addition, Lie subgroups of GL(n, KX), cf. Chap. 5. 


. One has O(n) \ SO(n) = O(n) NGL(m, R)_, and this subset is also open in O(). 


Hence, SO(7) is closed in O(7). One can show that SO(n) is connected; this is in 
fact part of the proof of the connectedness of GL(n, R)+, see the corresponding 
remark in Example 1.1.14. Thus, O(7) consists of the connected components 
SO(n) and O(n) \ SO(n). 


. Being level sets, the classical groups are closed subsets of GL(n, KK). As isome- 


try groups of scalar products, the groups O(n), U(m) and Sp(n) are compact, see 
Exercise 1.2.7. Since SO(n) and SU(n) are closed subsets of O(7) and U(n), re- 
spectively, they are compact, too. None of the other classical groups is compact. 


Exercises 


1.2.1 Show that the restriction of a C‘-mapping between vector spaces to level sets 


is of class C*, cf. Remark 1.2.2/3. 


1.2.2 Show that the double cone {x € R?: a + a = xe = 0} is not a topological 


manifold, cf. Example 1.2.4. 


1.2.3 Using the method of the proof of the Level Set Theorem, construct an atlas 


for the one-sheeted hyperboloid of Example 1.2.4. 


1.2.4 Show that the smooth structure on S” provided by the Level Set Theorem 


coincides with the smooth structure constructed by means of stereographic 
projection in Example 1.1.9. 


1.2.5 Show that the following level sets are smooth manifolds and construct at- 


lases: 
(a) the paraboloid M = {x € R*: x3 = ae + eran 


2 2 2 
(b) the ellipsoid M = {xe 3: $4+343=1), 
1 2 3 


(c) the rotational torus M = {x € R?: (,/x2 + x2 — a)* + x? = b*}, where 
1 2 3 
O0<b<a. 


1.3 Differentiable Mappings 17 


Fig. 1.5 Local representative of a continuous mapping between manifolds. Here, 
U=UN®@ (Vv) 


1.2.6 Use the Level Set Theorem 1.2.1 for showing that the classical groups listed 
in Example 1.2.6 are smooth manifolds. 

1.2.7 Show that the isometry group of the natural scalar product on K” is compact. 
Hint. Choose an appropriate norm on M,, (IK), K = R, C, H, and show that the 
subset of isometries is bounded. 


1.3 Differentiable Mappings 


In this section, we carry over the notion of differentiability from classical calculus 
to the theory of manifolds. Let M and N be C*-manifolds and let 6 : M —> N be 
a continuous mapping.’ As usual, we will refer to M, to N, to the elements of N 
and to ®(M) as, respectively, the domain, the range, the values and the image of 
@. Let (U,«) be a local chart on M and let (V, e) be a local chart on N such that 
UN @—!(V) #@. The mapping 


Dep :kK(UN®|(V)) > p(V), Dy (x) = poPok |(x), 


pictured in Fig. 1.5, is called the local representative or the local representation of 
@ with respect to the charts (U,«) and (V, ¢). Since by continuity of @, @-'(V) 
is open, the local representatives are mappings between open subsets of R&™™ and 
R‘™ Hence, it makes sense to ask whether they are differentiable. 


Definition 1.3.1 (Differentiable mapping) Let M and N be C*-manifolds. A map- 
ping ®: M > N is called differentiable of class C*, or just a C‘-mapping, if it is 
continuous and if for any pair of local charts (U, «) on M and (V, e) on N such that 
UN @~!(V) ¥@, the local representative ®,,p is of class C*. The set of all such 
mappings is denoted by C*(M, N). 


In this book, mappings between manifolds are usually denoted by capital Greek letters like 
@,W,... or small Greek letters like g, w, x,.... 
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This definition yields the notion of C‘-mapping of C!-manifolds for any / > k by 
viewing the C!-manifolds as Ck-manifolds according to Remark 1.1.5/1. By anal- 
ogy, one defines the notions of real analytic mapping between real analytic mani- 
folds and of complex analytic mapping between complex analytic manifolds by re- 
quiring the local representatives to be real or complex analytic, respectively. Contin- 
uing previous terminology, C°-mappings will be referred to as smooth mappings. 


Remark 1.3.2 


1. An equivalent definition of differentiability is the following. A mapping @ : 
M — N is differentiable of class C* iff there exist atlases {(U;,«;):i € 1 } on 
M and {(V;, 0;): j € J} on N such that for every i € J there is j € J such that 
®(U;) C V; and the local representative Px;,p; * K(U;) > e(V;) is of class Cc 
Note that, here, continuity of f need not be required. Indeed, for every pair of 
charts (U;, «i), (Vj, 0;) such that ®(U;) C Vj we have ®py, = ss © Dx; p; OKis 
which is continuous as a composition of continuous mappings. 

2. A mapping between open subsets of finite-dimensional vector spaces, endowed 
with the standard smooth structure (Example 1.1.6), is of class C k in the sense of 
Definition 1.3.1 iff it is of class C* in the sense of classical calculus. To see this, 
choose global charts corresponding to two chosen bases. In particular, multilinear 
mappings between finite-dimensional real vector spaces are smooth. 

3. Let (Uj, «;) and (V;, p;), i = 1,2, be local charts on M and N, respectively, such 
that 


W:=ULNWNGN(ViNVW) #2. 
Then, for any m € W, 


Puy, py) (mM) = (p2 < p;') © Py ,p; © («1 5 «z')(m). 


This shows that the local representatives with respect to two different pairs of 
charts are related via the transition mappings between these charts. In particular, 
in order to decide whether a continuous mapping ® : M > N is differentiable it 
suffices to test the local representatives with respect to arbitrary but fixed atlases. 
4. Consider the case N = R with the standard smooth structure. A C‘-mapping 
f :M = Ris called a C*-function. The space of C*-functions is denoted by 
C*(M). It carries the structure of a real associative algebra with operations 


(af + g)(m) :-=af(m) + gim), (f - g)(m) := f(m)g(m), 


where f, g € ck(M), aé€R,andme M. 

5. Consider the case where M = J C Ris an open interval with the standard smooth 
structure. A C‘-mapping y : I > N is called a C*-curve. We say that y is a 
curve through p € N if0€ J and y(0) = p. 

6. Let (U,«) be a local chart on the C*-manifold M. Consider the smooth mani- 
fold U with global chart (U,«). Viewed as a mapping from U to R” (with its 
standard smooth structure), « is smooth. Similarly, viewed as functions on U, 
the coordinate functions x! are smooth. 
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Definition 1.3.3 (Diffeomorphism) Let M and N be differentiable manifolds of 
class C*. A differentiable mapping ® : M > N of class Cé is called a diffeomor- 
phism of class C*, or just a C‘-diffeomorphism, if it is bijective and the inverse 
mapping is of class C*, too. If a diffeomorphism of class C* exists, one says that M 
and N are diffeomorphic of class C*, or just C'-diffeomorphic. 


As in the case of C‘-mappings in Definition 1.3.1, Definition 1.3.3 yields the 
notion of C*-diffeomorphism of C!-manifolds for all / > k as well. If the differen- 
tiability class of M and N is fixed, we will usually just speak of diffeomorphisms. In 
addition, we will use the following terminology. By a local diffeomorphism from M 
to N we mean a diffeomorphism from an open subset of M onto an open subset of 
N. By saying that a C’-mapping ® : M — N is locally a diffeomorphism we mean 
that for every point of M there exists an open neighbourhood U such that (UV) is 
open in N and the mapping U — ®(U) induced by @ is a diffeomorphism. 


Example 1.3.4 (Diffeomorphic differentiable structures) Define mappings x; : RR > 
R,i=1, 2,3, by 


2x|x => 0 3 
K(x) :=x, Ko(x) = K3(x)1= x”. 
x|x <0, 


Each x; is a homeomorphism and hence a global chart on R. Therefore, each x; 
defines a smooth atlas .e% and hence a smooth structure on R. However, no two of the 
charts «; are compatible, because none of the mappings x2 0K, - K2OK3 ork 10K ! 
is differentiable at the origin. Hence, the atlases .% are pairwise incompatible. The 
corresponding smooth structures are diffeomorphic though: consider the mappings 
@; : (R, YJ) > (R, H), O; :=k;, i = 2,3. Since they are homeomorphisms, they 
are bijective and continuous and their inverses are also continuous. The (global) 
representative of }; with respect to the charts «; and x; is 


=| : 
(Disc 4c} =k,oO ®; OK; = idp. 


Analogously, (Gg. x; = ldp. Hence, #; is a diffeomorphism. In contrast, 
the identical mapping ® = idg is not a diffeomorphism. Indeed, as a mapping 
(R, 4) — (R, &), idp has the global representative (id), ,«) = K2 and is thus not 
differentiable at the origin. As a mapping (R, .%) —> (R, &), idp has the global 
representative (id), «; = K3 and is, therefore, of class C~. Its inverse, however, is 
not differentiable at the origin. Finally, as a mapping (R, .) > (R, &), idp has 
the global representative 


(5x))|x=0 


i 
( R)K2,«3 xix <0 


and is, therefore, of class C*. Again, the inverse mapping is not differentiable at the 
origin. 
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Remark 1.3.5 (Differentiable structures on IR”) One can show that on the topolog- 
ical space R, all smooth structures are diffeomorphic to the standard smooth struc- 
ture. This is also true for R” except for n = 4. On R*, besides the standard smooth 
structure, there exist so-called exotic smooth structures [98]. A further example of 
a topological manifold which possesses non-diffeomorphic smooth structures is the 
sphere S’ [159]. 


Next, we discuss partitions of unity, a tool which will be used for example for 
integration of differential forms on manifolds. Recall that a family of subsets {V; : 
i € I} of M is called locally finite if for any m € M there is a neighbourhood U,, 
such that U;, 0 V; = © for all but a finite number of i € J. 


Definition 1.3.6 (Partition of unity) Let M be a C*-manifold. A partition of unity 
of M isa family {g; :i € 1} of C*-functions on M such that 


1. the family of supports {supp(g;) : i € } is locally finite, 
2. gi > Oforallie J, 
3. Vier gi(m) = 1 for allm € M. 


As a consequence of property 3, the family of supports of a partition of unity 
forms a covering of M. Recall that a covering {Vy : a € A} of M is subordinate to 
a covering {Wg : 6 € B} of M if for any a € A there is an element 6 € B such that 
Va C Wg. One says that a partition of unity is subordinate to a given covering of M 
if the family of supports of the partition is subordinate to that covering. 


Proposition 1.3.7 (Existence) For any open covering {Ug : a € A} of a Ck- 
manifold there exists a partition of unity {g; :i € I} with the following properties: 


1. I is countable, 
2. supp(g;) is compact for alli € I, 
3. {g;:i € I} is subordinate to {Uy :a € A}. 


Proof See, for instance, [73, §16.4] or [180, $11.3]. 


Remark 1.3.8 Let us conclude this section with a remark on the relation between 
the differentiability classes C* for different k. Let M be a topological manifold 
and let k > 1. One can show that every C*-structure on M is C‘-diffeomorphic to 
a C™-structure and that this C®-structure is unique up to C®-diffeomorphisms 
[130, Ch. 3]. One may even replace C™ by real analytic here. Another question 
is how many diffeomorphism classes of differentiable structures (hence of smooth 
structures) exist on a given topological manifold. On the one hand, according to 
Remark 1.3.5, there are topological manifolds admitting more than one diffeo- 
morphism class. On the other hand, there exist topological manifolds which are 
non-smoothable, that is, which do not admit a C*-structure with k > 1 at all, see 
[158, 276]. 
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Exercises 


1.3.1 Show that R” \ {0} is diffeomorphic to the cylinder S”~! x R over the sphere 
S”—!. For n = 2 and n = 3, how is this fact related to the definition of polar 
and spherical coordinates? 

1.3.2 Show that the mapping ®: R* — R°, defined by 


@(x):= (x1 sin(x3) + x2 cos(x3), x1 cos(x3) — x2 sin(x3), x3), 


maps the unit sphere S* C R? diffeomorphically onto itself. 
2 2 2 
1.3.3 Consider the ellipsoid M = {x € R?: 4 + +3 = 1} of Exercise 1.2.5/(b). 


Define mappings ®, W : (—4, 4) x @. ory > R by 


@(a, B) := (a, cosa cos B, a2 cosa sin B, a3 sina), 


W (a, B) := (—a, cosa cos f, a2 sina, a3 cosa sin B). 


Show that 

(a) the images of ® and W are open subsets of M and hence smooth mani- 
folds, 

(b) @ and W are diffeomorphisms onto their images. 

1.3.4 Consider the rotational torus M = {x € R3: ae + a =) + a = b’}, 
0 <b <a, of Exercise 1.2.5/(c). Show that 
(a) M is diffeomorphic to the direct product T? = S! x S!, 

(b) the mapping ®: M — S? C R’, defined by ®(x) := Tal? is differen- 
tiable. What is the image of ®? 

1.3.5 Identify S! with the unit circle in C and define an equivalence relation on 
S! x (-1, 1) by (a, t) ~ (8, s) if B =a,s =t or B = —a, s = -¢. By analogy 
with Example 1.1.12, construct a differentiable structure on the corresponding 
topological quotient. Show that the manifold so obtained is diffeomorphic to 
the Mobius strip. 
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In this section, we generalize the notion of tangent space, defined in Remark 1.2.2/1 
for level sets of C‘-mappings R” — R” at regular values, to arbitrary differentiable 
manifolds. Let k > 1. 

In case M is a level set, the elements of the tangent space at m may be viewed as 
the tangent vectors of C*-curves y in M with y(0) = m. To determine the tangent 
vector of y one makes use of oe fact that y is also a curve in the ambient linear 
space IR”, where the derivative # red (t) is well defined. Since this model of tangent 
space is not intrinsic for M, i.e., Le it requires more than just the differentiable 
structure of M, it cannot be cnied over to abstract manifolds. The notion of a curve, 
however, is intrinsic, see Remark 1.3.2/5. Let K;,(M) denote the collection of all 
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C*-curves through m. In the situation where M is a level set, two curves through 
m define the same tangent vector in m if their derivatives at t = O coincide. In this 
case, they should be viewed as being equivalent. By means of local charts, this 
equivalence relation will be carried over to abstract manifolds. For that purpose, we 
have to make sure that it does not depend on the choice of chart. 


Lemma 1.4.1 Let M be a C*-manifold, let m € M and let y1, y2 be curves of class 
Cc through m. fe to (Koy,)(t) = “ to (k 0 y2)(t) for some local chart (U, kK) atm, 
then this holds for every such chart. 


Proof The proof is an exercise in applying the chain rule and is, therefore, left to 
the reader (Exercise 1.4.1). 


Thus, we can define two curves 7), y2 in K,,(M) to be equivalent if for some 
(and hence any) local chart (U, «) at m there holds 


d d 
Gi a (1.4.1) 


Let T,,M denote the set of equivalence classes. On T,,M, a linear structure can 
be defined as follows. Let (U,«) be a local chart at m. Due to Lemma 1.4.1, the 


mapping 


d 
Fn? TnM > R", [yl Fall) = Ge & oO. (1.4.2) 
0 


is well defined and injective. Since x € R” is the image under F* of the equivalence 
class of the curve 


y*(t):=K«~'(k(m) +1x), (1.4.3) 


F* is also surjective. The inverse mapping is given by 


(FX)! =[y*]. (1.4.4) 


By means of F“, we transport the linear structure of R” to T,, M, that is, we define 


o [yi] + Bly := (FS) | (FX (Ini) + BES (f21)). 


This definition does not depend on the choice of chart, because for a second chart 
(V, eo) one has 


FP ([v1) = (@0«7')' (km) - FA (Ly1) (1.4.5) 
(Exercise 1.4.2). As a result, T,, M is a real linear space of the same dimension as 


M and the mappings F* are vector space isomorphisms. 


Definition 1.4.2 (Tangent space) The real linear space T,, M is called the tangent 
space of M at m. Its elements are called tangent vectors at m. 
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Tangent vectors will usually be denoted by X,,, Y;, etc. The local representative 
of a tangent vector X,, € T,, M with respect to the chart (U, «) is defined to be 


X® = FK (Xm). (1.4.6) 


The relation between X*, and the local representative X/, with respect to another 
chart (V, pe) can be read off from (1.4.5): 


Xi =[(e 0%) (€m)] Xn. (1.4.7) 
The coefficients of the local representative X*, with respect to the standard basis of 
R” are® 


‘ d ; 
Xm => : (k! oy)(t). (1.4.8) 


For these coefficients, (1.4.7) yields the transformation law (summation convention) 


xPi = [ (pox!) (e(m))] Xm, (1.4.9) 


that is, the transformation is given by the Jacobi matrix of the coordinate transfor- 
mation. 


Example 1.4.3 


1. Let M be an open subset of a finite-dimensional real vector space V. For every 
vu € M, the assignment of the velocity vector ¢ 1p O) € V to the class [y] € 
T,M defines an isomorphism of the tangent space T, M with the ambient vector 
space V. The inverse of this isomorphism is obtained by assigning to u € V the 
class of the curve t +> v + tu. This isomorphism will be referred to as the natural 
identification of T, M with V. 

2. Let V and W be finite-dimensional real vector spaces and let M C V be the 
level set of a regular value of a Ck-mapping f : V > W. For every v € M, the 
assignment of the velocity vector ¢ pV € V to the class [y] € T, M defines a 
natural isomorphism of the tangent space T, M with the subspace ker f’(v) of V. 
The proof is left to the reader (Exercise 1.4.4). This shows that Definition 1.4.2 
formalizes the idea of the tangent space at a point of a level set, indeed. 


Next, we discuss an equivalent, more algebraic view on tangent vectors. Consider 
the algebra ck (M) of C*-functions on M. Let Xm € Tm and let y € Km(M) 
be a curve representing X,,. By analogy with the directional derivative of analy- 
sis in IR” we define the directional derivative of f € Cé(M) along the curve y by 
¢ abt oy)(t). Due to 


d mee) d 
ae en [(fox')' (k(m))]- Le 


Tf there is no danger of confusion, we will usually omit the chart label and just write X/,. 
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and (1.4.1), for another curve y representing X,, there holds 


¢ ( Co y \(t 
Gn =a ), 


That is, the directional derivative depends on X,, only. Therefore, X,, defines a 


linear mapping, denoted by the same symbol, 


d 
Xm:CK(M)>R,  Xm(f):= ee (foy)(t), (1.4.10) 
0 


where y is some curve representing X,,. For f, g € C*(M), the product rule yields 


Xm(f +8) = Xm(f)gm) + fmXm(8), (1.4.11) 


that means, X,, is a derivation at m. 


Definition 1.4.4 (Derivation at a point) Let M be aC k_manifold and let m € M. 
A derivation at m is a linear mapping D,,: C‘(M) —> R satisfying 


Dn f +8) = Dm(f)g(m) + f(m)Dm(g) forall f,geC*(M). (1.4.12) 


The set of all derivations at m will be denoted by D,,M. It carries the structure of 
a real linear space. Assigning to a tangent vector its directional derivative (1.4.10) 
defines an injection of T,, M@ into D,, M, which preserves the linear structure, 


(Xm + Ym)(f) = Xm(f) + ¥m(P) (1.4.13) 


(Exercise 1.4.7). Thus, Tj, M may be identified with a linear subspace of D,, M. Via 
the isomorphism F'*, the standard basis {e;} of IR” induces a basis in T,,M. We 
determine the derivations which correspond to the elements of this basis. Due to 
(1.4.3) and (1.4.4), for f € C*(M), 


d 


(Fn) Cd) =F, (feet 0 (em) +149) = 


a(f ox!) 
——— ((m). 


Thus, the derivations’ 


(fox) 


ax: CK(M) > R, ys i= ra 


see (k(m)), i=1,...,n, (1.4.14) 
form a basis in T,, M. By construction, the coefficients of a tangent vector X;, with 


respect to this basis are given by the components of the local representative X*,, 


Xm = XS" oF (1.4.15) 


i,m* 


7As for the local representative of a tangent vector, we will usually omit the chart label and just 
write 0j,m- 


1.4 Tangent Space 25 


Moreover, (1.4.8) implies 


Xe Sle) (1.4.16) 

If (V, @) is another local chart at m, the two bases {07 nS and {a? m} Of TmM are 
related by (Exercise 1.4.3) 

3 m=[(« 007!) (0m) 59 n- (1.4.17) 


Proposition 1.4.5 Let M be a C*-manifold, letm € M and let Dm be a derivation 
at m. For arbitrary f, g € C(M), the following holds. 


1. If f is constant, then Dm(f) = 9. 
2. If fru = tu for some neighbourhood U of m, then Dn(f) = Dm(g). 


Proof 1. Write f =21, where 4 = f(m) and | denotes the constant function with 
value 1. Then, 


Dm(f) =ADm (0) =ADm Cd + 1) = d(Din (1) -1+1- Dm(1)) 
= 20Dm (1) = 2D (f). 


2. By assumption, (f — g);u = 0. There exists a C*-function h on M such that 
h(m) = 1 and hyy\u = 0. For example, choose a smooth function on R” which 
has the value 1 at the origin and vanishes outside some e-ball, transport it by an 
appropriate local chart to U and extend it by 0 to a function on M. Then, (f — g)h= 
0 and hence 


0= Dn((f — 8)h) = Dn f — g)h(m) + (f (m) — g(m)) Din (h) = Dn (f — 8). 


Remark 1.4.6 The notion of derivation at a point is related to the notion of derivation 
in the theory of modules over algebras as follows. Let 21 be an algebra and let 9M be 
a bimodule over 2. A linear mapping D : 2l — St is called a derivation if it satisfies 


D(ab) = D(a)b+aD(b), a, ber. 


Definition 1.4.4 is obtained from this more general definition by choosing C*(M) 
for 2 and R for 9, with the bimodule structure being defined by f - x := f(m)x 
and x - f :=xf(m), where f € C'(M), x ER. 


We now show that in the case k = ow, every derivation at m comes from a tangent 
vector. That is, for smooth manifolds, the subspace T,, M in fact coincides with 
DM and the linear spaces T,, M and D, M may be viewed as different realizations 
of the same object. 


Proposition 1.4.7 For every point m of a C®-manifold M there holds TmM = 
DnM. 
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Proof Let (U,«) be a local chart at a chosen point mpg. It suffices to show that 


: : ae e i 
every Ding € Dy can be written as a linear combination of 0; greets OF 6 Let 


f €C™%(M). According to Taylor’s Theorem, ina neighbourhood V of x» =k (mo), 
the local representative f := f ox! can be written in the form 


f (xo) (x' — x6) + fij (x) (x! — xp) (x! — x9), (1.4.18) 


FX) = f@0) + 55 
Xx 


where fii are smooth functions on V, given by 


af 


ax! ax 


1 
fiyoo = | ds(1—s) ((1 — s)xo + sx). 
0 


Since 


af a(f ox!) 


axi 0 oy 


Formula (1.4.18) yields the following decomposition on k~!(V) C U: 


((mo)) = ng Ps 


fm) =f (m0) + ng (F)(K! (n) — x9) 
+ (fij o€Cm)) (On) — x9) («Om = x9). (4.19) 


There exist open neighbourhoods W 1, W2 of mg and a smooth function h on M 
such that W) C W2, W2 Cx ~!(V), hypw, = 1 and hp, = 0. Multiply «! and 
fi j; 0 On W) by A and extend the resulting functions by 0 to M. With «' and 
fi j; °« modified in this way, (1.4.19) holds on Wj and all the functions involved 
are from C™(M). The latter ensures that we can apply Dj, to both sides of this 
equation. The second assertion of Proposition 1.4.5 ensures that after application 
of Dm, both sides are still equal. Then, a brief calculation using the first assertion 
of this proposition yields Ding(f) = Ding (kar ina’ J). Since this holds for all f € 
C™(M), the assertion follows. 


Remark 1.4.8 


1. Let M be a smooth manifold, let m € M and let (U, «) be a local chart at m. As 
a by-product of the proof of Proposition 1.4.7 we note that for a given derivation 
Dm € DnM, the tangent vector corresponding to D,, can be represented by the 
smooth curve tt «~!(k(m) +t Dm (k')e;). 

2. The proof of Proposition 1.4.7 does not work in the case of finite k. Indeed, for 
k = 1 there is no decomposition of f like (1.4.19) and for k > 2, the functions 
fi j ok are of class C k~2 ‘hence one cannot apply a derivation of C*-functions 
to them. As it turns out, this failure cannot be repaired. In fact, for finite k there 
is a big difference between T,,M and D,M, because D,,M can be shown to 
be infinite-dimensional in this case [229]. To summarize, in the case of finite k, 
TM is identified with a proper subspace of D,,M. This subspace consists of 
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the directional derivatives defined by C*-curves through m and is spanned by 
the derivations ar ane i=1,...,n, associated with an arbitrary local chart (U, «) 
atm. 


To conclude this section, we briefly discuss the dual vector space of T,, M. Recall 
that the dual vector space V* of a vector space V over a field K consists of all K- 
linear mappings é : V > K. With respect to the operations 


(k1 + &2)(v) :-=kE(v) + G2(v), keR,veV, 
V* is a vector space over K of the same dimension as V. 


Definition 1.4.9 (Cotangent space) Let M be a C*-manifold and let m € M. The 
dual vector space of T;, M is called the cotangent space of M at m and is denoted 
by T;, M. Its elements are called cotangent vectors or covectors in m. 


Covectors will be denoted by am, Bj, etc. Evaluation of covectors on tangent 
vectors will often be written in the form of a pairing: (Xm) = (@m, Xm). Every 
fec k(M) defines a covector 


(df)\m(Xm) = Xm(f), (1.4.20) 


which is called the differential of f at m. For example, for the coordinate functions 
«! of a local chart (U, x) at m we obtain 


(dc) Catal axe. (1.4.21) 


As a consequence, the set of ae pied ...,;(dk”)m} yields the basis in 
TM which is dual to the basis {df Lm? }in TM. In particular, every covector 
can be written in the form 


On, m 


Gin Ot (de) g Of y= aula y), (1.4.22) 


and the system of real numbers (a ,,,,---, n,m)» Viewed as an element of R”*, i 
called the local representative of a, with aeepect to the local chart (U,«). As = 
tangent vectors, we will usually omit the chart label and just write aj. 

Let (V, e) be a second local chart at m. In view of (1.4.21) and (1.4.22), we can 
read off the transformation laws for the basis and for the coefficients from (1.4.9) 
and (1.4.17), respectively: 


do’ =[(o0K%-!)'(c(m)) Jide, a? =[(« 0 p')'(o(m)) Par, (1.4.23) 


Exercises 
1.4.1 Prove Lemma 1.4.1. 
1.4.2 Prove Formula (1.4.5). 
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1.4.3. Prove Formula (1.4.17). 

1.4.4 Prove the assertion of Example 1.4.3/2. 

1.4.5 For every point of one of the level sets of Exercise 1.2.5, determine the tangent 
space as an affine subspace of R?. 

1.4.6 For each of the classical groups of Example 1.2.6, determine the tangent space 
at the unit element as a subspace of the corresponding matrix algebra M,, (KK). 
Show that all the subspaces so obtained are closed under the operation of tak- 
ing the commutator of matrices. (Later on, we will see that these subspaces, 
together with this operation, are realizations of the Lie algebras of the classi- 
cal groups.) 

1.4.7 Prove Formula (1.4.13). 

1.4.8 Prove Remark 1.4.8/1. 


1.5 Tangent Mapping 


In this section, we generalize the notion of the derivative of a mapping from classical 
calculus to the theory of manifolds. Thereafter, we extend the basic theorems of 
classical calculus to manifolds. 

In the sequel, assume k > 1. Let M and N be C k_manifolds and let ®: M— N 
be a C‘-mapping. ® maps a C*-curve y in M to the curve @ 0 y in N, which 
is of class C* again. If y € Km(M), then Poy € Keim) (N). If 1 ~ 2, then 
Poy, ~ © o 2, because for local charts (U,«) on M at m and (V, ¢e) on WN at 
@(m) there holds 


: @ t)=(po®ox!) t 
ae” oyi)(t)=(poPok ) hel) <= tee) 


d 
= oPox!y eae ° 
=(po®Pok ) (k(m)) P mi y2)(t) 


0 


d 
a DP t). 
di (p 0 Bo y2)(t) 
Thus, ® induces a mapping of the tangent spaces. 


Definition 1.5.1 (Tangent mapping at a point) Let M and N be C*-manifolds, let 
& €C*(M, N) and let m € M. The mapping ©: T,,M — Tom)N, defined by 


P) (Xm) :=[Poy], 
where y is some curve representing Xj, is called the tangent mapping of ® at m. 


The tangent mapping has the following properties. 


Proposition 1.5.2 Let M and N be C*-manifolds, let © € C’(M,N) and let 
memM. 
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1. ®! is linear. 


2. (idw)\, = idr,,m- 
3. If P is another C‘-manifold and YW € C*(N, P), then (WY 0 ®) iy = Yon) 0 Pin 


4. If ® is adiffeomorphism, then ©’, is bijective and one has (/,)~! = (O7"\oeny: 


Assertion 3 is referred to as the chain rule. 


Proof 1. Choose local charts (U,«) on M at m and (V, e) on N at ®(m) and con- 
sider the isomorphisms F* : T,M —> Rm” and FP: Toi N > Rim | defined 
by (1.4.2). It suffices to show that the composition 


Feed! o(re)y sh’ = pen 


is linear. To see this, let X € R&™™” and let 7 denote the curve tH K(m) + fX. 
Then, 


2 d 
FP o@! 0 (FX) O=F, (po Pox! of)t) = (po Pox!) (c(m)) X, 
0 


that is, Ff o @',0 (re)? is given by the derivative of the local representative of ® 
at K(m). 

2. This follows immediately from the definition. 

3. Let X, € T;,M be represented by y € K,,(M). Then, 


(Wo ®), (Xm) =[W oPoy]= Yom) (IP o y]) =Wy (m) © G', (Xm). 


4. Assertions 2 and 3 imply (®~')/, (ny ° Pn = (6-!o@), = (dy), =idr,, uv 
and, analogously, @/, © Ca re AGT cay 


The tangent mapping can be expressed on the level of derivations, too. Let Xi € 
TmM be represented by y € Ky, (M). Then @/, X,, is the derivation at ®(m) on N 
which is given by the directional derivative along the curve @ o y: 


d 
(®),Xm)(f) = Gf oPoNO=Xm(fo), fe CN), °U.5.1) 
0 
The assignment of f o ® to f defines a mapping 
@*:ck(N) > CK(M), O* f:= fo. (1.5.2) 


In this notation, we have 


BD! Xn = Xn 0 O*. (1.5.3) 
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Remark 1.5.3 


1. The mapping ®%* is called the pull-back (of functions) by ®. Later on, it will be 
generalized to differential forms. ®* is ahomomorphism of algebras and satisfies 


for all 6 e Ck(M, N) and W € C*(N, P). In the case of finite k, where T,,M 
is a proper subspace of D,,M according to Remark 1.4.8/2, the right hand side 
of (1.5.3) may be taken as the extension of the tangent mapping from T;,,M to 
DnM. 

2. From the proof of point 1 of Proposition 1.5.2 we conclude that for local charts 
(U,«) on M at m and (V, pe) on N at ®(m) one has 


(2), Xm) pin) = (0.0 ® 0-1)’ (e(m))/ Xm. (1.5.4) 


That is, locally the tangent mapping of @ is given by the derivative (matrix of 
partial derivatives) of the local representative ®,, = po ®o k—! at k(m). 


Example 1.5.4 Let M and N be open subsets of the finite-dimensional real vector 
spaces V and W, respectively. Let 6 ¢ C‘(M, N) and v € M. We determine the 
tangent mapping &’ using the natural identifications of T, M with V and of Tow) N 
with W, cf. Example 1.4.3/1. Since u € V corresponds to the tangent vector X, € 
TM, represented by the curve tt» y(t) := uv + tu, it is mapped by ®/ as follows: 


d 
GY Vee Vl ae P(v+tu) = O'(v)-u, 
0 


where ®'(v) denotes the ordinary derivative of mappings between open subsets of 
finite-dimensional real vector spaces. This shows that the notion of tangent mapping 
generalizes the notion of derivative of calculus in R”. 


Example 1.5.5 Let M be a Ck-manifold, let m € M and let f € Ck(M). We calcu- 
late the tangent mapping f;, : TM — T f(m)R under the natural identification of 
T fanm)R with R, see Example 1.4.3/1. Let X», € TM be represented by the curve 
y. Then, 


d 
Fen) =[foyl= di} (foy)@) = Xm(f) = dfn (Xm), 
0 


see (1.4.20). Hence, the tangent mapping f,, is given by the differential (df) m. 


Example 1.5.6 This example explains the concept of a tangent vector of a curve. 
Let M bea C*-manifold, let J C R be an open interval, let y: 1 > M bea cK. 
curve and let t <¢ J. The tangent vector of / at t represented by the curve st> t+ 5 
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corresponds to the derivation f bt a} f(t). Therefore, it will be denoted by i P 
The tangent vector y(t) of the curve y at ¢ is defined by 


(4 
y(t) =) (<=, : 


It is represented by the curve s+ y(t +5). On the level of derivations, y(t) corre- 
sponds to 


_d 
yO(p= din ° Y). 


Let us determine the local representative of y(t) with respect to the identical chart 
on J and a local chart (U, «) on M. Since the local representative of the unit tangent 
vector at f is given by 1, (1.5.4) yields 


oe ; d 
(YO) Lo =(koy)(@)-l= ae (ko y)(S). (1.5.5) 


Thus, the local representative of y(t) is given by the tangent vector of the curve 
Koy inR"”, 


We now carry over the Inverse Function Theorem, the Implicit Function Theorem 
and the Constant Rank Theorem of calculus in R” to manifolds. 


Theorem 1.5.7 (Inverse Mapping Theorem) Let M and N be C*-manifolds, let 
® € C*(M, N) and let m € M. If the tangent mapping D1: TmM > ToanN is 
bijective, there exist open neighbourhoods U of m in M and V of ®(m) in N such 
that ® restricts to a diffeomorphism of class Ck from U onto V. 


Proof Choose local charts (U,«) on M at m and (V, p) on N at ®(m). Consider 
the local representative Pp, = po @o «—!. By the chain rule, 


/ 


—1 
(Dp. icon) = Pom) 2 Pin 2 (« Neeny? 


By assertion 4 of Proposition 1.5.2, py (m) and Ce , (ny are bijective. Hence, due 
to the assumption, (®p «)/. tid is bijective. Then, the Inverse Function Theorem of 
classical calculus implies that U and V can be shrunk so that ®,, restricts to a 
diffeomorphism from «(U) onto p(V). Then, ®(U) = V and the restricted mapping 
® :U => V isa diffeomorphism. 


Corollary 1.5.8 Let M and N be C*-manifolds and let ® € C‘(M,N). If © is 
bijective and ®}, is bijective for allm € M, then ® is a diffeomorphism of class CF. 


Proof Since @ is bijective, it has an inverse @~! : N > M. Since @', is bijective 
for all m € M, Theorem 1.5.7 says, in particular, that every point of N has a neigh- 
bourhood on which @~! is differentiable of class C*. This yields the assertion. 
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Remark 1.5.9 In Corollary 1.5.8 it suffices to assume that ©’, is injective for all 
m € M. This is due to the fact that bijectivity and differentiability of ®, together 
with the property of manifolds to be second countable, imply that M and N have 


the same dimension, see Exercise 6 of Chap. 1 in [302] for instructions on a proof. 


Theorem 1.5.10 (Implicit Mapping Theorem) Let M,, Mz and N be C*-manifolds 
and let ® € Chim, x M2, N). Let (moi, mo2) € M1 x M2 and p = ®(mo1, m2). If 
the tangent mapping at mg of the induced mapping 


Ping, : M2 > N, Pio, (M2) := P(mo1, m2) 


is bijective, there exist open neighbourhoods Uj; of mo; in M; and a C*-mapping 
W :U, > U2 such that for all (m,,mz2) € U, x U2 there holds ®(m,,mz2) = p iff 
mz =W(m}). 


Proof Choose local charts at 01, mo2 and p and apply the Implicit Function The- 
orem of calculus in R” to the corresponding local representative of . The details 
are left to the reader (Exercise 1.5.2). 


Theorem 1.5.11 (Constant Rank Theorem) Let M and N be C*-manifolds, let 
beck (M, N) and let mo € M. If the linear mapping ®},: TnM > Toon)N has 
constant rank r for all m in a neighbourhood of mo € M, there exist local charts 
(U,«) on M at mo and (V, pe) on N at ®(mo) such that the local representative 
Pp coincides with the restriction to UN @—!(V) of the mapping 


Rh’ ROY a (x, y) + (x, 0) ER’ x Roy 


Proof Choose local charts (U,«) on M at mo and (V,¢) on N at ®(mo), apply 
the ordinary Constant Rank Theorem of calculus in R” to the local representative 
®, and redefine (U, x) and (V, p) accordingly. The details are left to the reader 
(Exercise 1.5.2). 


According to the Inverse Mapping Theorem, bijectivity of the tangent mapping 
has important consequences for the local behaviour of the mapping itself. By weak- 
ening the requirement of bijectivity to injectivity or surjectivity, one arrives at the 
notions of immersion and submersion. 


Definition 1.5.12 (Immersion and submersion) Let M and N be C*-manifolds, let 
® <¢ Ck(M, N) and let m € M. @ is called an immersion at m if 9’, is injective. It is 
called a submersion at m if ®/, is surjective. It is called an immersion (submersion) 
if it is an immersion (submersion) at every m € M. 


Equivalent characterizations are 


® is an immersion at m iff rank ®/, = dim M iff ker ®/, = 0, 
® is a submersion at m iff rank &/, = dim N iff im®), = Tein) N, 
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where rank &/, denotes the rank of the linear mapping &/,. To be an immersion or 
a submersion is a local property, whereas to be injective or surjective is a global 
property. Therefore, an immersion need not be injective and a submersion need not 
be surjective. Conversely, if ® is injective it need not be an immersion and if it is 
surjective it need not be a submersion. 


Example 1.5.13 In the examples to follow, the tangent mapping is given by the 
ordinary derivative of mappings of R”, see Example 1.5.4. 


1. Let 6: R > R?, B(x) := (cos(x), sin(x)). Since P'. = (—sin(x), cos(x)) 4 0 
for all x € R, ® is an immersion. It is not injective though. 

2. Lett ®: RR, G(x) :=e*. Since &. = e* £0 for all x € R, & is a submersion. 
It is not surjective though. 

3. The mapping ® : R > R, defined by ®(x) := x’, is injective and surjective. The 
tangent mapping is ®) = 3x. Since it is neither injective nor surjective at x = 0, 
@ is neither an immersion nor a submersion. 


Remark 1.5.14 Let M, N and P be C*-manifolds and let @ ¢ C*(M,N) and 
Ww €C*(N, P). By means of the chain rule one can show the following. 


1. If @ and W are immersions (submersions), Y o @ is an immersion (submersion). 
2. If W o @ is an immersion, so is @. If it is a submersion, so is W. 


These assertions hold also pointwise. 


Proposition 1.5.15 Let M and N be C*-manifolds and let ® € C*(M, N). The set 
of points of M at which ® is an immersion (submersion) is open in M. 


Proof For a natural number r, define M, := {m € M : rank ©’, > r}. Since for all 
m &€ M there holds rank ®!, < dim M, the set of points at which @ is an immersion 
coincides with Maim w. Analogously, since for all m € M one has rank @/, < dim N, 
the set of points at which @ is a submersion coincides with Mgim y. Therefore, it suf- 
fices to show that M, is open in M for all natural numbers r. Let mo € M,. Choose 
local charts (U,«) on M at mo and (V,) on N at ®(mo) such that @(U) CV. 


Then, rank @/, = rank(p 0 ® ox~!)/(k(m)) for all m € U. Now, 
rank(p 0 ® o x—')'(«(mo)) >r 


is equivalent to the existence of a minor D, : L(RY™™” , R4&™) _, R of rank r 
satisfying D,((9 0 B o «—!)'(«(mo))) + 0. Since the mapping 


xt> D,((poPo x—')'(x)) 
is continuous, there exists a neighbourhood U of kK (mo) such that 


D,((p°®o x—')'(x)) #0 
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for all x € U. Then, «7! (U) is a neighbourhood of mo in M and rank ®/, > r for 
allm €x7!(U). 


Remark 1.5.16 (Basic properties of submersions) In the following, proofs are left 
to the reader (Exercise 1.5.4). According to the Constant Rank Theorem 1.5.11, 
locally, submersions look like the natural projection to a factor of a direct product. 
This has the following consequences. 


1. Submersions are open mappings. 

2. Submersions admit local sections. This means the following. Let ® : M — N be 
a submersion of class C*. For every p € N, there exists an open neighbourhood 
U anda C*-mapping s : U > M such that ® os =idy. The mapping s is called 
a local section of @ at p. 


The existence of local sections implies, in turn, the following. 


3. Let ®: M > N be a surjective submersion of class C* and let ¥™: M—> Pa 
C*-mapping. If there exists a mapping W:N > P such that Wo =¥Y, then 
this mapping is unique and of class C¥. 

4. Let M be a C-manifold, N a set and ®: M > N a surjective mapping. If N 
admits a C*-structure such that ® is a submersion, this structure is unique. 


Next, we generalize the notions of regular and critical point and regular and crit- 
ical value to differentiable mappings between manifolds and state Sard’s Theorem. 


Definition 1.5.17 Let M and N be C*-manifolds and let @ € C*(M, N). A point 
m € M is called regular if ® is a submersion at m. Otherwise, m is called singular 
or critical for ®. A point p € N is called a regular value of ® if 6~!(p) = @ or if 
all points m € ®~!(p) are regular. Otherwise, p is called a singular or critical value 
of @. 


To state Sard’s Theorem for manifolds, one needs the notion of Lebesgue mea- 
sure on a manifold. Such measures® are constructed by using an atlas and a subor- 
dinate partition of unity, see [73, §16.22.2]. 


Theorem 1.5.18 (Sard) Let M and N be C*-manifolds where k > dim M — dim N 
and let ® € CK(M, N). The set of critical values of ® has measure zero with respect 
to the Lebesgue measures on N. The set of regular values is dense in N. 


Proof See [130, Thm. 3.1.3], or [73, §16.23.1] for the C°-case. 


Corollary 1.5.19 Let M and N be C*-manifolds and let ® : M — N be of class 
cc. If dim M < dim N, then ®(M) is a set of measure zero in N and N \ ®(M) is 
dense. 


8 All such measures are equivalent, that is, they have the same sets of measure zero. 
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Proof Due to dim M < dim N, Sard’s Theorem can be applied. For the same reason, 
all elements of M are critical points for ® and hence all elements of ®(M) are 
critical values of ®. This yields the assertion. 


Exercises 


1.5.1 Let M and N be Ck-manifolds and let ® € Ck(M, N). Show that if M is 
connected and if ®), = 0 for all m € M, © is constant. 
1.5.2 Provide the details for the proofs of the Implicit Mapping Theorem 1.5.10 and 
the Constant Rank Theorem 1.5.11. 
1.5.3 Prove the statements of Remark 1.5.14. 
1.5.4 Prove the properties of submersions stated in Remark 1.5.16. 
1.5.5 Let M, N and P be C*-manifolds and let 6 e Ck(M, N) and W € C*(N, P). 
Show the following. 
(a) If @ is a submersion and W is an immersion, then YW o ®: M — P has 
locally constant rank. (W o @ is referred to as a subimmersion.) 
(b) If, on the contrary, ® is an immersion and W is a submersion, then W o ® 
need not have locally constant rank. 
Hint. Consider the mappings ®: R > R?, @(t) := (t,t, 13) and 
Ww: R? > R?, W(x, y, z) := (9, 2). 


1.6 Submanifolds 
Let k > 1 and let N be a Ck-manifold. 


Definition 1.6.1 (Submanifold) A C*-submanifold of N is a pair (M, y), where M 
is a C‘-manifold and g : M — N is an injective immersion of class C*. Subman- 
ifolds (M1, ¢1) and (M2, g2) are said to be equivalent if there exists a diffeomor- 
phism w : M, — Mp2 such that g20W = ¢). 


Remark 1.6.2 


1. Let (M,g) be aC k_submanifold of N and let g : M - @(M) denote the in- 
duced mapping. Since ¢ is bijective, one can use it to carry over the topological 
and differentiable structure from M to g(M), thus making ¢ into a diffeomor- 
phism. Therefore, (g(M),i) with the natural inclusion mapping i : g(M) > N 
is a submanifold of N equivalent to (M, gy). This shows that, up to equivalence, 
every submanifold of N may be assumed to be given by a subset and the corre- 
sponding natural inclusion mapping. We stress that the topology of this subset 
(coming with its manifold structure) need not coincide with the relative topology 
induced from N. Consequences of this fact will be discussed below. 

2. The concept of equivalence carries over in an obvious way from submanifolds 
to immersions. Two immersions yg; : M; — M and g2: M2 > M are said to be 
equivalent if there exists a diffeomorphism yy : M, — Mp2 such that g20 yf = 9}. 
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Vas 
4 5 


Fig. 1.6 The figure eight submanifolds y, (/eft) and y_ (middle) and the figure eight immersion 
(right) of Example 1.6.6/2. The arrows mean that the curves approach themselves without touching 


Let us start with a couple of examples. 


Example 1.6.3 (Open subsets) Let N be a C«-manifold, let M be an open subset 
of N with the induced smooth structure and let g: M — N be the natural inclusion 
mapping. For all m € M, T,M =T,,N and ¢', is the identical mapping. Hence, 
is an immersion and (M, ¢g) isa C k submanifold. 


Example 1.6.4 (Level sets) Let V and W be finite-dimensional real vector spaces 
and let M be the level set of a regular value of a function f : V > W of class C*. Let 
t: M — V denote the natural inclusion mapping. Then, (M, 1) is a C’-submanifold 
of V. Indeed, according to the Level Set Theorem 1.2.1, M isaC k_manifold. To 
see that « is of class C*, choose a basis in V to identify V with R” and recall the 
construction of local charts on M in the proof of this theorem. The local represen- 
tative of « with respect to such a chart is given by the mapping which in this proof 
is denoted by A. By the Implicit Function Theorem, this mapping is of class C*. Fi- 
nally, for every v € M, under the natural identifications of T, M with ker f’(v) and 
of T, V with V, see Example 1.4.3/1, v', is given by the natural inclusion mapping 
of the subspace ker f’(v) of V and is thus injective. 


Example 1.6.5 (Graphs) Let M = R, N = R? and let f : RR > R be a function 
of class C*. Define v(x) := (x, f(x)), x € R. The image of ¢ is the graph of the 
function f. By construction, g is of class C* and injective. The tangent mapping is 
gy}. = (1, f’(x)); it is injective for all x € R. Hence, (R, ¢) is a submanifold of class 
CK of R’. 


Example 1.6.6 (Curves) More generally, let M = J C R be an open interval and let 
y:1—N beacurve of class C*. If »(t) £0 for all t € J, y is an immersion. If 
y is also injective, that is, if the curve y does not intersect itself, then (J, y) is a 
submanifold of class C* of N. This holds, in particular, for curves in N = R? of the 
form y(t) = (t, f(t)), where f:R— Risa C*-function (Example 1.6.5). We list 
three typical examples of curves which are submanifolds. 


1.6 Submanifolds 37 


No 


4 5 6 


Fig. 1.7 The curves 4-6 of Example 1.6.6. The arrows in 4 mean that the curve approaches the 
vertical axis without touching 


ra 


. Let ] =R, N =R? and y(t) = (t, 0). The image of y is the x-axis. 


2. Let I = (0, 1), N = R? and y(t) = (#sin(4zt), sin(27t)). Both curves have 


the same image as the corresponding curves with J = R, which are just immer- 
sions and not submanifolds. The image is a closed subset of R* known as the 
figure eight, see Fig. 1.6. Correspondingly, submanifolds or immersions with 
that image will be referred to as a figure eight submanifold or immersion, re- 
spectively. 


. Let 1 =R, N=T* =S! x S!, realized as the subset {(z, w) € C? : |z|/? = 


|w|* = 1}. For (z, w) € T* and 3 ER, let 


Yz,w),9(t) = (ze, werm?t) 

This is an immersion. It is injective and hence defines a submanifold iff % is irra- 
tional. In this case, it is known that the image y (R) is dense in T?. Let us remark 
that the curves (; ),9 With @ irrational are usually referred to as the orbits of 
the irrational torus flow. The notion of flow will be introduced in Chap. 3. 


For illustration, we also list three curves which are not submanifolds, see Fig. 1.7. 


Let N = R? and J = (—1, 1). 


4. 


Nn 


The curve y(t) = (sin(27(t + 1)), cos(r(t + 1))) is an immersion but intersects 
itself, hence it is not injective.” 


. The curve y(t) = (f, |t|) is not differentiable at 0. 
. The curve y(t) = (t3, t7) is not an immersion at 0. 


Next, we prove that every submanifold admits an atlas whose charts are induced 


from charts of the ambient manifold. 


Proposition 1.6.7 (Charts adapted to a submanifold) Let (M, v) be a C*-submani- 
fold of N. For every m € M there exists an open neighbourhood U of m in M anda 
local chart (V, e) on N such that 


°The image of this curve is not a figure eight, because it is missing the point (0, 1). 


38 1 Differentiable Manifolds 


1. pU)CV, | | 
2. p(p(U)) is an open subset of (R?™™ x {0}) CREM, 
3. the chart (U, p 0 gy) induced on M is compatible with the C*-structure of M. 


Proof Let r= dimM and n = dimN. The Constant Rank Theorem 1.5.11 yields 
local charts (U,«) on M at m and (V, e) on N at g(m) such that 


pogok !(x1,...,Xn) = (x1,...,X7,0,..., 0), (X1,...,Xn) EK(U). 


Since p(g(U)) is an open subset of the subspace R” and p o gry is a homeomor- 
phism onto its image, (U, 0 gy) is a local chart on M. Let (W, o) be a local chart 
of the C*-structure on M. Since the transition mapping between (U, p o ¢;u) and 
(W,©o) is given by the local representative of g with respect to the charts (W, a) 
and (V, p), the charts (U, p o gry) and (W, o) are compatible. 


Motivated by the observations made in Remark 1.6.2/1, we continue with in- 
troducing two special classes of submanifolds. Let (M, g) be a submanifold of N. 
Since g is continuous, the topology induced on g(M) by means of the induced bi- 
jection g : M > g(M) is at least as fine as the relative topology induced from N. 
It may be finer, though, see Examples 1.6.12/3 and 1.6.12/5 below. The first class 
of submanifolds to be introduced is characterized by the property that these two 
topologies coincide. A necessary and sufficient condition for this is that g be open 
onto its image.!° 


Definition 1.6.8 (Embedded submanifold) A C<-submanifold (M, ¢) of N is called 
embedded if ¢g is open onto its image. 


The second class of submanifolds is characterized by a property related to map- 
pings. Let (M,@) be a submanifold of N and let P be another manifold. For a 
C*-mapping x : N > P, we define x1u:M — P by 


X{M =X O*, 


and for a C*-mapping w:P— N with y(P) C g(M), we define wy! : P > M 
by 


poy!” = Ww. 


By a slight abuse of terminology, we refer to x}, as the restriction of x in do- 
main to M and to w!™ as the restriction of w in range to M. Obviously, x; is a 
C*-mapping again. For y!™ this need not hold, because if the topology on g(M) 
induced from M is finer than the relative topology induced from N, w!™ need not 
be continuous; see Example 1.6.12/5. 


!OBy definition, this means that the induced mapping @ is open with respect to the relative topology. 
In this case, g is called an embedding. 
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Definition 1.6.9 (Initial submanifold) A C<-submanifold (M, ¢) of N is called ini- 
tial!! if for any C*-manifold P and any C‘-mapping wy : P > N which satisfies 
w(P) Cc g(M), the restriction in range y 'M . P -» M is continuous. 


Since for a mapping between topological spaces, the property of being contin- 
uous is preserved under arbitrary restrictions to subsets with the relative topology, 
every embedded submanifold is initial. 


Proposition 1.6.10 (Restriction in range) Let N, P be C*-manifolds, let (M, v) be 
a C*-submanifold of N and let: P > N be a C*-mapping with (P) C g(M). 
The restriction in range IM: P+ Mis of class ou iff it is continuous. 


Proof Denote dim M =k and dim N = 1. Continuity of y!™ is of course necessary 
for differentiability. To see that it is also sufficient, it is enough to show that it implies 
that yw! is of class C* in a neighbourhood of an arbitrary point p € P. According 
to Proposition 1.6.7, there exists an open neighbourhood U of w!“(p) in M anda 
local chart (V, p) on N at p(w 1M (p)) = (p) such that p o gy takes values in the 
subspace IR‘ x {0} of R! and thus induces a chart (U, pogty) on M at wv! (p). 
Since yl! is continuous, (w [M)-l (ry) is open in P. Hence, there exists a local 
chart (W,o) on P at p such that wiM(W) CU. Due to pogowlMoat= 
powoa!, the local representative of 1M with respect to the charts (W, 0) and 
(U,p ° gu) coincides, up to the embedding IRk — R! which is suppressed here, 
with the local representative of w with respect to the charts (W, o) and (V, pe). This 
shows that y!™ is of class C*. 


Corollary 1.6.11 The restriction of a C*-mapping in range to an initial C*- 
submanifold is of class C*. 


Example 1.6.12 


1. Open subsets (Example 1.6.3), level sets of C*-functions at regular values (Ex- 
ample 1.6.4) and graphs of C*-functions on R (Example 1.6.5) are embedded 
submanifolds. In the first case, g itself is open. In the second case, M is a subset 
of R” and is equipped with the relative topology, so that the natural inclusion 
mapping is open onto its image by construction. In the third case, the image of 
an open interval (a, b) under g can be written as 


y((a, b)) = ((a,b) x R)N g(M) 


and is hence open in g(M). 

2. The statement about the graph of a C*-function on R generalizes to arbitrary 
C*-mappings. Let M and N be C*-manifolds and let y € C*(M, N). Define 
g:M—>MxN by g(m) := (m, W(m)). Then, (M, ¢) is an embedded ct. 
submanifold of M x N. The proof is left to the reader (Exercise 1.6.1). 


‘Or, alternatively, weakly embedded. 
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3. Let M=R, N =T’ and lety = Y(z,w),9 be one of the curves of Example 1.6.6/3, 
with % irrational. The submanifold (M, g) is initial but not embedded. That it 
is not embedded is due to the fact that g(M) is dense in N, because this im- 
plies that the preimage under @ of the intersection of g(M) with an arbitrary 
open subset of N cannot be bounded. To see that these submanifolds are ini- 
tial, let yy: P > N be a Ck-mapping with y(P) C g(M) and let pe P, te M 
such that y(p) = g(t). There exist an open interval J containing ¢ and an open 
subset U C N such that g(/) is an arcwise connected component of the sub- 
set p(M) NU of N with respect to the relative topology; and ¢ restricts to a 
homeomorphism from J onto g(/). Let W, C P denote the arcwise connected 
component of y~!(U) containing p. As a mapping to y(M) with respect to 
the relative topology, w is continuous and thus preserves arcwise connectedness. 
Hence, y(W,) C g(/). Now let {py} be a sequence in P converging to p. Since 
W, is an open neighbourhood of p in P, we may assume p, € W for all n. Then, 
W (pn) € GU) for all n and 


o(v™ (pn) = Vpn) > V(p) = 90), 


hence y!™(p,) > t. Since p was arbitrary, this shows that y!™ is continuous. 

4. More generally, every orbit of a Lie group action is an initial submanifold. It is 
embedded if the action is proper, see Chap. 6. 

5. Let (M, 9) be the figure eight submanifold of N = R? given by the curve y, 
of Example 1.6.6/2. (M, @) is not initial, because for P = R and w = y_, the 
restriction in range y!™ maps the convergent sequence {4} in P to a divergent 
sequence in M. 


Remark 1.6.13 


1. Let (M, g) be a C*-submanifold of N. Proposition 1.6.7 shows that for any m € 
M there exists an open neighbourhood U of m in M such that (U, yyy) is an 
embedded C*-submanifold of N. That is, every submanifold (M, ¢) is locally 
embedded, where locally refers to the topology of M. More generally, one can 
show that for every compact subset A of M there exists an open neighbourhood 
U of A in M such that (U, gy) is an embedded C*-submanifold of N; see 
Exercise 1.6.2 for instructions on a proof. 

2. A compact C*-submanifold is always embedded. To see this, consider the in- 
duced mapping g: M —> g(M), where y(M) carries the relative topology in- 
duced from N. Since @ is a bijection, it is open iff it is closed. We show the 
latter. Let A C M be closed. Since closed subsets of compact spaces are com- 
pact, A is compact. Since compactness is preserved under continuous mappings, 
(A) is compact. Since compact subsets of Hausdorff spaces are closed, g(A) is 
closed in g(M). 

3. The following criterion is simple yet useful. Let M be a C*-manifold and 
let gE ck (M,N). (M,¢) is an embedded C k submanifold of N iff for ev- 
ery m € M there exists an open neighbourhood V of g(m) in WN such that 
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(pl(V), Pty-!(v)) is an embedded C*-submanifold of V or of N. The state- 
ment remains true if embedded is replaced by initial. The proof is left to the 
reader (Exercise 1.6.3). 

4. The property of being initial or embedded is stable with respect to taking preim- 
ages of submanifolds under so-called transversal mappings, see Sect. 1.8. 

5. Initial Ck-submanifolds (M1, ¢1) and (M2, ¢2) of N are equivalent iff g) (M1) = 
¢2(M2) as subsets of N. In particular, if a subset of N admits a differentiable 
structure which makes it into an initial submanifold, this structure is unique. 
To see this, it suffices to show that g1 (M1) = g2(M2) implies equivalence. Let 
w:M, — M)» be the restriction in range of g; to the submanifold (M2, ¢2). 
Then, ¢; = ¢2 o y. The restriction in range of g2 to the submanifold (M1, ¢1) 
yields wl, Since both (M1, g1) and (Mo, ¢2) are initial, yy is a diffeomorphism. 

6. In contrast to the case of initial submanifolds, in the general case, one and the 
same subset may be the image of non-equivalent submanifolds. For example, 
let N = R* and let M be given by the figure eight, see Example 1.6.6/2. By 
viewing M as the image of R under the curves y+ or y_, we obtain two smooth 
structures on M, denoted by M. Let j : M — N denote the natural inclusion 
mapping. Both (M,, j) and (M_, j) are smooth submanifolds of R?. While the 
manifolds M, and M_ are diffeomorphic, because they are both diffeomorphic 
to R, the submanifolds (M,, j) and (M_, j) of R? are not equivalent, because 
the only mapping ¥: M, — M_ satisfying j = j o w is the identical mapping 
M, — M_ which however is not continuous. 


Proposition 1.6.14 Let M, N and P be C*-manifolds. 


1. Let (M,¢) bea C*-submanifold of N and let (N,w) bea C*-submanifold of 
P. Then, (M, Ym) isa C*-submanifold of P. If (M, 9) and (N, Ww) are initial 
(embedded), so is (M, ym). 

2. Let (M, @) and (P,w) be C*-submanifolds of N such that w(P) Cc g(M) and 
assume that (M, @) is initial. Then, (P, yl!) is a C*-submanifold of M. If 
(P, wr) is initial (embedded), so is (P, w'™). 


Proof 1. A composition of injective mappings is injective. A composition of immer- 
sions is an immersion, see Remark 1.5.14. By definition of being initial, (M, W)7) 
is initial if so are (M, ~) and (N, yw). Due to the fact that a composition of mappings 
which are open onto their images is open onto its image (Exercise 1.6.4), (M, Wm) 
is embedded if so are (M, g) and (N, w). 

2. The restriction in range y!™ is differentiable by definition of initial subman- 
ifold. It is injective, because so is w. It is an immersion, because so is yf; see 
Remark 1.5.14. Hence, (P, wit) is a Cé-submanifold of M, indeed. Next, as- 
sume that (P, w) is initial. Let Y be a C*-manifold and let y € C*(Y, M) such 
that x(Y) Cc Ww!“ (P). For the restriction x!? of x in range to P there holds 
gox=wox!?. Since y o x is of class C* and since (P, y) is initial, x!? is 
of class C*. Hence, (P, w 1M ) is initial. Finally, assume that (P, w) is embedded. 
Then, yy is open onto its image and we have to show that the same holds for y!™. 
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Let A C P be open. Then, W(A) is open in w(P), that is, wW(A) = W(P) 2 B for 
some open B C N. Thus, 


go '(W(A)) ="! (W(P)NB) =e"! (W(P)) Ng |(B). 
By definition of y!”, g~!(W(A)) = w™ (A) and g!(W(P)) = vw! (P). Since 


vy !(B) is open in M by continuity of y, the above equality shows that y!™” (A) is 
open in wy! (P). Hence, w!™ is open onto its image, as asserted. 


Exercises 


1.6.1 Show that the graph of a C‘-mapping yg : M — N is an embedded submani- 
fold of M x N, cf. Example 1.6.12/2. 

1.6.2 Prove the statement made in Remark 1.6.13/1 that for every compact subset 
A of a C*-submanifold (M, ¢) of N there exists an open neighbourhood U 
of A in M such that (U, g;y) is an embedded C k submanifold of N. 
Hint. Use local charts to show that A possesses an open neighbourhood U 
of A in M with compact closure. By means of the argument which was used 
in Remark 1.6.13/2 to prove that a compact submanifold is necessarily em- 
bedded, show that the restriction of g to the closure of U is open onto its 
image. 

1.6.3 Prove the criterion for the embedding property of Remark 1.6.13/3. 
Hint. Show that a continuous mapping f : X — Y of topological spaces X, 
Y is open onto its image if and only if for every y € f(X) there exists a 
neighbourhood V of y in Y such that the induced mapping f~!(V) > V is 
open onto its image. 

1.6.4 Complete the proof of Proposition 1.6.14 by showing that the composition of 
mappings which are open onto their images is open onto its image. 
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In this section we discuss the question how to characterize those subsets M of N 
which are images of submanifolds, cf. Remark 1.6.2/1. This will be needed for the 
Transversal Mapping Theorem in the next section and for the discussion of distribu- 
tions and foliations in Sect. 3.5. The question under consideration may be rephrased 
as follows. Let M C N and let 1: M — N denote the natural inclusion mapping. 
Under which conditions does there exist a C*-structure on M such that (M, 1) is 
a submanifold (initial submanifold, embedded submanifold) of N? Here, by a C Ke 
structure on M we mean both a topology on M and a maximal atlas whose charts are 
local homeomorphisms with respect to this topology. Of course, if (M, 1) is embed- 
ded, the underlying topology coincides with the relative topology induced from NV. 
Below we will derive existence criteria in terms of local charts on N, starting with 
the general case and then turning to the initial and embedded cases. 


1.7 Subsets Admitting a Submanifold Structure 43 


Proposition 1.7.1 Let N be a C‘-manifold and let M C N be a subset. Consider 
the following condition. 


(S) There exists 1 € N and a countable covering {Mj} of M such that, for every i, 
one can find a local chart (Vj, p;) on N satisfying 
(S1) M; Cc Vj and p;(M;) is an open subset of the subspace R! x {0} of RUMN 
(S2) for alli A j, p;(M; MO M;) is an open subset of R! x {0}. 


If condition (S) holds for some | €N, the family of bijections {(Mj, p;}m,;)} induces 
a C*-structure of dimension | on M. With respect to this structure, (M,1t) is a cK. 
submanifold of N. Conversely, if there exists a C*-structure of dimension | on M 
such that (M, 1) is a Ck-submanifold of N, then (S) holds for this | and the family 
{M;} can be chosen so that the C*-structure so induced on M coincides with the 
original one. 


Condition (S2) is necessary, because otherwise one could, for example, cover 
the figure eight in R* by subsets M, and M> forming a figure S and a reversed S, 
respectively. Then, there exist local charts on R* such that condition (S1) holds. 
However, since M,M M2 contains an isolated point (the origin), it cannot be mapped 
to an open subset of R by any of these charts. Hence, the figure eight cannot become 
a topological manifold this way. 


Proof Assume that (S) holds. According to Remark 1.1.10, the family of bijections 
{(M;, pit; )} defines a topology on M which is Hausdorff and second countable and 
with respect to which the (M;, 0; ;m;) are local charts of dimension / on M. Here, by 
abuse of notation, we view ; |, aS a Mapping to R! x {0}. The transition mappings 
between two such local charts on M are obtained by restriction of the transition 
mappings between the original local charts on N to subsets of the subspace R! x {0} 
of R“™% which are open by condition (S2). Hence, the transition mappings are 
of class C* and thus the charts (Mj, pitm;) define a C k_structure of dimension / 
on M. Since for every 7, the local representative of 1 with respect to the local charts 
(M;, pitm;,) and (Vj, p;) is given by restriction of the natural embedding R! x {0} > 
R&™N to an open subset of R! x {O}, visa C*-immersion. Thus, (M,) is a cr. 
submanifold. 

The converse assertion is due to Proposition 1.6.7 and the fact that M contains a 
countable dense set (because it is second countable). 


For initial submanifolds, following [211] we introduce the following notion. Let 
N be a C‘-manifold. A piecewise C‘-curve in N is a continuous mapping [a, b] > 
N for which there exist a < t) <--- <t, <b such that the restriction to (¢;, tj+1) is 
of class C¥ for alli =0,...,7r, where to =a and tr+1 = b. A subset A C N is said 
to be C*-arcwise connected relative to N if any two points of A can be joined by a 
piecewise C*-curve in N which is contained in A. For points of A, the property of 
being joinable by such a curve defines an equivalence relation in A. The equivalence 
classes are called the C‘-arcwise connected components of A relative to N. Below 
we will need the following evident facts. 
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(a) Any subset of A that is C k_arcwise connected relative to N is contained in a 
C*-arcwise connected component of A relative to N. 

(b) If A is open in N, its C*-arewise connected components relative to N are open 
in N.!2 

(c) If A Cc V for some local chart (V, o) on N, then A is C*-arewise connected 
relative to N iff o(A) is C k_arcwise connected relative to R&™ 


Proposition 1.7.2 Let N be a C*-manifold and let M C N be a subset. Consider 
the following condition. 


(I) There exists 1 € N and a countable covering {M;} of M such that, for every i, 
one can find a local chart (V;, p;) on N such that 
(Il) M; C V; and p;(M;) is an open subset of the subspace R! x {0} CR&™, 
(12) M; is a C’-arcwise connected component of V; 1 M relative to N. 


If condition (1) holds for some | € N, the family of bijections {(M;, pitm;)} induces a 
C*-structure of dimension | on M. With respect to this structure, (M, 1) is an initial 
C*-submanifold of N. Conversely, if there exists a C*-structure of dimension | on 
M such that (M, 1) is an initial C* -submanifold of N, then (1) holds for this | and 
the C*-structure so induced on M coincides with the original one. 


Proof First, assume that (I) holds. The proof that (M, 1) is a submanifold of N is 
analogous to that of Proposition 1.7.1, except for the fact that, here, we have to give 
an argument that the domains p;(M; M M;) of the transition mappings between the 
charts induced on M are open subsets of IR! x {0}. The case where M; 1M ; is empty 
is trivial. Thus, assume M; 1 M; 4 @. Let mo € M; 1 M;. Let A denote the ck- 
arcwise connected component of M; 1 V; relative to N which contains mo. It is easy 
to see that A C M; 1 Mj. Then, p;(mo) € pj(A) C 9; (Mj O Mj). Moreover, p;(A) 
is a C‘-arcwise connected component of the open subset p;(M; 1 V;) of R! x {0} 
relative to R“'™ , hence relative to R! x {0}. Hence, p;(A) itself is open in R! x {0}. 
This shows that p;(M;  M;) is open in R! x {0}, as asserted. 

It remains to show that (M,v) is initial. Let P be a C‘-manifold and let we 
C*(P, N) be such that w(P) Cc M. For every p € P, there exists an i such that 
w(p) € M;. Since wv (Vi) is open, there exists a local chart (W,o) on P at p 
such that w(W) Cc V;. Then, w(W) C V; NM. W can be chosen to be C*-arewise 
connected relative to P (e.g. by choosing it so that o(W) is convex). Then, w(W) 
is C*-arcwise connected relative to N, so that Ww (W) Cc M,;. It follows that the lo- 
cal representative (y!),. PitM; is given by the restriction in range of Wo,p, to the 
subspace R! x {O}, hence it is of class C*. Since the domain o(W) of (W Lee 
contains o(p) and since p was arbitrary, it follows that y!™ is of class C*. 

Now, assume that there exists a C*-structure on M such that (M, t) is an ini- 
tial submanifold of N. Since M is second countable, there exists a countable dense 


'2Tn fact, they coincide with the connected components of A. 
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family of points {m;} in M. For each i, Proposition 1.6.7 yields an open neigh- 
bourhood U; of m; in M and a local chart (V;, 6;) on N at m such that p;(U;) 
is an open subset of the subspace R! x {O} Cc R'™N For ¢ > 0, let Bie de- 
note the open ¢-ball in R&™ about pi(m;). Choose ¢; so that Bj,<, C bi(Vi) and 
(Bic, 1 (R! x {0})) C (Uj). Define 


Viz= 6 '(Bie), pi=hiv,,  Mi:=6;'(Bie, N(R! x {0})). 


Since the subsets M; are open in M and since the family of points {m;} is dense, 
the family {M;} covers M. By construction, it satisfies condition (I1). It remains to 
check condition (12). Since Mj; is C k_arewise connected relative to N by construc- 
tion, it remains to show that M; contains all m € V; N M for which there exists 
a piecewise C*-curve y : [0,1] — MN such that y((0,1]) C Vi NM, y(O) =m; 
and y(1) =m. Let such m and y be given. Let 0 < t) <--- <t, < 1 be such 
that y}(2;,1;,,) is of class CK for alli =0,...,r, where t9 = 0 and tr+1 = 1. Since 
(M, t) is initial, (Vtejtiey) is continuous for all 7, hence yiM is continuous. 
Thus, if m ¢ M;, since M; is open in M, there must exist t € [0,1] such that 
y(ne M; \ M;, where M; denotes the closure of M; in M. Since /; maps U; home- 
omorphically onto f;(U;), there holds j;(M;) = Bes. A(R! x {0}). This implies 
ily (t)) € (R! x {0}) \ B;,-, and hence y(t) ¢ V; (contradiction). Thus, m € M; and 
condition (I2) holds, indeed. Now, since the family of subsets M; so constructed 
satisfies (I1) and (12), it induces a C*-structure on M with respect to which (M, 1) 
is an initial submanifold of M. According to Remark 1.6.13/5, this C*-structure 
coincides with the original one. 


In the case of an embedded submanifold, M already carries a topology, namely, 
the relative topology induced from NV. 


Proposition 1.7.3 Let N be a C’-manifold and let M C N be a subset. Assume that 
M is equipped with the relative topology induced from N. Consider the following 
condition. 


(E) There exists 1 € N and a family of local charts {(V;, p;)} on N such that 
(El) MCU; Vi. 
(E2) for every i, p)(Vi OM) = p(V;) N(R! x {0}). 


If condition (E) holds for some 1 € N, the local charts (Vj, p;) induce local charts 
(Vi OM, pitv;nm) of dimension | on M. These charts establish a C*-atlas and 
hence a C*-structure on M. With respect to this structure, (M, 1) is an embedded 
C*-submanifold of N. Conversely, if there exists a C*-structure on M such that 
(M, 1) is an embedded C*-submanifold of N of dimension 1, then (E) holds for this 
1 and the C*-structure so induced on M coincides with the original one. 
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Proof Assume that (E) holds. As a topological subspace of N, M is Haus- 
dorff and second countable.!* For every i, the subset V;  M of M is open in 
the relative topology. Being the restriction of a homeomorphism onto its image, 
Pitv;am 1s a homeomorphism onto its image. Since by assumption the image is 
pi(V;) O (R! x {0}) and is hence open in R! x {0}, the pair (Vi N M, p; tv;nm) is a 
local chart on M. By the same arguments as in the proof of Proposition 1.7.1, the 
transition mappings are of class C* and (M,1) is a C*-submanifold of N. Since, 
by definition of the relative topology, 1 is open onto its image. Thus, (M,v) is an 
embedded C k_submanifold, as asserted. 

For the converse assertion, let {U;} be the family of open subsets of M and let 
{(V;, 0;)} be the family of local charts on N provided by Proposition 1.6.7. Since M 
carries the relative topology, for every i, there exists an open subset W; in N such 
that U; = MO W;. By replacing V; by V; 1 W; we obtain the desired family of local 
charts of N. Since embedded submanifolds are initial, Remark 1.6.13/5 yields that 
the C*-structure on M induced by this family coincides with the original one. 


Remark 1.7.4 Conditions (S), (1) and (E) of Propositions 1.7.1, 1.7.2 and 1.7.3 can 
be reformulated in terms of submanifolds rather than charts in various ways. The 
following are particularly convenient. Since the proofs are straightforward, they are 
left to the reader (Exercise 1.7.3). 

Condition (S) of Proposition 1.7.1 can be replaced by 


(S) There exists | € N and a countable covering {Mj} of M such that, for every i, 
the subset Mj; carries a C*-structure satisfying 
(S1) (Mj, t+u,) is an embedded ck -submanifold of N of dimension 1, 
(S2) for alli, j, the intersection M; 1 M; is open in M; and M;. 


Condition (I) of Proposition 1.7.2 can be replaced by 


(1) There exists 1 € N and a countable family of open subsets {V;} of N such that 
G1) McU; Vi. 
(I2) for every i, Vij AM admits a C*-structure of dimension | with respect to 
which it is an initial C’-submanifold of N. 
Condition (E) of Proposition 1.7.3 can be replaced by 
(E) There exists 1 € N and a family of open subsets {V;} of N such that 
(El) MCU; Vi. 
(E2) for every i, Vi 1M admits a C*-structure of dimension | with respect to 
which it is an embedded C*-submanifold of N. 


For later use, from Proposition 1.7.3 we extract 


Corollary 1.7.5 Let N be a C*-manifold and let M C N be an embedded C*- 
submanifold. For every m € M there exists a local chart (V,p) on N such that 


'3Since M already carries a topology, there is no need to require the family {(V;, 9;)} to be count- 
able. 
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MV is the set of solutions of the equations p' =0,i=dimM +1,...,dimN, 
and (VM, (p!, say pt™™) vam) is a local chart on M at m. 


As an application of Proposition 1.7.3, one can prove (Exercise 1.7.4) 


Proposition 1.7.6 (Level Set Theorem for mappings of locally constant rank) Let 
N and P be C*-manifolds, letge C¥(N, P) and let Dp € P such that M := go '(p) 
is nonempty. Assume that every mo € M has a neighbourhood in N where rank ¢',, 
is constant.'4 Then, every connected component? Mo of M is an embedded C*- 
submanifold of N of dimension dim N — rank Brags where mo € Mo. 


Finally, we note that the terminology concerning submanifolds used here is con- 
sistent with e.g. [166] and [302]. It is common as well to take condition (E) of 
Proposition 1.7.3 as the definition of submanifold (which hence corresponds to our 
embedded submanifold) and to refer to our submanifolds as immersed or virtual 
submanifolds, as is done for example in [73], [130], [180] and [232]. 


Exercises 


1.7.1 Use Proposition 1.7.2 to show that the submanifold of T? given by the curve 
V(z,w),9 Of Example 1.6.6/3, with & irrational, is initial. 

1.7.2 Use Proposition 1.7.2 to show that the figure eight submanifolds of Exam- 
ple 1.6.6/2 are not initial. 

1.7.3 Prove that conditions (S), (I) and (E) of Propositions 1.7.1, 1.7.2 and 1.7.3 
can be reformulated as given in Remark 1.7.4. 

1.7.4 Use Proposition 1.7.3 to prove Proposition 1.7.6. 


1.8 Transversality 


The Level Set Theorem 1.2.1 states that the preimage of a regular value of a differ- 
entiable mapping from an open subset of R” to R” is an embedded submanifold of 
IR”. This has a generalization to differentiable mappings between manifolds and to 
preimages of submanifolds rather than just points. In this context, the condition of 
regularity is replaced by the condition of transversality. 


Definition 1.8.1 (Transversality) Let N, P and Q be C*-manifolds. 


1. C*-mappings g:N— P and w: Q — P are transversal if for all y € N and 
q € Q such that y(y) = w(q) = p there holds y/,(TyN) + WG (Ty Q) =T>pP (the 
sum need not be direct). , 


14Tn other words, gy is a subimmersion at every mo € M, cf. Exercise 1.5.5/(a). 


'SSince the connected components of M may have different dimensions, M as a whole may not be 
a manifold. We leave it to the reader to provide examples. 
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2. A C-mapping y : N > P is transversal to a C*-submanifold (Q, y) of P if g 
and yw are transversal. 

3. C*-submanifolds (N,v) and (Q, w) of P are transversal if y and y are transver- 
sal. 


If the submanifolds (NV, g) and (Q, Ww) are given by subsets, the condition of 
transversality is usually written in the form 


TypN+TpQ=TpP, pEeNng@. 


Theorem 1.8.2 (Transversal Mapping Theorem) Let N and P be C‘-manifolds, let 
(Q,9) bea C*-submanifold of P and let w € CK(N, P) be transversal to (OQ, 9). 
Assume that M := w—!(p(Q)) is nonempty and let: M — N denote the inclusion 
mapping. 


1. There exists a C*-structure on M such that (M,1) is a C-submanifold of N of 
dimension dim N — dim P + dim g.'6 For everyme M, 


Un (TmM) = {Xm ETnN: Win (Xm) € 9, (Tq Q)}, 


where q € Q such that p(q) = w(m). 
2. If (Q, @) is initial, so is (M, 1). 
3. If (Q, g) is embedded, so is (M, 1). 


Proof First, we use transversality to reduce the problem locally to the Level Set The- 
orem 1.2.1. Then, in order to prove the assertions 1-3, we apply Propositions 1.7.1- 
1.7.3. Denote n = dim N, r = dim P and s = dim Q. Write R” = R* x R’~ and 
let pr> : R’ — R’~* denote the projection onto the second factor. Choose a count- 
able subset {m; :i € I} of M which is dense with respect to the relative topology 
induced from N. According to Proposition 1.6.7, for every i, there exists an open 
subset Q; of Q and a local chart (W;, 0;) on P such that w(m;) € g(Q;) C Wi, the 
image 0; (¢(Q;)) is an open subset of the subspace R* x {0} C R” and the local chart 
(Qi, 0; © gy Q;) 1s compatible with the C k_structure on Q. Given (Wj, oj), choose a 
local chart (V;, 0;) on N at m; such that y(V;) C W; and define 


Mi=¥'(9Q))OVi, — Wi= Po Vo:,; 2 01(Vi) > R. 
Then, m; € M; and p;(M;) = wv, '(0). We check that w; is a submersion at x for 


all x € p;(M;): denote m := p, |(x) and let g € Q be such that g(q) = w(m). By 
transversality, 


R® x {0} + i)y¢my © Vin TmN) = (1)'y my (Gq (Tq Q) + Vin (Pm N)) = RY 


‘That is, the codimension of M in N equals the codimension of Q in P. 
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and hence 


(Wix(R") = pry (Ci) ym) 2 Vin TmN)) =R™, 


as asserted. Thus, the Level Set Theorem 1.2.1 implies that o;(M;) is an embedded 
C*-submanifold of pi(V;) of dimension n — r + 5. Then, M; is an embedded sub- 
manifold of N of the same dimension and we can modify (V;, o;) (and hence M;) 
in such a way that 


pi(Mi) = pi(Vi) N (R"-"** x {0}). (1.8.1) 


1. In order that we can apply Proposition 1.7.1, it remains to check that for all 
iy A iz, pi, (Mi, M;,) is open in R"-'tS x {0}. For that purpose, it suffices to show 
that M;, 1 Mi, is open in M;, with respect to the relative topology induced from NV. 
Since @ is injective, 


Mi, O Mi, =! (9(Qi, N Qin)) A Vi A Vin. 


Since Q;, 1 Qj, is open in Q;, and since P1Oi, is an embedding, g(Q;,  Qi,) 
is open in g(Q;,) with respect to the relative topology induced from P. Thus, 
9(Qi, NM Qi.) = e(Qi,) ON W for some open W C P and we obtain 


Mi, 9 Mi, = 0! (9(0i,)) NW (W) 0 Vi, Vig = Mi, NY (WY Vin, 


which is an open subset of M;,, indeed. Now, Proposition 1.7.1 yields the desired 
C*-structure on M. To determine the tangent spaces, let m € M; and x = p;(m). 
According to the Level Set Theorem 1.2.1, Tx (0; (Mi)) = ker(W),. Hence, 


TmM;j = ker(pry 0 oj 0 WhV;))n- 


Since ker(pr, 0 Oi), (m) = Qq1q Q, the assertion follows. 

2. If (Q, @) is initial, according to Proposition 1.7.2, the subsets Q; and W; 
can be chosen so that y(Q;) is a C k_arcwise connected component of W; N ¢(Q) 
relative to P. We show that, then, M; is a C*-arcwise connected component of 
V; OM relative to N. In view of Proposition 1.7.2, this yields the assertion. Thus, let 
m € V;M be such that there exists a piecewise C‘-curve y in N which is contained 
in V; M M and joins m to a point of M;. We have to show that m € M;. Now, Woy 
is a piecewise C*-curve in P which is contained in W(V; AM) C Wi 1 @(Q) and 
joins y(m) to a point in w(M;) C g(Q;). Since g(Q;) is a C*-arcwise connected 
component of W; 1 ¢(Q) relative to P, it follows that wo y lies in g(Q;) and hence 
y lies in Vi N w—!(p(Q;)) = M,. Thus, m € Mj, as asserted. 

3. If g is an embedding, the subsets Q; and W; can be chosen so that g(Q;) = 
g(Q) W;. This implies Mj = M1 V;. Then, (1.8.1) and Proposition 1.7.3 yield 
the assertion. 


Since single points are embedded submanifolds of dimension zero and a mapping 
N — P is transversal to a point of P if and only if this point is a regular value, 
Theorem 1.8.2 implies 
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Corollary 1.8.3 (Level Set Theorem for manifolds) Let N and P be C*-manifolds, 
letwe C*(N, P) and let c € P._ be a regular value of wy such that M := wc) is 
nonempty. Let .: M — N denote the inclusion mapping. Then (M, t) is an embed- 
ded C*-submanifold of N of dimension dim N — dim P and for every m € M there 
holds ,,(TImM) = ker y;,. 


Corollary 1.8.3 can also be proved directly by means of the Level Set Theo- 
rem 1.2.1 for R” and Proposition 1.7.3 in the formulation of Remark 1.7.4 (Exer- 
cise 1.8.1). 


Remark 1.8.4 Given a C*-manifold N of dimension n and an embedded C*- 
submanifold (M, v) of N of dimension J, there exists a C‘-mapping f : N > R"~! 
such that p(M) = f—!(0). For the proof, without loss of generality, one may as- 
sume that M C WN and that ¢ is the natural inclusion mapping. According to Propo- 
sition 1.7.3, there exists a family of local charts {(V;, 0;) :i € J} on N such that 
the V; cover M and p;j(M 1 V,;) = 9;(V;) NR’ x {0}. Complement this family to 
an atlas on N by adding local charts whose domains do not intersect M and whose 
image does not intersect the subspace {0} x R”~!. According to Proposition 1.3.7, 
there exists a partition of unity {g; : j € J} of class C * subordinate to the open cov- 
ering {V; :i ¢ J} of N. Then, for each j, there is ani such that supp(g;) C Vi. By 
choosing one such i for each j and dropping all the other local charts we obtain a 
new atlas whose local charts we denote by (V;, 0;). Define 


fpVjoR, f= ((e),....(0%))- 


By extending f;g; by 0 to N one obtains a family of C*-mappings 7 :N>R™, 
Then, f := >> jes fi has the desired properties. 


Corollary 1.8.5 (Intersection of transversal submanifolds) Let N be a C‘-manifold 
and let (M,,@¢,) and (M2, $2) be transversal C*-submanifolds of N. Let M := 
¢1(M1) 1M @2(M2) be nonempty and let 1: M — N denote the natural inclusion 


mapping. 
1. There exists a Ck-structure on M such that (M, 1) is a C‘-submanifold of N of 
dimension dim M, + dim M2 — dim N. For everyme M, 
TinM = ((Y1)in, Tm, M1)) 9 (G2) ing (Tg M2), 


where m; € M; such that m = g;(m;), i = 1, 2. 
2. If both (M,, ¢1) and (M2, $2) are initial, so is (M, 1). 
3. If both (Mj, ¢1) and (M2, $2) are embedded, so is (M, 1). 


If (M1, ¢1) and (Mo, ¢2) are given by subsets, one may write 


M=M,0M) and TyM=TyM,{OTmMo,me M, 
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where T,,,M;, i = 1, 2, is viewed as a subspace of T,,, NV, defined by the C k structure 
on M;. 


Proof \. Application of Theorem 1.8.2/1 to the mapping an M, — N and the 
submanifold (M2, g2) of N yields a C k_structure on M := Q, "(@(M2)) such that 
M, together with the natural inclusion mapping 7: M > Mj, is a C*-submanifold 
of M, of dimension dim M, — dim N + dim M. According t to Proposition 1.6.14/1, 

then (M, gi, 0 ot) is a Ck-submanifold of N. Since gy, 0 i(M) = M, 9 01 induces 
a bijection @ : M —> M. We use this bijection to transport the C*-structure of M 
to M. Then, = g olog —! and Q lisa diffeomorphism, hence (M, t) is a ct. 

submanifold of N of the same dimension as M. To determine the tangent spaces, let 
m € M and m; € M; such that g;(m;) =m. Theorem 1.8.2/1 yields 


Ty (Tm, M) = { Xm, © Tiny M in, Xm) © G2)iny (Tm M2)}.- 


Apply (¢1);,, to both sides of this equality and use ¢/, F (Tiny M ) =T,,M to obtain 
the assertion. 

2. and 3. According to Theorem 1.8.2/2, since (M2, @2) is initial, so is (M,i). 
Since (M1, ¢1) is initial, Proposition 1.6.14/1 yields that (M, ¢ 01) is initial. Then, 
so is (M, 1). A similar argument proves assertion 3. 


Exercises 

1.8.1 Prove the Level Set Theorem for manifolds (Theorem 1.8.3) directly by 
means of the Level Set Theorem for R” (Theorem 1.2.1) and Proposition 1.7.3 
in the formulation of Remark 1.7.4. 


Chapter 2 
Vector Bundles 


Vector bundles constitute a special class of manifolds, which is of great importance 
in physics. In particular, all sorts of tensor fields occurring in physical models may 
be viewed in a coordinate-free manner as sections of certain vector bundles. We start 
with the observation that the tangent spaces of a manifold combine in a natural way 
into a bundle, which is called tangent bundle. Next, by taking its typical properties as 
axioms, we atrive at the general notion of vector bundle. In Sect. 2.2, we discuss el- 
ementary aspects of this notion, including the proof that—up to isomorphy—vector 
bundles are completely determined by families of transition functions. In Sect. 2.3 
we discuss sections and frames,! and in Sect. 2.4 we present the tool kit for vec- 
tor bundle operations. We will see that, given some vector bundles over the same 
base manifold, by applying fibrewise the standard algebraic operations of taking the 
dual vector space, of building the direct sum and of taking the tensor product, we 
obtain a universal construction recipe for building new vector bundles. In Sect. 2.5, 
by applying these operations to the tangent bundle of a manifold, we get the whole 
variety of tensor bundles over this manifold. The remaining two sections contain fur- 
ther operations, which will be frequently used in this book. In Sect. 2.6, we discuss 
the notion of induced bundle and Sect. 2.7 is devoted to subbundles and quotient 
bundles. There is a variety of special cases occurring in applications: regular distri- 
butions, kernel and image bundles, annihilators, normal and conormal bundles. 


2.1 The Tangent Bundle 


Let M be a Ck-manifold, let J C R be an open interval and let y : J > M bea 
C*-curve. According to Example 1.5.6, for every t € /, the tangent vector y(t) of y 
at t is an element of the tangent space T,,.;)M. Hence, while f runs through /, y(t) 
runs through the tangent spaces along y, see Fig. 2.1. 


‘Here, as well as in Sect. 2.5, in order to keep in touch with the physics literature, the local de- 
scription is presented in some detail. In particular, we discuss transformation properties. This way, 
we make contact with classical tensor analysis. 
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Fig. 2.1 Tangent vectors along a curve y in M 


To follow the tangent vectors along y it is convenient to consider the totality of 
all tangent spaces of M. This leads to the notion of tangent bundle of a manifold M, 
denoted by TM. As a set, TM is given by the disjoint union of the tangent spaces at 
all points of M, that is, 


TM := |_| TM. (2.1.1) 
meM 


Let x : TM — M be the canonical projection which assigns to an element of T,, 
the point m for every m € M. TM can be equipped with a manifold structure as 
follows. Denote n = dim M. Choose a countable atlas {(Ug, Ky): a € A} on M and 
define the mappings 


«1:2 —1(Uy) > R” x R", Ka (Xm) = (kan), X@). (2.1.2) 


The image of x} is given by k4(Uq) x R” and is hence open in R” x R”. Using 
(1.4.9), for the transition mappings we obtain 


KB 0 (kd) ' (x, X) = (kp omy !(X), (kg 0 ky !)'(x)-X), @A3) 


where (x, X) € Ky(Ug N Ug) x R”. Since ky o ka! is of class C*, the transition 


mappings are of class C‘—!. Finally, it is obvious that the subsets 7~!(Uy) cover 
TM. Thus, according to Remark 1.1.10, the family of bijections {(—!(Uy), Ki) : 
a € A} defines a differentiable structure of class C k-I and dimension 2n on TM ; 
which has the following properties. First, due to (2.1.3), it is independent of the 
choice of an atlas on M used to construct it. Second, the local representative of the 
projection z : TM — M with respect to the charts rod and Ky is given by the natural 
projection pr, to the first factor in Ky(Uq) x IR”. Hence, z is a submersion of class 
C*—!. Third, the charts ran identify the open submanifolds 7~'(U,) of TM with 
direct products of an open subset of M with a copy of R”. Under this identification, 
both the natural projection and the vector space structure on every tangent space 
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TnM, m € Uy, is preserved. To formalize this, for every a € A, define a mapping 
Xa 207 '(Uy) > Ug x R", Xa (Xm) = (m, XK). (2.1.4) 


Then fet = (Ky X idgn) o Xq. In particular, the local representative of x. with respect 
to the global charts ra on 2~!(Uy) and Ky X idpn on Uy x R” is given by the 
identical mapping of R” x R”, restricted to the open subset ky(Uq) x R”. Hence, 
Xe is aC k—1_diffeomorphism. Moreover, pry ° X¥y = Tl h-!(Uy) and the restrictions 
Xa}T,M are vector space isomorphisms for all m € Uy. Let us summarize. 
Proposition 2.1.1 Let M be a C*-manifold of dimension n and let TM be defined 
by (2.1.1). There exists a unique C’~!-structure on TM such that for every local 
chart (U, x) on M, the mapping x! :1~!(U) > R" x R", defined by (2.1.2), is a 
local chart on TM. With respect to this structure, TM has dimension 2n and the 
following holds. 


1. The natural projection x :TM — M is a surjective submersion. 

2. There exists an open covering {Uy} of M and an associated family of diffeomor- 
phisms xq :1~'(Ugy) > Ug x R" such that 
(a) the following diagram commutes, 


pry 


(b) for every m € Ug, the induced mapping pty © Xa}T,nM ? TmM —> R" is a 
vector space isomorphism. 


Definition 2.1.2 The triple (TM, M, z) is called the tangent bundle of M. TM is 
called the total space or the bundle manifold, M the base manifold and z the natural 
projection. Form € M, xT! (m) = TM is called the fibre over m. The vector space 
R” is called the typical fibre and the pairs (Ug, Xq) are called local trivializations of 
TM over Ug. 


By an abuse of notation, the tangent bundle will usually be denoted by TM. 


Example 2.1.3 Let M = S! be realized as the unit circle in R?. For every x € S!, the 
tangent space TS! can be identified with the subspace of vectors orthogonal to x. 
This yields a bijection ® from TS! onto the subset 


T = {(x,X) €S' x R*:x 1X} 
of R*. This is the level set of the smooth mapping 


F :R* > R?, F (x, X) := ((Ixil?,x- X) 
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at the regular value c = (1,0). Hence, it carries a smooth structure. One can 
check that ® is a diffeomorphism with respect to this structure. (To see this, let 
pry: IR? — R denote the natural projection to the k-th component and choose the 
charts on S! and T to be restrictions of pr, and pr; x pr;, respectively, k = 1, 2.) 
Thus, TS! can naturally be identified with 7. The construction carries over to 
higher-dimensional spheres: as a manifold, the tangent bundle TS” can be identified 
with the subset {(x, X) € S” x R’*+! : x _L X} of R?@+) which is the level set of a 
function similar to F at the regular value c = (1, 0), see also Remark 2.1.4/2 below. 


Remark 2.1.4 


1. Let V be a finite-dimensional real vector space and let M be an open subset of V. 
The natural identifications of the tangent spaces T, M with V for all v € M, cf. 
Example 1.4.3/1, combine to a smooth diffeomorphism x : TM — M x V which 
is fibrewise linear. We will refer to x as the natural identification of TM with 
M x V. After choosing a basis in V, this bijection coincides with the (global) 
trivialization induced via (2.1.4) by the corresponding global chart on M. 

2. The construction of Example 2.1.3 generalizes to arbitrary level sets. Let V, 
W be finite-dimensional real vector spaces and let M be the level set of a C*- 
mapping f : V > W at a regular value c € W. Identifying the tangent space 
TyM with ker f’(v) for all v € M, see Remark 1.2.2/1, we obtain a bijection ® 
from TM onto the subset 


T={(v,X)EeMxV: f'(v)X =0} 
of V x V. This is the level set of the C‘~!-mapping 
F:VxV>WxW, F(v, X):=(f(v), f’(v)X) 


at the value (c,0), whose regularity follows from that of c with respect to /. 
It follows that T is an embedded C'—!-submanifold of V x V and that @ is a 
C*~1_diffeomorphism (Exercise 2.1.1). Thus, the tangent bundle of a level set in 
V can be naturally identified with a level set in V x V. 


Just as the tangent spaces of a manifold combine to the tangent bundle, the tan- 
gent mappings of a differentiable mapping combine to a mapping of the tangent 
bundles. 


Definition 2.1.5 (Tangent mapping) Let M, N be C‘-manifolds and let ® : M > N 
be a C‘-mapping. The tangent mapping of @ is defined by 


g’ :TM > TN, ®'(Xm) = P,, (Xm). 


The tangent mapping is of class Ck! (Exercise 2.1.6). The basic properties of 
the tangent mapping are stated in the next section (Proposition 2.2.9). 
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Exercises 

2.1.1 Prove that the mapping ® of Remark 2.1.4/2 is a diffeomorphism. 
Hint. As local charts on M, use those constructed in the proof of the Level 
Set Theorem 1.2.1. 

2.1.2 Determine the tangent bundle in the form of the level set T of Remark 2.1.4/2 
for 
(a) the spheres S”, see Example 1.2.3, 
(b) the hyperboloid of Example 1.2.4, 
(c) the paraboloid, the ellipsoid and the rotational torus of Exercise 1.2.5, 
(d) the classical groups, see Example 1.2.6. 
Compare your result for the spheres S” with Example 2.1.3. 

2.1.3 Let M be the level set of a differentiable mapping f : R” — R” at a regular 
value c € R” . Identify TM with the level set T of Remark 2.1.4/2. The bundle 
of unit tangent vectors of M is defined to be EM := {(x, X) € TM: ||X]| = 1}. 
Show that EM is an embedded submanifold of TM. What does one get for 
ES! and ES?? 

2.1.4 Let (U, «) be a local chart on M and let x" be the local chart induced by « on 
the tangent bundle TM via (2.1.2). Determine the local trivialization (2.1.4) 
of the tangent bundle T(TM) of TM induced by x". 

2.1.5 Iterate the construction of Remark 2.1.4/2 by determining the level set T for 
the tangent bundle T(T™) of the tangent bundle TM of a level set M. Write 
down the defining equations explicitly for M = S”. 

2.1.6 Let ®: M— N be of class C*. Show that &’ is of class Ck~!. 


2.2 Vector Bundles 


The notion of vector bundle arises from the notion of tangent bundle of a manifold 
by allowing the fibres to be arbitrary finite-dimensional vector spaces, rather than 
the tangent spaces of that manifold. 


Definition 2.2.1 (Vector bundle) Let K = R or C and let k > 0. A K-vector bundle 
of class C* is a triple (E, M, 77), where E and M are C*-manifolds and x: E—> M 
is a surjective C‘-mapping satisfying the following conditions. 


1. For every me M, En := x! (m) carries the structure of a vector space over K. 
2. There exists a finite-dimensional vector space F' over K, an open covering {Uy} 
of M and an associated family of C k diffeomorphisms Xa: x} (Uy) > Uy xX F 
such that, for all a, 
(a) the following diagram commutes, 


Xa 
a (0g) ——_$_._ == 0 < F 


s— * 
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(b) for every m € Ug, the induced mapping Xa,m ‘= Plo ° XatE 
linear. 


: Em > F is 


m 


Like in the case of the tangent bundle, by an abuse of notation, a vector bundle 
(E, M,z) will usually be denoted by E alone. Like for the tangent bundle, E is 
called the total space or the bundle manifold, M the base manifold, z the bundle 
projection and F the typical fibre. For m € M, Ej, is called the fibre over m and 
m is called the base point. The pairs (Ug, Xq) are called local trivializations. A lo- 
cal trivialization (U, x) with U = M is called a global trivialization. If a global 
trivialization exists, the vector bundle is called (globally) trivial. 


Remark 2.2.2 


1. By condition 2a, since the xq are diffeomorphisms, the bundle projection z is 
a submersion (because so is the projection to a factor of a direct product) and 
the fibres E,,, are embedded submanifolds (because by x, they are mapped onto 
the subsets {m} x F of Uy x F’). Being bijective and linear, the mappings X,m 
are vector space isomorphisms. Hence, all fibres have the same dimension as 
F:; this number is called the dimension or the rank of the vector bundle. Thus, 
dim E = dim M + dim F for K=R, and dim E = dim M + 2dim F for K=C. 
For a K-vector bundle of dimension n, one can always choose F = K”. 

2. Let A denote the index set of a family of local trivializations {(Ug, xa)}. The 
mappings 


XB °Xq 1: Ua NUg x F > UgNUgx F, (a, B)€Ax A, (2.2.2) 


which are of class C*, are called the transition mappings of the system of lo- 
cal trivializations {(Uq, Xv): @ € A}. Since for every (a, 8) € A x A, xg 0 xa! 
maps the subsets {m} x F, where m € Ug M Ug, linearly and bijectively onto 
themselves, there exists a mapping ~gq : Ug 1 Ug > GL(F) such that 


XB Xq (m,u) = (m, ppa(m)u) (2.2.3) 


for all m € Ug 1 Ug and u € F. The mappings pzq are called the transition 
functions of the system of local trivializations {(Uq, x,) : a € A}. To see that 
they are of class C* it suffices to check that for every (a, 8) € Ax Aandué F, 
the mapping Uy 1 Ug — F defined by mt> (m, pga (m)u) is of class C*. This 
follows at once from the differentiability of the transition mappings xg 0 xj I. 
One can check that the transition functions satisfy 


Pye () Ppa(m) = Pya(m) (2.2.4) 


for alla, B, y ¢ Aandm€ UygN Ug Uy. 
3. A vector bundle is said to be orientable if there exists a family of local trivializa- 
tions whose transition mappings have positive determinant. 
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Example 2.2.3 


1. Let M be a C/-manifold, let F be a vector space of dimension r over K and let 
pty: M x F — M denote the natural projection to the first component. Then, 
(M x F, M, pry) is a K-vector bundle of class C k and dimension r. It is called 
the product vector bundle of M and F. A product vector bundle is obviously 
trivial. 

2. According to Proposition 2.1.1, the tangent bundle of an n-dimensional C*- 
manifold is an n-dimensional real vector bundle of class C'!. 

3. Let (E, M,7) be a vector bundle of class C* and let U C M be open. Define 
Ey:= mx '(U). This is an open subset of E and hence a C*-manifold. By re- 
striction, x induces a surjective C-mapping zy : Ey — U, and a system of 
local trivializations {(Uq, Xa)} of E induces the system of local trivializations 
{Ua OU, Xatuynu)} of Eu. Thus, (Ey, U, my) is a K-vector bundle of class 
C*. It has the same dimension as E. 


Example 2.2.4 (Mobius strip) Let M = S! be realized as the unit circle in C and let 
E be the Mobius strip of Example 1.1.12 with the open interval (—1, 1) replaced by 
the whole of R. That is, E := R?/~, where (81, t) ~ (52, f2) iff there exists k € Z 
such that s7 = s} + 27k and ft) = (—1)*t,. Define the projection by 


x: ES, n([(s, t)]) =e), 


Using the local charts on E constructed in Example 1.1.12 one can easily check that 
z is smooth. The fibres are E,is = a—'(el’) = {[(s, t)] : t € R}. For every s ER, 
define 


AL(s, 1) + [(s, tr) = [(s, At) + to), A,t1,2€R. 


This way, the fibres become real vector spaces of dimension one. To construct local 
trivializations, we choose U := S! \ {+1} and define mappings 


x4: 07'(Us) > Ut xR, x+([ts,0]) := (e*, 2), 


where in case of x4 and x_ the representative (s, t) of [(s, t)] used to compute the 
right hand side is chosen from ]0, 27[ x R and from ]—z, z[ x R, respectively. We 
leave it to the reader to check that the x+ are diffeomorphisms and satisfy condi- 
tions 2a and 2b of Definition 2.2.1. Thus, (EL, M, 77) is a smooth real vector bundle 
of dimension 1. Figure 2.2 shows E together with the product vector bundle S! x R. 
It is quite obvious that F is not trivial. We will be able to give a precise argument 
for that in the next section. 


Remark 2.2.5 Let M be a C*-manifold, let E be a set and let 7: E > M be 
a surjective mapping such that conditions | and 2 of Definition 2.2.1 are satis- 
fied, however, with the following difference. Instead of assuming the xq to be C ke 
diffeomorphisms, assume that they are bijective and that their transition mappings 
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Fig. 2.2. The product vector 
bundle S! x R and the 
M6bius strip as a vector 
bundle over S! 


(2.2.2) are of class C*. Since M is second countable, the open covering {U,} con- 
tains a countable subcovering. According to Remark 1.1.10, the corresponding sub- 
family of the family {x} defines a Cé-structure on E. With respect to this structure, 
(E, M, 1) isa K-vector bundle of class C* and the (Uy, Xqw) are local trivializations. 
Conversely, if (EZ, M, 7c) is a vector bundle of class C Ke then the C*-structure on E 
induced in this way coincides with the original one (Exercise 2.2.1). 


Next, we consider mappings of vector bundles. 


Definition 2.2.6 (Vector bundle morphism) Let (£), M1, 71) and (E2, M2, 12) be 
K-vector bundles of class C’. A C*-mapping ® : E, > E? is called a morphism if 
for every m, € M, there exists m2 € M2 such that 


1. P(E 1m) Cc Edm, 
2. the induced mapping ®y,, := DE : E1 im, > E2,m, is linear. 


The rank of @ is defined to be the integer-valued function which assigns to m1 € 
M, the rank of the linear mapping ®,,,,. Incase Mj = M2 = M, @ is called a vertical 
morphism or a morphism over M if conditions | and 2 hold with mj =m2=m. 


As usual, together with the notion of morphism there comes the notion of iso- 
morphism (a bijective morphism whose inverse is also a morphism), endomorphism 
(a morphism of a vector bundle to itself), automorphism (an isomorphism of a vec- 
tor bundle onto itself). For a vector bundle morphism @ to be an isomorphism it is 
obviously sufficient for ® to be a diffeomorphism. If @ is vertical, it is sufficient 
for ® to be bijective, because then the tangent mapping ®’ is bijective at any point 
and Theorem 1.5.7 yields that the inverse mapping is of class C*. 


Remark 2.2.7 


1. Since @ is a mapping, condition | implies that the point m2 is uniquely de- 
termined by m,. Thus, every morphism ® induces a mapping g: M; > Mo, 
defined by 

gom=m0®. 


One says that ® covers ¢ and calls gy the projection of ®. If ® is of class C¥, 
so is ~. Indeed, if (Uj, x1) is a local trivialization of EF), g;y, coincides with 
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the composition of the embedding U; — U, x {0} C U; x F, with the mapping 
Tw20Po ae In case M; = M2 = M, @ is a vertical morphism iff g = idy. 
2. Let (Uj, x;) be local trivializations of E;, i = 1,2. The mapping 


x20 Pox, ': (Ui Ng |(U2)) x Fi > Wx Fh (2.2.5) 


is called the local representative of ® with respect to (Uj, x1) and (U2, x2). 
A fibre-preserving and fibrewise linear mapping ® : FE; — E> is a morphism iff 
all of its local representatives are of class C*. 

3. Let E}, Ex be K-vector bundles over M of class C*. For 4 € K and vertical 
morphisms ®, YW: E; — E2 we can define 


AS+W)(x):=AGP(x)+ V(x), x EF, 


because for all x € E;, ®(x) and W(x) belong to the same fibre of Ez. This 
provides a K-vector space structure on the set of vertical morphisms from Ey 
to E>. 


Example 2.2.8 A local trivialization (U, x) of a K-vector bundle (EF, M, 7) with 
typical fibre F is a vertical isomorphism from the vector bundle Ey, see Exam- 
ple 2.2.3/3, onto the product vector bundle U x F’. Accordingly, a global trivializa- 
tion is a vertical isomorphism from E onto M x F. Thus, a vector bundle is trivial 
iff it is isomorphic to a product vector bundle. 


Probably the most important example of a vector bundle morphism is the tan- 
gent mapping. The reader may convince himself that Proposition 1.5.2 implies the 
following (Exercise 2.2.4). 


Proposition 2.2.9 (Properties of the tangent mapping) Let M and N be Ck- 
manifolds and let gp € CK(M, N). The tangent mapping y' : TM —> TN has the 
following properties. 


1. g! is a vector bundle morphism of class C‘~! with projection 9. 

2. (idy)’ =idry. 

3. If P is another Ck-manifold and wy € C*(N, P), then (yoo) =W' og’. 
4. If v is a diffeomorphism, then g’ is an isomorphism and (g')~! = (g7!)’. 


Remark 2.2.10 (Partial derivatives and product rule) Let M,,M2,N be C _ 
manifolds and let g € Ck(M, x M>, N). We discuss the properties of the tangent 
mapping gy’ which are related to the direct product structure of its domain. Proofs 
are left to the reader (Exercise 2.2.5). The induced partial mappings 


Qm,:M,—> N, Om, (M1) = g(m1,m2), m2 € M2, 
Pm, :M2.—> N, @m, (m2) = (m1,m2), m, eM, 
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are of class C*. Their tangent mappings combine to C'—!-mappings 


TM, x M2 > TN, (X1,m2) > (Ym>)' (X1), 
M, x TM, > TN, (m1, X2) > (@m,)' (X2), 


called the partial derivatives of g. They fulfil the product rule, 
Glin amy) (X15 X2) = mg)! (X1) + Wim) (Xa). mi E Mi, Xi ET, Mi 2.2.6) 


If M,; = My = M and if @ is composed with the diagonal mapping 
A:M-— M x M, then 


(p 0 A), (X) = (Pim) (X) + (G2,m)(X),  meM, X ET nM. (2.2.7) 


In particular, if M = J is some open interval, then g o A, @;, and g;, are C k curves 
in N. For the corresponding tangent vectors at t € J there holds 


d d d 
— g(s,s)=— g(s,th+— olt,s), tel. (2.2.8) 
ds f, ds f, ds}, 


To conclude this section, we show that—up to isomorphy—vector bundles are 
completely determined by the family of transition functions associated with a sys- 
tem of local trivializations. 


Theorem 2.2.11 (Reconstruction theorem) Let M be a C k manifold. Assume that 
the following data are given: 


1. a finite-dimensional vector space F over K, 

2. an open covering {U,: a € A} of M, 

3. a family of C*-mappings Ppa : Uy 1 Ug > GL(F), (a, B) € A x A, satisfying 
(2.2.4). 


Then, there exists a K-vector bundle E over M of class C* and a family of local 
trivializations {(Uq, Xa): @ € A} of E whose transition functions are given by the 
functions pga. E is uniquely determined up to vertical isomorphisms. 


In particular, the last assertion implies that if the pyg are the transition functions 
of a vector bundle, then the vector bundle provided by Theorem 2.2.11 is isomorphic 
over M to the original one. 


Proof First, we prove existence. Since M is second countable, the covering {Ug : 
a € A} contains a countable subcovering. Hence, for the following construction 
we may assume that A is countable. Moreover, we notice that (2.2.4) implies that 
Paw = 1 for all a € A. Take the topological direct sum 


B= |_| UyseF, 


acA 
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denote its elements by (a@,m,u), where a € A, m € Uy, and u € F, and define a 
relation on 2 by (a1, mM\, uy) ad (a2, m2, u2) iff mm, =m2 and U2 = Para, (m1)uy. 
Due to Pag = 1 and (2.2.4), this is an equivalence relation. Let E denote the set 
of equivalence classes. The mapping 2: E —> M, given by z[(a,m,u)] :=™m, is 
well-defined and surjective. To construct a vector space structure on 2~!(m) for 
every m € M, choose a such that m € Uy. Every class in 1~!(m) has a unique 
representative of the form (a, m, u) with u € F. Using this, we transport the linear 
structure from F to x7! (m), 


A[(a,m, u1)| + [(a,m, u2)| = [(a,m, Au} +u2)], uj,u2EF, AEK. 


By linearity of the mappings pga (mm): F — F, this definition does not depend on 
the choice of a. The natural injections Uy x F > 2 induce mappings Uy x F > 
E. Due to pag = 1, these mappings are injective and hence induce bijective map- 
pings Xe: a~!(Uy) +> Uy x F. A brief computation shows that the transition 
mappings of the family of bijections {xq : a € A} are given by (2.2.3). Therefore, 
they are of class C* and hence define a C*-structure on E with respect to which 
(E, M,7z) is a K-vector bundle of class C kK and the Xa are local trivializations, 
see Remark 2.2.5. To prove uniqueness up to vertical isomorphisms, let E bea K- 
vector bundle over M of class C* with projection 7 and let xq : it~! (Ug) > Uy x F 
be local trivializations whose transition functions coincide with the pga. Then, on 
a '(Ugn Ug) C E we have <.: Xe = Kp © xg and on atl Ug) Cc E there 
holds i 0X = in o Xg. Hence, the mappings ie © Xy and a 0Xqg, HEA, 
combine to mappings E —> E and E> E, respectively, which are morphisms and 
inverse to one another. 


Remark 2.2.12 


1. Given two finite-dimensional vector spaces F;, Fz and two open coverings 
{Uig; :ai € Ai}, i= 1,2, of M with associated systems of C*-mappings 


Pi,Bi0; > Via; 1 Ui,g,; > GLU), (ai, Bi) € Ai x Ai, 


there arises the question under which conditions the vector bundles E; and Eo, 
defined by these data according to Theorem 2.2.11, are isomorphic over M. The 
answer is as follows. First, F; and F2 have to be isomorphic so that they can 
be replaced by K” for some r € N. Second, there exists a common refinement 
{Uq : a € A} of the open coverings {Uj,q; : a; € Ai}, i = 1, 2. By restriction, the 
/i,f;o; induce mappings 


pi,pa Uy Ug > GL(r,K), (a, B)€ Ax A, i= 1,2. 


Now, E and E> are isomorphic iff there exists a system of C‘-mappings py : 
Uy — GL(r, K), a € A, such that 


P2, Ba (m) = pz (m) *Pi,pa(M™) > Palm), meUg Uz. (2.2.9) 
The proof is left to the reader (Exercise 2.2.6). 
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An open covering {Uy : a € A} together with an associated family of C*- 
mappings pga : Ue M Ug > GL(r, K), (a, B) € A x A, with the property (2.2.4) 
is called a 1-cocycle on M with values in the structure group GL(r, K). Two 
1-cocycles are called cohomologous if there exists a system of C<-mappings 
Pu: Uy > GL(7, K), a € A, such that (2.2.9) holds. To be cohomologous is an 
equivalence relation in the set of 1-cocycles. Passage to equivalence classes, that 
is, cohomology classes of 1-cocycles, yields a cohomology theory on M which 
is called the first Cech cohomology of M with values in the structure group 
GL(r, KK). According to point 1, the cohomology classes correspond bijectively 
to the isomorphism classes of K-vector bundles over M of class C* and dimen- 
sion r. 


. One can show that the first Cech cohomology of M and, correspondingly, the set 


of isomorphism classes of vector bundles over M do not depend on the degree of 
differentiability k, see [130, Ch. 4, Thm. 3.5]. 


Exercises 


2.2.1 Let(E, M,7)beaC k_vector bundle. Consider the C*-structure on E induced 


by a system of local trivializations via the method of Remark 2.2.5. Show that 
this structure coincides with the original C*-structure. 


2.2.2 Let M be a C*-manifold. Use the system of bijections (2.1.4) associated 


with an atlas on M to construct a C*-structure on TM via the method of 
Remark 2.2.5. 


2.2.3 Construct a smooth structure on the Mobius strip by means of the method 


of Remark 2.2.5, using the local trivializations (U4, x+) of Example 2.2.4. 
Show that this structure coincides with the one constructed in Example 1.1.12. 


2.2.4 Prove Proposition 2.2.9. 
2.2.5 Prove the assertions about partial derivatives stated in Remark 2.2.10. 
2.2.6 Prove the criterion for the isomorphy of two vector bundles over M stated in 


Remark 2.2.12/1. 


2.2.7 Let (E1, 7, Mj) and (£2, 12, M2) be K-vector bundles of class C* and of di- 


mensions r; and r2. Define E := E, x Er, M:= M, x Mo anda :=7| xm: 
Ey x Ey > M, x Mp. For (mj), m2) € M, x Mp, equip Eqn, m3) -= a —'(m) = 
Em, X E2,m, with the linear structure of the direct sum Fm, ® E2,m,. Show 
that (E, M, 7) is a K-vector bundle of class C* and dimension rj +12. It is 
called the direct product of (£1, 771, Mj) and (E2, 12, M2). 


2.2.8 Let M; and M2 be C*-manifolds. Let pr; : M@, x Mz — M; denote the natural 


projections to the factors. Show that the following mapping is a vertical vector 
bundle isomorphism: 


® :T(M, x Mo) > TM, x TM, ®(X) := (pr (X), pr (X)). 


2.3 Sections and Frames 65 


2.3 Sections and Frames 


The notion of section generalizes the concept of a function on a manifold with values 
in a finite-dimensional vector space. 


Definition 2.3.1 (Section) Let (E, M, 7) be a K-vector bundle of class C*. A sec- 
tion (or cross section) of (FE, M,z) isa C*-mapping s:M-— E such that 7 os= 
idy. 


A local section of (EF, M,z) over an open subset U C M is a section of the 
vector bundle (Ey, U, zy). 


Remark 2.3.2 


1. Let s be a section and let (U, x) be a local trivialization of the K-vector bundle 
(E, M, 7) of class C. The mapping 


prroxosty:U>F (2.3.1) 


is called the local representative of s with respect to (U, x). Since local trivial- 
izations are diffeomorphisms, a mapping s: M — E satisfying m os = idy is of 
class C¥ (and hence a section) iff so are all local representatives of s with respect 
to a system of local trivializations. 

2. Every vector bundle admits a distinguished section m +> Om, called the zero 
section. 

3. The set of all sections of a K-vector bundle (E, M, 7) of class C K is denoted by 
I’(E). It carries the structure of a real vector space and of a bimodule over the 
algebra C°(M), with all operations defined pointwise (Exercise 2.3.1). 

4. A local section need not be extendable to a global section, as is shown by the 
example of M=R, E=M xR, U=R, and s(x) =(4, 1), There holds, how- 
ever, the following weaker extension property. For every m € U, there exists an 
open neighbourhood V of m in U and a section § of E such that spy = Sty 
(Exercise 2.3.2). 


Example 2.3.3 


1. (Local) sections of the tangent bundle TM of a C k_ manifold M are called (lo- 
cal) vector fields on M. They will usually be denoted by X, Y,... and the vector 
space I’(TM) will be denoted by X(M). To be consistent with the previous no- 
tation X,, for a tangent vector at m € M, for the value of the vector field X at 
the point m we will often write X,, instead of X(m). Note that since TM is 
of class C*—!, so are vector fields. If (U,«) is a local chart on M, for every 
i=1,...,dimM, the mapping 

de :U > (TM), a* (m) = OF 
is a section of (TM)y = TU. Since the representative of this section with respect 
to the global trivialization of TU induced by x is given by the constant mapping 
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whose value is the i-th standard basis vector in R&™™, 0* is of class ce, 
Hence, ar is a local vector field on M. 

2. If the vector bundle E over M is given in terms of a K-vector space F’,, an open 
covering {U,} of M and a family of C*-mappings Ppa : Uy 1 Ug > GL(F) 
satisfying (2.2.4), then its sections of class C* correspond bijectively to fami- 
lies {sy} of C*-mappings Sq : Uy — F satisfying sy(m) = ~og(m)sg(m) for all 
a,B€Aandme Ug Ug. 


Remark 2.3.4 


1. Let V be a finite-dimensional real vector space and let M C V be an open subset. 
Via the natural identification of TM with M x V of Remark 2.1.4/1, vector fields 
X on M correspond bijectively to smooth mappings” X : M — V. By construc- 
tion, for all v e M and f € C™(M), we have 


d 
X(f)= a f(v+tX(v)). (2.3.2) 
0 


2. Let V and W be finite-dimensional real vector spaces and let M C V be the level 
set of a C'-mapping f : V > W ata regular value. Via the natural identification 
of TM with the embedded C*~!-submanifold {(v,X)€MxV:X eker(f’(v))} 
of V x V, see Remark 2.1.4/2, vector fields X on M correspond bijectively to 
Ck! mappings X:M — V satisfying X(v) € ker f’(v). 


By means of a local trivialization, sections are identified locally with the graphs 
of their local representatives. This implies 


Proposition 2.3.5 Let (E,M,z) be a K-vector bundle of class C* and let 
s €I'(E). Then, (M,s) is an embedded C*-submanifold of E. 


Proof Let m € M. According to Remark 1.6.13/3, we have to show that there exists 
an open neighbourhood V of s(m) in E such that (s-'(V), Sts-lvy) is an embed- 
ded C*-submanifold of E. Choose a local trivialization (U, x) of E at m and let 
V =x~!(U). Then, s~!(V) = U and hence we have to show that (U, Spy) is an em- 
bedded C*-submanifold of 2~!(U). Since x is a diffeomorphism and (U, x 0 s;y) 
is the graph of the local representative of s with respect to the local trivialization 
(U, x), the latter follows from Example 1.6.12/2. 


Now let (£,, Mj, 7) and (E2, M2, 72) be K-vector bundles of class C*, let 
@ : E; — Ez be a morphism and let gy : Mj — Mp be the projection of @. 


?Denoted by the same symbol. 
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Definition 2.3.6 (@-relation and transport operator) 


1. Sections s; € (£1) and sz € I'(E2) are said to be @-related if they satisfy 
Dos, =S20Q. 


2. If y is a C*-diffeomorphism, the following mapping is called the transport oper- 
ator of @: 


®,:(E\) > T(E), D,5:=Dosog!. (2.3.3) 


The following proposition lists the properties of the transport operator (Exer- 
cise 2.3.3). 


Proposition 2.3.7 Let (E,, M,, 7) and (E2, Mo, 12) be K-vector bundles of class 
C* and let ® : Ey > E> be a morphism whose projection y : M\ > Mz is a diffeo- 
morphism. The transport operator ®,, has the following properties. 


1. ®,, is linear. If ® is an isomorphism of vector bundles, ®, is an isomorphism of 
vector spaces and there holds (@-!), =(@,)7!. 

2. For every s €I'(E}), s is ®-related to ®,s. 

3. For every s € '(E;) and f € C*(Mj), there holds ®,( fs) = ((p~!)* f) ®,s. 

4. If WV : Ex > E3 is another morphism whose projection is a diffeomorphism, then 
(Wo), =Yo,. 


Remark 2.3.8 In the case of vector fields, it is common to speak of g-relation rather 
than g’-relation. Thus, X; € X(M;), i = 1, 2, are g-related iff 


gy’ oX, =X209. (2.3.4) 
Next, we turn to the discussion of (local) frames. 


Definition 2.3.9 (Local frame) Let (E, M,2) be a K-vector bundle of class C* 
and dimension /, let U C M be open and let 4 = {s1,..., 5,} be a system of local 
sections of E over U. & is said to be pointwise linearly independent if the system 
{s1(m), ..., 5,-(m)} is linearly independent in E,, for all m € U. In this case, Z is 
called a local r-frame (frame if r =/) in E over U. If U = M, Bis called a global 
r-frame (global frame if r = /). 


Local frames provide bases in the fibres over their domain and hence allow for 
the expansion of local sections. 


Proposition 2.3.10 Let (E, M,z) be a K-vector bundle of class Ck, let UCM 
be open and let {8}, ..., 51} be a local frame in E over U. The assignment of f's; 
(summation convention) to an |-tuple (f!,..., f!) of K-valued C*-functions on U 
defines a bijection from The ck, K) onto (Ey). 
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Proof Obviously, for every (f!,..., f’) € i C*(U,K), the sum f‘s; is a C*- 
section of Ey. Conversely, let s « (Ey). By expanding s(m) with respect to the 
basis {s(m),...,s;(m)} of Em for all m € U, we obtain functions f' : U > K 
satisfying spy = = Fle. For every m € U, (f!(m),..., f!(m)) is the unique solution 
of a system of linear equations whose coefficients depend differentiably of class Cé 
on m. Hence, f' eCk(U). 


Example 2.3.11 


1. Let M be a Ck ees of dimension n and let (U,«) be a local chart on M. 
Since {Of 7,5+++»9n,m} is a basis in T,,M for all m € U, the system {d7,..., 0} 
is a local fered in TM over U. Thus, over U, vector fields X € X(M) can be rep- 
resented as X}y = X‘a* with X' ¢ CK~!(U). According to (1.4.15) and (1.4.16), 
the coefficient functions X! are given by X'(m) = X(x'), where i =1,...,7. 

2. Let (E, M, 7) be a K-vector bundle of class C* with typical fibre F,, let (U, x) 
be a local trivialization and let {e;, ..., e,} be a linearly independent system in F’. 
Define local sections s; of E over U by 


si(m):=x~'(m,e;), i=1,...,r. (2.3.5) 


These sections are of class C*, because their local representatives with respect 
to (U, x) are the constant mappings m +> e;. Hence, the system {s1,...,5,} isa 
local r-frame in E over U. 


As the second example suggests, local frames are closely related to local trivial- 
izations. 


Proposition 2.3.12 Let (E,M,2) be a K-vector bundle of class C* with typical 
fibre F and let U C M be open. By virtue of (2.3.5), every basis of F defines a 
bijection between local trivializations x :m~'(U) — U x F and local frames in E 
over U. In particular, 


1. there exists a local trivialization of E over U iff there exists a local frame in E 
over U. 
2. E is trivial iff there exists a global frame. 


Proof Let {e1,..., e:} be a basis of F. That a local trivialization over U defines a 
local frame over U has been shown in Example 2.3.11/2. Conversely, for a given 
local frame {s;,..., 57} in E over U, expand x € Ey asx = x's; (a(x)) and define 


a mapping x :al\(U) > Ux F by x(x) := (a(x), x'e;), x € Ey. The mapping 
x is a bijection and satisfies conditions 2a and 2b of Definition 2.2.1. Thus, to show 
that x is a local trivialization, it remains to check that y and y~! are of class C* 
(Exercise 2.3.4). Finally, assertions | and 2 are obvious. 
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Example 2.3.13 


1. Let M be a C*-manifold of dimension n, let (U,«) be a local chart on M and 
let (U, x) be the local trivialization of TM induced by this chart via (2.1.4). 
The bijection between local frames over U and local trivializations over U, de- 
fined by the standard basis of R” via (2.3.5), assigns to (U, x) the local frame 
{OF esas Oe}. 

2. Consider the smooth real vector bundle E given by the Mobius strip, cf. Ex- 
ample 2.2.4. Since FE has dimension 1, a global frame in E is just a nowhere 
vanishing section. Since the base manifold is S!, sections of E correspond to 
closed smooth curves in E winding around exactly once.* Since any such curve 
must cross the zero section, E does not admit a global frame and is hence not 
globally trivial, cf. Proposition 2.3.12. 


Remark 2.3.14 Using the description of vector bundles in terms of coverings and 
transition functions as explained in Remark 2.2.12, one can show that, up to iso- 
morphy over S!, the Mébius strip and the product vector bundle S! x R are the only 
real vector bundles of dimension | over S!. 


The following proposition collects useful extension results. The proof is left to 
the reader (Exercise 2.3.5). 


Proposition 2.3.15 Let (E, M,2) be a K-vector bundle of class C k and dimension 
land letme M. 


1. Let {e,,...,e;} be a basis of Ej. There exists an open neighbourhood U of m 
and a local frame {s1,...,5,} over U such that s;(m) =e;,i=1,...,1. 

2. Let s1,..., 8; be local sections over neighbourhoods U,,...U; of m such that 
the system {s|(m),...,8-(m)} is linearly independent in Ej. Then, there ex- 
ists an open neighbourhood U C Ui -:--NU, of m such that the system 
{situ,--+5Srtu} is a local r-frame in E over U. 

3. Let {s1,...,8-} be a local r-frame over a neighbourhood U of m. Then, there 
exist local sections 8,41, ...,8, over V CU such that the system {s\;y,....Srtv, 
Sr+1,--+, 51} is a local frame over V. 


As an application, we briefly discuss manifolds whose tangent bundle is trivial. 


Definition 2.3.16 A C*-manifold is called parallelizable if its tangent bundle is 
trivial. 


According to Proposition 2.3.12, a differentiable manifold M of dimension n is 
parallelizable iff there exist n pointwise linearly independent vector fields on M. 


3And with tangent vectors being nowhere parallel to the fibres, but this is not relevant for the 
argument. 
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Proposition 2.3.17 The spheres S', S* and S! are parallelizable. 


Proof Since 8” is a level set of the smooth function f : Rt! + R, f(x) = |Ix\l’, 
we can use the natural representation of smooth vector fields on S” by smooth map- 
pings X :S” — R”*! satisfying x- X (x) = 0, cf. Example 2.1.3 and Remark 2.3.4/2. 
In the case of S! we identify R? with C via x = (x1,x2) X= x1 + ix2. Then, 
x-y= Re(&f) and vector fields on S! are represented by mappings X:S!' CC >C 
satisfying Re(zX (z)) = 0. This condition holds for example for X (z) := zi. Since 
this function is nowhere vanishing, the corresponding vector field is nowhere van- 
ishing and hence forms a frame in TS!. In the case of S*, we identify R* with the 
quaternions H via x X := x,1 4+ x2i + x3j + x4k. Then, x- y= Re(%¥), where 
X now denotes quaternionic conjugation, and vector fields on S* are represented 
by mappings X : S*? C H > H satisfying Re(qX (q)) = 0. For / = 1, 2,3, define 
X;:H— H by 


X1(q) = gi, X2(q) = qj, X3(q) = qk. 


Then, Re(qX7(q)) = 0 and Re(X7(q)X ;(q)) = 6);. Hence, the X7 restrict to vector 
fields on S? and these vector fields are pointwise linearly independent. In the case 
of S’, the proof is analogous, with quaternions replaced by octonions.* 


Remark 2.3.18 


1. Since TS! is isomorphic to the product vector bundle S! x R, one can rephrase 
Remark 2.3.14 as follows. Up to isomorphy over S!, the tangent bundle of S! 
and the Mobius strip are the only real vector bundles of dimension 1 over S!. 

2. The construction of pointwise linearly independent vector fields on the spheres 
S!, S° and S’ presented in the proof of Proposition 2.3.17 carries over to the 
unit spheres of C*, H* and O*, where O denotes the octonions. Thus, for r = 
2,4,8 and k= 1,2,... there exist r — 1 pointwise linearly independent vector 
fields on the sphere S’*~!. In case k = 1, these vector fields constitute a global 
frame, whereas in the other cases they constitute just a global (r — 1)-frame. 
While there may exist more than r — | pointwise linearly independent vector 
fields, there does not exist a global frame for any odd-dimensional sphere except 
for S', S? and S’. More precisely, Adams showed that the maximum number 
of pointwise linearly independent vector fields on an odd-dimensional sphere is 
given by the corresponding Radon-Hurwitz number [4]. On the other hand, on 
an even-dimensional sphere, every vector field has a zero. This is known as the 
Hairy Ball Theorem. For a proof, see for example [6]. As a consequence, S!, S? 
and S’ are the only spheres which are parallelizable. 


Exercises 
2.3.1 Show that "(E) carries the structure of a real vector space and of a bimodule 
over the algebra C°(M), cf. Remark 2.3.2/3. 


*For a guide to octonions, see [29]. 
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2.3.2 Prove the statement of Remark 2.3.2/4. 

2.3.3 Prove Proposition 2.3.7. 

2.3.4 Complete the proof of Proposition 2.3.12 by showing that the mapping x 
defined there as well as its inverse are of class C*. 

2.3.5 Prove Proposition 2.3.15. 


2.4 Vector Bundle Operations 


Every operation with vector spaces defines an operation with vector bundles by 
fibrewise application. Below, we will discuss the most important of these operations 
in the form of examples. The construction uses the method of Remark 2.2.5. It will 
be explained in some detail for the dual vector bundle and the direct sum of vector 
bundles. The other operations are then given without further explanations. 

Throughout this section, let FE, E; and Ez be K-vector bundles over M of class 
CK. Let, respectively, 2, 2, and zr be their projections and /, /; and /2 their di- 
mensions. Choose, respectively, typical fibres F, Fy and F2 and local trivializations 
(Ua, Xa), Ua; Xia) and (Ug, X2q~) Over an appropriate open covering {Uy : a € A} 
of M. 


Example 2.4.1 (Dual vector bundle) Take the dual vector space E*, of each fibre 
Em of E and define the set E* as the disjoint union 


E*= |_| Ef. 


meM 


Let ©" : E* > M be the natural projection to the index set. Define mappings 
* *\—] as -1 
Xa (8?) Ua) > Ua x FY, xg @=(m, (Xm) ©), 241) 


where m = 2” (é) and Tee mE * — E®, denotes the dual linear mapping of Xa,m : 
Em — F. The corresponding transition mappings are given by 


XE ° (xe) Tm, 9) = (m, (Xe ° Xe ) can) 


with m € Uy Ug and pw € F*. They are of class oun because so are the transition 
mappings xq 0 Xp of E. Thus, according to Remark 2.2.5, if we equip E* with the 
C*-structure induced by the family of mappings ie a € A}, then (E*, M,2®") 
is a K-vector bundle of class C k called the dual vector bundle of EZ. It has the 
same dimension as E, typical fibre F*, and {(U., XE) :a € A} is a system of local 
trivializations. 

Let {51,...,5;} be a local frame in E over U C M. For me U, let s(m)*!, 
...,5(m)*! denote the elements of the basis of E* which is dual to the basis 
{sj(m), ..., j(m)} of Em. Define local sections s* in E* by 


s"(m):=s(m)",  i=1,...,1. (2.4.2) 
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Using Proposition 2.3.12 it is easy to see that these local sections are of class Cé 
and form a local frame of E*, called the dual local frame or coframe. 
The pointwise evaluation mappings E*, x E,;, — K combine to a natural pairing 


1 (Ej) x P(Ev) > C*(U,R), (0,5) + (a,8), 
also denoted by o(s) or s(o). In terms of this pairing, sections of E* can be ex- 
panded over U as 
o =0(s;)s*! = (a, 5;)5*". (2.4.3) 


Let Eq and E; be K-vector bundles of class C* over Mz and Mz, respectively, and 
let ® : Ey — Ep be a morphism projecting to a diffeomorphism g : Mg — Mp. For 
every m € Mp, the linear mappings ®g-1¢,) : Eg.g-!¢m) > Eb,m induce dual linear 
mappings which combine to a fibre-preserving and fibrewise linear mapping 


OT he > Ee, (STE), x)= (E, By-1¢(X)), (2.4.4) 


where m € My, € € Ej ,, and x € Eg.o-!(m)> Which is a morphism projecting to 


the C*-diffeomorphism y~! (Exercise 2.4.1). It is called the dual morphism of ®. 
Via (2.3.3), the dual morphism induces a transport operator og} of sections. More 
generally, by duality, every morphism @® : E, — E>» induces the following operation 
on sections, called the pull-back, 


&* : T(E) > (Ez), ((*o)(m), x) :=(o o g(m), B(x), (2.4.5) 


where m € M, and x € Eq. Indeed, if ® projects to a diffeomorphism, then the 
pull-back is given by 


6*0 = 0 oo og= Ola, (2.4.6) 


that is, it coincides with the transport operator of the dual morphism.° 


Example 2.4.2 (Direct sum) Take the direct sum FE) m ® E2,m of the fibres over each 
point m € M and define 


E\ ® E2 = |_| E\m @ Eom. 
meM 


Let r® : E; ® Eo > M be the natural projection. Define mappings 
-1 
x2 : (x®) (Ua) > Ua x (Fi ® Fa) 
by 


Ke as XQ) = (m, (tiem (*1), X2a,m (x2))), 


Taking into account that 7 projects to g~!. 
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where m = © ((x,,x2)). By similar arguments as for the dual vector bundle, one 

can check that (E} @ Er, M,2®) is a K-vector bundle of class C* and that the 

mappings x© provide a system of local trivializations. E; ® E? is called the direct 

sum of E; and E>. It has dimension /; + /2 and typical fibre F; @ Fy. Next, note 

that every local section s; of E; over U can be viewed as a local section of EF; @ E2 

in an obvious way. Thus, given local frames {s;1,..., 8;;,} in E;, the collection 
{S115 -6-5 SLL 82,15 +++ S25} 


constitutes a local frame in E; ® £2. Finally, let Eg,, Eq and Ep,, Ep2 be K-vector 
bundles of class Ck over Mz and Mp, respectively, and let ®; : Eg; > Epi, i = 1, 2, 
be morphisms projecting to the same mapping g : Mz — Mp. The linear mappings 
Dim: Eagim — Epi,gm) induce linear mappings 


Pin @ Drm : Eai,m ® Eq2,m ay Eb1,g(m) ® E52,g(m): 
which combine to a morphism projecting to ¢, 
P| @ Oo: Eq) © Eq2 > Epi © Ep, (D1 B D2)m = Pim B P2,m, (2.4.7) 


where m € Mg. It is called the direct sum of ®; and @2. 
Example 2.4.3 (Tensor product) Define 


FE, @®E£2= |_| E\.m ® Erm, 
meM 


denote the canonical projection by z® : E; @ E, > M and take the system of in- 
duced local trivializations x® : (7®)~!(Uy) > Ug x (Fi ® F2) defined by 
XQ (x1 @ x2) = (mM, X1a,m(*1) ® X20,m (x2)), 


where m = 22 (x1 @ x2). Then, (E} @ Ex, M,x®) is a Cé-vector bundle, called the 
tensor product of FE; and E>. Its typical fibre is F; ® F2 and its dimension is 1/2. 
Every pair of local sections s; of E; over U, i = 1, 2, defines a local section 51 ® s2 
of FE; ® E2 by 

(sj ® s2)(m) :=s51(m) @ so(m), mew, (2.4.8) 


which is called the tensor product of s; and s9. If {s;,..., ;,} are local frames in 
E;,i = 1, 2, then 


{s1,; @ 92, :7=1,...,4,j=1,...,l} 


is a local frame in E; ® EF. For K-vector bundle morphisms ©; : Eq; > Epj, 
j = 1,2, projecting to the same mapping g : M, — Mp, the tensor product is the 
morphism @; © 2: Eq; ® Eqg2 > Ep ® Ep? defined by 


(D1 @ Dz) m (x1 ® X2) = Him (X1) @ B2,m (x2). (2.4.9) 
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Example 2.4.4 (Tensor bundles) The tensor bundle of E of type (p, q) is defined to 
be 

TLE := E*@-". @E* @ E@-'- @E. 


Its fibres are the p-fold covariant and q-fold contravariant tensor products T, Em. 
Hence, the dimension is /?*? and the elements of T Ey» are linear combinations of 
§1@---@E,p @xX1 @---@xXy, where x; € Em and § € E*. . The projection is denoted 
by 7®: TLE — M and the typical fibre is TAF . We will view elements of T Em 
as (p + q)-linear mappings 


P q 
u:EmxX-++ XEy X EXx ++) xEX >R, 


thus using the natural isomorphism which assigns to | @ --- @ &) @x1 @---@Xxq 
the mapping 
U(Y1, +++) Yps M1 --+s Ng) =€1 01) ++ Ep Vp) 1) ++ Ng %q)- 


Then, the tensor product of u; € 1 Em, 1 = 1,2, is given by 


uy @ u2(X1,.--Xpytpos &1, +++ s Fg 4a) 
=u (x},... Xp, 41, a », &4, U2 (Xp) 41, ee »Xpi+pr> &q, 41: : + gitar) 
(2.4.10) 


for all x; € Em and &; € E*.. Accordingly, local sections t of TLE over U can be 
viewed as mappings 


q 


en Pieyis ] sey x [(EG)x ++ x (EG) > c*(U) (2.4.11) 


which are C«(U)-linear in every argument. Every pair of local sections t; of Te E, 
— ; : qi+q2 
i = 1, 2, defines a local section tj ® T2 in pipe by 

(T1 ® T2)(m) = tT (mM) @ t2(m). 


On the level of the mappings (2.4.11), tT; ® T2 is given by (2.4.10), with u; replaced 
by t; and x; and &; replaced by local sections in E and E%*, respectively. In partic- 
ular, if {s,,...,s,;} is alocal frame in EF over U, then 


Ls @ +s Qs? Gy Ors O8y Tit nvcglpsflesves dg = lest 
is a local frame in TE . Every te I (Ze ) can be decomposed over U as 


Je Jq oxi j 
Tu =F pS @--- Os"? @sj, @--- @Sj, 


~~ “Upp 
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with 


(an) = t(m)(sj,(m), ..-,8;,(m),8*/'(m), ..., 5/9 (m)) (2.4.12) 


ij..1p 


(Exercise 2.4.2). Finally, according to (2.4.9), every isomorphism ® : E, > Ep of 
KK-vector bundles of class C* induces isomorphisms 


@® TYE, > T4Ep, 02:=(67')'@-’.@(07')' @ 08-1. 99, 
(2.4.13) 
with the same projection. On the level of (p + q)-linear mappings, ®® takes the 
form 


(S2u)(x1, wes Xp,81,.-., &) 
=u(®"(x1),--., Pp), OTE), ---, PMEq)) (2.4.14) 


for all u € Uy one Xi € Ep gam) and &; € ER wa The corresponding transport 
operators ®® satisfy 


D2 (Tt ® ) = (271) @ (Er) (2.4.15) 
for all tj € ry Eq), and 


(OP) Gigi. 58g Gixsin yes) 


Sa (Oo Six. Py Sp Pie, Poy) OO (2.4.16) 


for all TE (Th Ea ), 8; € (Ep) and oj; eI(E ). Here, p : Mg — Mp is the pro- 
jection of @. 


Example 2.4.5 (Exterior powers) The r-fold exterior power /\’ E* has the vector 
spaces /\" E* of flee eau r-linear forms on E,, as its fibres. Hence, the di- 
mension is ('. In particular, \° E* = M x K, (\' E* = E* and /\" E* =M x {0} 
(the zero-dimensional vector bundle over M) for r > 1. The projection is denoted 
by 2’: /\’ E* > M and the typical fibre is /\" F*. The exterior product of 
ni € /\"' EX is defined to be the (r} + r2)-linear form on E,, given by® 


(m1 A n2)(X1, tee Xr tr) 


1 . 
= » Sign()1 (Xr(1)+ +++ Xa) N22 AnH AD> «+s Xa(r}-4r))s 
r\!r2! 
TES; +r9 


(2417) 


®Beware that there exist different conventions concerning the choice of the factor in For- 
mula (2.4.17). 


76 2 Vector Bundles 


for all x; € Em. A local section o in NN’ E* over U can be viewed as an antisym- 
metric mapping 


oil (Ey)x “xT ley > CW) (2.4.18) 
which is C*(U)-linear in every argument. Every pair of local sections 0; of POE, 
i = 1, 2, defines a local section 0; A 02 of (\"!*” E* by 


(01 Ao2)(m) :=o01(m) Ao2(m), meu. (2.4.19) 


If we view oj A o2 as a mapping (2.4.18), it is given by (2.4.17) with &; replaced by 
o; and x; replaced by local sections in E. If {s,,..., 5;} is a local frame in F, then 


LO Rat ASI Sy ei <1} (2.4.20) 
is a local frame in /\’ E*. Every o € '(/\" E*) can be decomposed over U as 


ou = bs O11. 8 A+ As (2.4.21) 


ip<-<i, 


with oj,..;,(m) = o(m)(s;,(m), ..., 5;,(m)) (Exercise 2.4.2). Next, every K-vector 
bundle morphism @ : E, — Ej, projecting to a diffeomorphism g : M, — M, in- 
duces a morphism @™ : /\’ Ex > /\" EX projecting to y~!, defined by 


(2) (n)) (X1,...,Xyp) t= N(Dyo-1 amy 1), ee D-\(m) (Xr). (2.4.22) 


This generalizes Formula (2.4.4). Via (2.3.3), @' induces a transport operator 
(@')., of sections. Moreover, the pull-back operation (2.4.5) generalizes in an ob- 
vious way to a mapping &* : I'(/\" Es) > I'(/\’ E%), given by 


((®*o)(m)) (Nis se5j Xp) c= (o co) y(m)) (®(x1), jie ®(x;)). (2.4.23) 
Again, if ® projects to a diffeomorphism, then * = (@™),. 


Example 2.4.6 (Exterior algebra bundle) By composing the operations of exterior 
power and direct sum one obtains the exterior algebra bundle /\ E* = Qj _y /\' E*. 
which has dimension 2!. We retain the notations zr” : /\E = M for the projection 
and 61: ( E¥ — /\ Ex for the morphism induced by a morphism ® : Eg > Ep. 
The local frame in /\ E* associated with a local frame {s1,...,s;} in E consists of 
the constant mapping U — K given by m+ 1 and the local sections (2.4.20) with 
r=1,...,/. In addition to being a vector bundle, /\ E* is an associative K-algebra 
bundle’ of class Ck over M. The exterior product of local sections (2.4.19) induces 


TIn the definition of vector bundle, replace “K-vector space” by “K-algebra” and “linear mapping” 
by “algebra homomorphism”. 
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a bilinear mapping 


r(Ae)xr(Ae)=r(Ke) 


and hence defines on "(/\ E*) the structure of an associative K-algebra. By (2.4.17) 
and (2.4.23), the pull-back is a homomorphism with respect to this algebra structure, 


®* (0) Aon) =(@*01) A(®*o2), 01,02 € r(A E}). (2.4.24) 


Remark 2.4.7 (Homomorphism and endomorphism bundles) Analogously, one can 
construct the homomorphism bundle Hom(£), F2) of E; and E2, which has the 
fibres Hom(£1,™, E2,m), and the endomorphism bundle End(£) of E, which has 
the fibres End(E,,). For every m, the vector space Hom(£1,m, E2,m) is naturally 
isomorphic to the vector space E L m © £2,m and all these isomorphisms combine 
to a natural isomorphism of Hom(£}, E2) with the tensor product E bf ® E> (Exer- 
cise 2.4.4). Therefore, we may always identify Hom(£, E2) with E ; ® E2. Accord- 
ingly, we may identify End(£) with the tensor bundle E* ® E=T i E of E. Then, 
since vertical C* -morphisms EF, — E) correspond to C *_sections of Hom(£}, £2), 
the vector space of these morphisms is naturally isomorphic to '(E} ® E2). Ac- 
cordingly, since vertical endomorphisms of EF correspond to sections of End(£), the 
vector space of these endomorphisms is naturally isomorphic to I” (TIE ). The proof 
is left to the reader (Exercise 2.4.5). 


Exercises 


2.4.1 Show that the mapping 7 defined by (2.4.4) is a morphism of vector bundles. 

2.4.2 Verify Formulae (2.4.12) and (2.4.21). 

2.4.3 Let (E, M, 7) be a smooth K-vector bundle. Consider the tangent mapping 
mw':TE>TM. 

(a) Show that (TE, TM, x’) is a K-vector bundle by determining the linear 
structure of the fibres and constructing a system of local trivializations. 

(b) Show that in the cases E = TM and E = T*M, local charts on M induce 
local trivializations of (TE, TM, 2’). 

(c) If E = TM, then (TE, TM, zr’) has the same base manifold as the tangent 
bundle of E. Are these two vector bundles isomorphic? 

2.4.4 Let E, E, and E> be K-vector bundles over M of class C*. Construct the ho- 
momorphism bundle Hom(£), 2) and the endomorphism bundle End(£) as 
explained in Remark 2.4.7. Show that Hom(£}, E2) and End(£) are naturally 
isomorphic to Ej} ® E2 and dle E, respectively. 

2.4.5 Show that the natural isomorphisms of Exercise 2.4.4 induce natural iso- 
morphisms between the vector space of vertical Cé-morphisms E; > Ez 
and '(E i ® Ez), as well as between the vector space of vertical C Es 
endomorphisms of E and Tle ), cf. Remark 2.4.7. 
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2.4.6 The image of the identical mapping idg¢ under the isomorphism from the vec- 
tor space of C‘-endomorphisms of E to I” (TIE ) of Exercise 2.4.5 is called 
the Kronecker tensor field of EF and is denoted by 6. Determine the coefficient 
functions 5 of 5 with respect to the local frame in TIE induced by a local 
frame in E. 

2.4.7 Show that if E is one-dimensional, the tensor bundles TRE : TSE and TIE 
are trivial. 


2.5 Tensor Bundles and Tensor Fields 


Let M be a C*-manifold of dimension n. By tensor bundles over M one means the 
various vector bundles arising from the tangent bundle TM by applying the vector 
bundle operations of Sect. 2.4. These are 


(a) the cotangent bundle T*M := (TM)*. Its fibres are the cotangent spaces® TM 
introduced in Sect. 1.4. (Local) sections of T* M are called (local) covector fields 
or (local) differential 1-forms. 

(b) The bundle of alternating r-vectors /\’ TM, the bundle of alternating r-forms 
/\' T*M and the bundles of exterior algebras 


n 


Atm =Q@ Atm. Arm=@ Arm. 


r=0 r=0 


Their (local) sections are called (local) multivector fields and (local) differential 
forms, respectively. The number r is called the degree. We denote 


3%" (M) := r(A T™), Q"(M):= r(Atm), Q*(M):= r(AT™) 


and, as before, X(M) = ¥!(M). One has X°(M) = 2°(M) = C*(M). 

(c) The tensor bundles 14M = aes (TM), p,g =0,1,2,.... Their (local) sections 
are called (local) tensor fields of type (p,q). The algebraic operations of sym- 
metrization, antisymmetrization and contraction of tensors over a vector space 
carry over to tensor fields in an obvious way. 


Since TM is of class C trod so are all the tensor bundles over M. Recall from Ex- 
ample 2.4.1 that pointwise evaluation T?, M x T,,M — R defines a natural pairing 


2'(M) x X(M) > C*—!(M), (a, X) (a, X), (2.5.1) 


which depending on the context can also be written as a(X) or X (a). 


8Like for the tangent bundle we will stick to this notation (instead of writing (T*M)j). 
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Example 2.5.1 Let f : U > R, with U C M open, be a real-valued local C*- 
function. Then, the differentials (df), of f atm € U, defined by (1.4.20), combine 
to a local Cé~!-covector field df on U, called the differential of f. 


Now, let (U,«) be a local chart on M. The differentials of the coordinate func- 
tions x! form a local frame {d«!,...,d«”} in T*M which is dual to {0),..., d,}, cf. 
Examples 2.3.11/1 and 2.4.1 and Formula (1.4.21). The induced local frame in the 
tensor bundle TAM consists of the local sections 


de icy es cite dich 0) C+e=@ O)2 - Liywinyd gy flvesca dg = lkaiagths 


see Example 2.4.4. Using these local frames, a tensor field T of type (p,q) can be 
represented locally as follows: 


Thu = (Te) duet Q@-»-@dkir @ dj, @---@ d;,, (2.5.2) 


i..0p 


where, according to (2.4.12), pointwise we have 


(P57 = T (Bip, «++ Bip de", ..., de/2). (2.5.3) 
Remark 2.5.2 We determine the transformation laws for the local frames and for the 
corresponding coefficient functions of tensor fields under a change of local chart. 
Thus, let (V, ) be another local chart on M. The following formulae hold over 
UV. From (1.4.17) and (1.4.23) we read off 


af = Ao dp! = Aide! 


where 


Ai:=[(po«) ox], Ab :=[(eop 7!) op], 


and an according formula for the induced local frames in 14M . Then, (2.5.3) im- 


plies 


Shed qk vky ij J Ly seed, 
(Ty An Ap An AL (Tee (2.5.4) 


iy nip kk’ 


To pass to coefficient functions which depend on the coordinates, denote the ele- 
ments of «(U MV) by x and the elements of o(U M V) by y and write 


yiQx):=plow (x), x! (y):= Kop! (y). 
Then, from (2.5.4) we read off 


k kp ‘ : 
(eye Pee. = ox"! ox"? dy/! ayia 


se ip Vl alg —1 
Had = "py aye Oa oh (7) ok, (2.5.5) 


ky...Kp 
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This formula is well-known from classical tensor analysis. The argument in (2.5.5) 


. . . i j tee j poe 
can be either x, in which case axe and (T? ah i ‘op ' have to be evaluated at y(x), 
y ade 
ayla 


or y, in which case a 


and (T* es ok! have to be evaluated at x(y). 


Next, let M and N be C*-manifolds and let y : M — N be a Ck-mapping. Ac- 
cording to Proposition 2.2.9, g’ : TM — TN is a vector bundle morphism of class 
C*—! projecting to g. The corresponding pull-back operation (2.4.23) applies to dif- 
ferential r-forms of class C*—!. It will be denoted by g* : Q2"(N) > 2" (M). Ac- 
cording to Examples 2.4.1—2.4.5, if g is a diffeomorphism, g’ induces isomorphisms 
of tensor bundles. The corresponding transport operator (2.3.3) will be denoted by 
(x in case the induced isomorphism projects to gy and by g* in case it projects to 
gy !. Then, for T € r(vsM), we have 


9,T = ((v')®) 7 = (v')® oT og, (2.5.6) 
with (y’)® given by (2.4.13), and Formula (2.4.16) takes the form 


(GxT)(X1,..., Xp, 1, +++, Aq) 


=T(g,'X1,...,0,;' Xp, 9*a1,...,9*ag) og (2.5.7) 
with X; € €(M) and a; € 2!(M). Moreover, Eq. (2.4.15) reads 


?x(T) ® Tz) = (GT) ® (xT). (2.5.8) 


Recall from Example 2.4.5 that for differential forms, the transport operation g* 
coincides with the pull-back under @. 


Remark 2.5.3 


1. Let TET’ (15M ) and let g: M —> N be a diffeomorphism. Given local charts 
(U,«) and (V, e) on M and N, respectively, the local formula for the transport 
(2.5.6) of T is given by (2.5.4), with T replaced by @,.T on the left hand side and 
by T og7! on the right hand side, and with A and A given by 


Ai =[(oogox!)'oxog'},, Aj =[(cog op!) op],. 


The proof of this fact is left to the reader (Exercise 2.5.3). 

2. Let (U,«) be a local chart on M. We compare the corresponding local represen- 
tative «,(T;y) of a tensor field T ¢ I (15M ) with the local representative of the 
mapping T: M > TAM with respect to the induced chart ((®)-!(U), K®) on 
T,M , given by 


K® o Thu ok} :K(U) > K(U) x TSR’. 
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Proofs are left to the reader (Exercise 2.5.4). Since 
a * Vl i 
K,0j = —, kK’ dx' =dk', (2.5.9) 
ox! 
we have 


i) 


ko(Tyu) = (Tj 8 ox! )dx"! @ + @ =. 


ial ip 


(2.5.10) 
On the other hand, 


(x? 0 Try on ')) = (x, ((TH)f 4 on TO) e™ @--- @ej,), 
where, as before, e; denote the elements of the standard basis of R” and e*! the 
elements of the dual basis. The relation to «,.(7}y ) is as follows. The natural iden- 
tifications of the tangent spaces Tx(k(U)) with R” and of the cotangent spaces 
Tx(«(U)) with R’ induce a natural identification of tensor fields on «(U) with 
C*—!_mappings aa )—> T IR”. Since the latter identifies the elements of the 
global frames {4 ai} in Tee )) and {dx' ‘tin T*(k(U)) with the constant map- 
pings x b> e; ahd xt> e*', respectively, it identifies x, (T;y) with K®o Try ox =i 
Note that for M = R” and x = id, (2.5.9) yields 0; = a 


xt 


Exercises 


2.5.1 Let M; = Ry x S!, with S! realized as the unit sphere in R?, and M> = 
R? \ {0}. Consider the mapping g: M, > Mo, g(r, (a, b)) := (ra,rb). Letr 
denote the standard coordinate on R and let ¢ denote the angle coordinate 
of S!. Determine the coefficient functions of yy 2 and @,. a with respect to 
the global frame {4, 2} in T(R? \ {0}). 

2.5.2 Let M; = R, x S?, with S* realized as the unit sphere in R*, and Mz = 
IR? \ {0}. Consider the mapping y : Mj > Mo, g(r, (a,b, c)) = (ra, rb, rc). 
Let r denote the natural coordinate on R+ and let the angle coordinates 3, @ 
on S? be defined by a = cone sin v, b= nn? sin", c= cos v. Determine the 
coefficient functions of ¢,. 2. > Px # a and ¢,. 4 4 With respect to the global frame 
(aes a> ae} in TOR? \ {0}). 

2.5.3 Prove the transformation formula for the transport of tensor fields under dif- 


feomorphisms given in Remark 2.5.3/1. 
2.5.4 Prove the assertions of Remark 2.5.3/2. 


2.6 Induced Bundles 


Let (E, N,7) be a K-vector bundle of class cC. let M be a C*-manifold and let 
g € CK(M, N). Using ¢, one can construct from E a vector bundle y* E over M by 


82 2 Vector Bundles 


attaching to m € M the fibre Eyqn) as follows. Define 
QE: {(m, x) eM x E:g(m) =n(x)| 


and consider the surjective mapping 7” : y*E — M defined by 2% (m, x) :=™m. 
The fibres are 


(v*E),, = (2?) '(m) =(m} x Egun)- 


They inherit a natural K-vector space structure from EF. 


Proposition 2.6.1 Under the above assumptions, g* E admits a C*-structure such 
that it is an embedded submanifold of M x E. Then, 


1. (g*E,M, a”) is a K-vector bundle of class ct, 
2. the natural projection M x E - E restricts to a C*-morphism o* E — E cov- 


ering ~, 
3. every local section s of E induces a local section of y* E defined by 


(y*s) (m) := (m, 50 y(m)). 


Proof We apply Proposition 1.7.3 in the formulation of Remark 1.7.4. Choose a 
typical fibre F and a system of local trivializations {(Ug, Xa) : a € A} for E. For 
every a € A, consider the open subset Vy := g~!(Uq) of M and the mapping 


Wa: Vax F>Mx E, Wa(m, u) = (m, xy (v(m), u)). 


Since pq is obtained by composing the diffeomorphism x7 ' with the natural in- 
clusion mapping of the graph of gry, : Va > Ug, by Example 1.6.12/2, it is a Ce 
embedding. Hence, the image (Va x F) inherits a C *_structure from Vy x F and 
with respect to this structure it is an embedded C*-submanifold of M x E. Since 
the image is y* EM (Va x a~!(Uy)) and since the Vy x x—!(Ug) are open subsets 
of M x E covering y* FE, we conclude that g* E is an embedded submanifold. It 
remains to prove assertion 1; assertions 2 and 3 are then obvious. Since m? is the 
restriction of the natural projection M x E > M to the C*-submanifold g*E, it 
is of class C*. Since, by construction, the Wy restrict to C<-diffeomorphisms from 
Vy x F to p* EN (Vy x m~!(Uy)) = (0 ?")“! (Vy), by inverting them we obtain 
C*-diffeomorphisms 


xo" (2?) Va) > Va x Fy x8 (nx) =(m, Xa,gem()). (2.6.1) 


The latter satisfy conditions 2a and 2b of Definition 2.2.1. Thus, (g*E, M, mr?) 1S 
a K-vector bundle of class C*. 


Definition 2.6.2 (Induced vector bundle) The K-vector bundle (g*E, M, nm?) is 
called the vector bundle induced from E by ¢ or the pull-back of E by g. For a 
local section s of E, the local section g*s of y* E is said to be induced from s by g 
or to be the pull-back of s by ¢. 
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Another common notation for the induced vector bundle is gp* E = M xy E. 


Remark 2.6.3 


1. 


From the proof of Proposition 2.6.1 we note that via (2.6.1), every local triv- 
ialization (U, x) of E induces a local trivialization (gy !(U), x?) of g*E. 
In particular, the pull-back of a trivial vector bundle is trivial. Moreover, if 
Pap : Ug 1 Ug — GL(F) are the transition functions of a system of local trivi- 
alizations of E, then g* fgg : g~!(Uw) a yp '(Uz) — GL(F) are the transition 
functions of the induced system of local trivializations of g* E. 


. Let (E;, M;,7;), i = 1, 2, be K-vector bundles of class C* and let ® : Ey} > E 


be a morphism with projection g : M, > Mp2. @ naturally decomposes as 


Dyer ® or 
E, —$ g* Ex —% Ep, (2.6.2) 


where yer is given by Pyer(x) = (771 (x), (x)), x € Ej, and Pyo, denotes the 
induced vector bundle morphism of Proposition 2.6.1/2. One can check that Pye, 
is a vertical morphism, with differentiability of class Cé following from Propo- 
sition 1.6.10 and the fact that y*E is an embedded submanifold of M x E. 
Using this decomposition, one can derive the following characterization of iso- 
morphisms in terms of their projections and fibre mappings (Exercise 2.6.1): a 
morphism is an isomorphism iff its projection is a diffeomorphism and its fibre 
mappings are bijective. 


Example 2.6.4 


1. 


2. 


If g: M + N is constant with y(m) = p, then y* E coincides with the product 
vector bundle M x Ep. 

If M CN is an open subset and j : M > N is the natural inclusion mapping, 
jE can be identified with the restriction E}y, see Example 2.2.3/3. 


. Let M= N =S! and let E be the Mobius strip of Example 2.2.4. Realize S! 


as the unit circle in C and consider the n-fold covering @p : si > s!, Gn(Z) = 
z". Since y*E is a differentiable real vector bundle over S! of dimension 1, 
according to Remark 2.3.14, it must be isomorphic to either E or the product 
vector bundle S! x R. Indeed, one finds (Exercise 2.6.2) 


E | n odd, 


“ES 
Pn { x R| 7 even. 


. Let E; and E> be K-vector bundles of class C K over M, let E 1 X E> denote the 


product vector bundle over M x M, see Exercise 2.2.7, and let A: M—> Mx M 
denote the diagonal mapping, A(m) = (m,m). The pull-back A*(E, x E2) is 
naturally isomorphic to the direct sum E, @ E> (Exercise 2.6.3). 


. If E is a K-vector bundle of class C* over N and (M, g)isaC k submanifold of 


N, the induced vector bundle ¢* F is referred to as the restriction of E to M and 
is usually denoted by E;. This applies in particular to E = TN, where y*TN 
is a real vector bundle over M of class C<~! and dimension dim N. 
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Exercises 

2.6.1 Use the natural decomposition (2.6.2) of vector bundle morphisms to show 
that a morphism is an isomorphism iff its projection is a diffeomorphism and 
the fibre mappings are bijective, cf. Remark 2.6.3/2. 

2.6.2 Prove the statement of Example 2.6.4/3 about the pull-back of the Mobius 
strip by means of a covering of S!. 

2.6.3 Show that A*(E, x E2) = FE; @ Ez, see Example 2.6.4/4. 


2.7 Subbundles and Quotient Bundles 


Definition 2.7.1 (Vector subbundle) Let (£;, Mj, 7;), i = 1, 2, be K-vector bundles 
of class Ck and let ® : E; > E> bea morphism. The pair (E;, ®) is called a sub- 
bundle, an initial subbundle or an embedded subbundle of EF» if it is, respectively, a 
submanifold, an initial submanifold or an embedded submanifold. If M; = M, = M 
and @ is vertical, (E;, ®) is called a vertical subbundle or a subbundle over M. 


At the very beginning, we observe that Propositions 1.6.10 and 1.6.14 remain true 
if the term submanifold is replaced by subbundle and C<-mapping by morphism. 
The following two specific types of subbundles are the building blocks for arbitrary 
subbundles. 


Example 2.7.2 (Vertical subbundle) If E; and £2 are K-vector bundles of class C . 
over M and ®: E,; — Ep? is an injective vertical morphism, then (£1, ®) is a ver- 
tical subbundle of £2. Vertical subbundles are embedded. To see this, it suffices to 
show that (£1, ®) is an embedded submanifold of E>. Let /; denote the dimensions 
of E;. Necessarily, 1; <2. Let x € E; and m := 7(x). Choose a local frame in 
E, at m. By injectivity, the image under @ is a local /,-frame in Ey. According to 
Proposition 2.3.15/3, the latter can be complemented, over a possibly smaller do- 
main U, to a local frame in Ey at m. The local representative of ® with respect to 
the local trivializations associated with these local frames in EF; and E> is given by 


UxK!+UxK?, — (mx) (m, (x,0)). 


Hence, it is an embedding. Since x, (U) = &-!(5'(U)), this implies that the 


restriction ® is an embedding. Since 7, '(Y) isan open neighbourhood 


[@-! (ry '(U)) 
of }(x) and x was arbitrary, Remark 1.6.13/3 yields the assertion. 


Example 2.7.3 (Restriction of the base manifold) Let (E, N, 7) be a K-vector bun- 
dle of class C* and let (M,y) be a C*-submanifold of N. Let ® : y*E > E de- 
note the induced vector bundle morphism of Proposition 2.6.1/2. Recall from Ex- 
ample 2.6.4/5 that g* E is referred to as the restriction of E to M and is alternatively 
denoted by E;y. We show that (y* E, ®) is a C*-subbundle of E. If (M, ¢) is initial 
or embedded, so is (y* E, ®). Indeed, the local representatives of ® with respect to 
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a system of local trivializations {(Ug,, Xx.) : a € A} of E and the induced system of 
local trivializations of g* E are given by 


ee 
Xa © Prgory-1(y-1(yy)) ° (x?) :@ '(Uy) x F > Uy x F, 


(m, U) (v(m), u). 


P 
First, this implies that ® is an immersion. Second, since xy and xf are diffeomor- 
phisms and since 


(2%) (@7! (Ua) = ®7! (x7 !(Uy)), 


this implies that the submanifolds (®~!(2~!(Ug)), ®pg-1¢-1(u,))) inherit the 
property of being initial or embedded from (M, g). Then, Remark 1.6.13/3 yields 
the assertion. 


The following proposition states criteria for a morphism to define a subbundle. 


Proposition 2.7.4 Let (E;, Mj, 7), i = 1,2, be K-vector bundles of class C*, let 
@ : E, > E2 be a morphism and let gp: M, > M2 be the projection. The following 
statements are equivalent. 


1. (£1, ®) is, respectively, a subbundle, initial subbundle or embedded subbundle 
of E2. 

2. (M1, ¢) is, respectively, a submanifold, initial submanifold or embedded sub- 
manifold of Mz and the fibre mappings ®y : E\,m — E2,9(m) are injective for all 
me M,. 

3. In the decomposition (2.6.2), (E1, Pyer) is a vertical subbundle of y* Ey and 
(g* E, Por) is, respectively, a subbundle, initial subbundle or embedded sub- 
bundle of E>. 


Item 3 gives a precise meaning to the statement made above that vertical subbun- 
dles (Example 2.7.2) and restrictions of the base manifold (Example 2.7.3) provide 
the building blocks for arbitrary subbundles. 


Proof | = 2: The fibre mappings ®,, are obviously injective. Since they are linear, 
one has the commutative diagram 


Poso1 
M, —~> E> 


NI 


M2 
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where sp; denotes the zero sections of E;. According to Proposition 2.3.5, so,; and 
50,2 are embeddings. Hence, the assertion follows by applying Proposition 1.6.14. 
(We encourage the reader to work out the argument for each case.) 

2 = 3: Since the mappings ®,, are injective, Pye; is injective, hence the asser- 
tion on (EF, ®yer) holds due to Example 2.7.2. The assertion on (g* E2, Phor) was 
proved in Example 2.7.3. 

3 = |: Since vertical subbundles are embedded, this follows from Proposi- 
tion 1.6.14/1. 


In the following proposition we give criteria for a family of fibre subspaces of a 
vector bundle to define a vertical subbundle. The proof is left to the reader (Exer- 
cise 2.7.1). 


Proposition 2.7.5 (Families of fibre subspaces) Let (Ez, M, 1) be a K-vector bun- 
dle of class C*. For every m € M, let E\,m C E2,m be a linear subspace. Define 
E\ := mem E1,m- The following statements are equivalent. 


1. E, admits a C*-structure such that it is a vertical subbundle of Ez of dimen- 
sionr. 

2. There exists a covering of M by local r-frames in E2 which span E. 

3. There exists a covering of M by local frames in Ez whose first r elements 
span E}. 

4. There exists a system of local trivializations {(Ug, Xv) : a € A} of Ex and a 
subspace F\ of dimension r of the typical fibre Fy such that the restrictions of 
the Xq to E, take values in Uy x Fy. 


Example 2.7.6 (Regular distribution) Let M be a C*-manifold. A vertical subbun- 
dle (D, ®) of TM is called a regular distribution (in the geometrical sense) on M. 
According to Proposition 2.7.5, a family of r-dimensional subspaces Dy, C TM, 
m € M, defines a distribution iff for every mo € M there exists an open neighbour- 
hood U and pointwise linearly independent local vector fields X;,..., X, on U 
such that D,, is spanned by X1,m,..., X;m for all m € U. There is a more general 
notion of distribution on M which will be defined and studied in Sect. 3.5. 


Example 2.7.7 (Kernel and image) Let E; be K-vector bundles over M of class Cé 
and dimension /;, i = 1,2, and let ®: E, — Ep» be a vertical morphism of constant 
rank r. Define the image and the kernel of ® to be 


im@ := U im®», ker := U ker @,,, 
meM meM 


respectively. We show that im @ is a vertical subbundle of E2 of dimension r and 
that ker ® is a vertical subbundle of £; of dimension /; — r. 

Let mo € M. Choose a basis {e1,..., e7,} Of E1,mo Such that e-41,...,e7, span 
ker @,,,. Extend this basis to a local frame {s1, ..., sj, } in Ey, cf. Proposition 2.3.15. 
By construction, the vectors ®»,.(e1),..., Pm (er) form a basis of the subspace 
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im ®». C E2,m- In particular, the local sections @ os1,..., Bos, of E2 are linearly 
independent at mo so that, by possibly shrinking the domain of definition of the s;, 
we may assume that they form a local r-frame in EF. Since @ has rank r, this local 
r-frame spans im @,, for all m belonging to the domain of definition. First, in view 
of Proposition 2.7.5, this yields the assertion for im®. Second, this implies that there 
exist local Cé-functions aj,is=r+l,...,4,j=1,...,r,0n M such that 


i 
Posi =) aij osj, r+1<i<kh. 
j=l 


Then the local sections 5,41, ..., 57, given by 


- 
Sj = Sj — ) ajsj, rt lsi<h, 
j=l 


form a local (/; — r)-frame in Ey spanning ker ®,,. Applying Proposition 2.7.5 once 
again, we obtain the assertion for ker ®. 


Example 2.7.8 (Annihilator) Let V be a vector space. The annihilator of a subspace 
W C V is the subspace 


W® := {ve V*: vbw =0} 


of the dual vector space V*. Let E> be a K-vector bundle over M of class C* and 
dimension / and let (E;, ®) be a vertical subbundle of dimension /;. Then, 


E°:= (J (®(Eim))° 


meM 


is a vertical subbundle of dimension /) — 1; of the dual vector bundle E%, called 
the annihilator of E; in E2. In view of Proposition 2.7.5/3, this follows from the 
obvious fact that for every local frame in E2 whose first /; elements span (£1, ®), 
the last 17 — /, elements of the corresponding dual local frame in E> span E i The 
annihilator of a general vector subbundle (£,, ®) is defined to be (E i (Phor) [ Bo), 


where EY is the annihilator of the vertical subbundle (£1, ®ye,) of g* Ey and ¢ is 
the projection of ®. It has the same base manifold as EF}. 


Remark 2.7.9 


1. For every vertical subbundle (EZ, ®) of E2 there exists a complement in E2, that 
is, a vertical subbundle (E1, @) of E> such that Ex = E; ® Fi. The proof is in 
two steps. 

(a) Show that for every K-vector bundle (£, M, 7) of class C* there exists a 
C*-function h: E @ E — K such that h,, :=h |Em@Em 48 a scalar product on 
Em for all m € M (Exercise 2.7.2).? 


m 


CE, h) is called a Euclidean vector bundle if K = R and a Hermitian vector bundle if K = C. 
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(b) Show that the family of h,,-orthogonal complements of the subspaces 
E\.m C E2,m defines a vertical subbundle of Ez (Exercise 2.7.3). 

2. Let M be a compact smooth manifold. The statement of | provides part of the 
proof that for every smooth vector bundle FE over M there exists a smooth vec- 
tor bundle E over M such that E @ E is trivial. For the remaining part, see 
for example [125, Prop. 1.4].!° This is known as the cancellation property and 
is an important ingredient in what is called the K-theory of M. Let us have a 
glimpse at the reduced version of the latter. Two smooth K-vector bundles E and 
E over M are said to be stably equivalent if E @ (M x K") is isomorphic to 
E® (M x KK*) for some r,s. The set of stable equivalence classes is an Abelian 
semigroup with respect to the operation of direct sum, where the unit element is 
given by the class of trivial bundles. Now, the cancellation property yields that 
every element of this semigroup has an inverse, hence the semigroup is in fact 
a group, called the reduced real (for K = R) or complex (for K = C) K-group 
of M. Together with the operation of tensor product, it is an Abelian ring. 


Next, we discuss quotient vector bundles. Let (E2, M, 12) be a K-vector bundle 
of class C¥ and let (E,, ®) be a vertical subbundle of E2 of rank r with projec- 
tion z. Since vertical subbundles are embedded, we may assume that E; C E> 
and @ is the natural inclusion mapping. F1 is a vector subspace of E>, for all 
m € M, and we can form the quotient spaces E2 /E1,m. Let 


Eo/E\:= | | Eo m/E\,m 
meM 


and let 2 : E2/E, — M denote the natural projection to the index set. By construc- 
tion, the fibres 2 ~!(m) are vector spaces. According to Proposition 2.7.5, there ex- 
ists a family of local trivializations {(Ug, x2q) : a € A} of Ez and an r-dimensional 
subspace F; of the typical fibre Fy of E> such that the restrictions of x2_ to E take 
values in Uy x F). For any such x29, we define a mapping 


Xa: Uy) > Ug x Fa/Fi, Xa([x]) = (m, [ x20,m (x)]), 


where m = 72(x). To check differentiability of the corresponding transition map- 
pings, choose a complement F, of F, in F> and let 4: F)/F,| — Fy» denote the 
linear mapping which assigns to each class its unique representative in F,. More- 
over, let pr: F2 + F/F; be the natural projection. Since xg o x7 ' decomposes 
as 


Xpe Xo" = (id x pr) o (x2p co) Xu ) o (id x A), 


it is of class Ck. Then, Remark 2.2.5 yields that the family {(Ug, Xq) : a € A} de- 
fines a C*-structure on E/E, such that (E2/E,, M,z) is a K-vector bundle of 
class C* over M. This C*-structure obviously does not depend on the choice of the 
subspace F). 


'OCompactness of M is necessary here, see Example 3.6 in [125]. 
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Definition 2.7.10 (Quotient vector bundle) The vector bundle (E2/£,, M, 7) con- 
structed above is called the quotient vector bundle of E> by £}. 


Remark 2.7.11 


1. The fibrewise natural projections Ez, — E2,/E\,m to classes combine to a 
natural projection E27 + E/E). The latter is a vertical morphism, because its 
local representative with respect to a local trivialization (Ug, X2q) of Ez whose 
restriction to E, takes values in Uy x F\, and the induced local trivialization of 
E/E, is given by the natural projection F2 — F/F |. By composing a local 
section s of E2 with the natural projection E2 — E/E, one obtains a local 
section of E/E}, denoted by [s]. 


2. Let/; denote the dimension of £;, i = 1, 2. For any local frame {s1,..., 57,} in E2 
with the property!! that s;,..., 57, span £1, {[s7,41],..-, [s7,]} 1s a local frame 
in E2/E). 


3. According to Remark 2.7.9/1, E, admits a complement E 1 in E. For any such 
complement, the natural projection Ez — E/E restricts to a vertical isomor- 
phism E, — E/E. This follows at once by observing that the induced mapping 
is a bijective vertical morphism. Thus, every complement defines a vector bundle 
isomorphism 


E> = E, @ (E2/E)). 


4. By a coorientation, or transversal orientation, of E; in Ez one means an orien- 
tation of the quotient vector bundle E2/E,. Accordingly, FE; is said to be coori- 
entable, or transversally orientable, in F2 if E2/£F, is orientable. 


Example 2.7.12 (Homomorphism theorem) Let E; and Ez be K-vector bundles of 
class C* over M and let ®: Ej > E> be a vertical morphism of constant rank. 
Then, the induced mapping 


® : E,/ker® > im® 


is an isomorphism. Indeed, @ is obviously bijective and fibrewise linear. To see that 
it is of class C*, one may choose a complement Eo of ker® in E and write ® 
as the composition of the isomorphism E,/ker ® — Epo and the restriction of @ in 
domain to Eo. Thus, @ is a bijective vertical morphism and hence an isomorphism. 


Example 2.7.13 (Dual quotient vector bundle) Let Ez be a K-vector bundle of class 
C* over M and let E, be a vertical subbundle. The dual vector bundle (E> /E\)* 
is called the dual quotient vector bundle. It is naturally isomorphic over M to the 
annihilator E a Indeed, the mapping 


®: EY > (Eo/Ei)*, — (@m(&))([x]) =), § € ERs X © Erm, 


‘Such local frames exist by Proposition 2.7.5. 
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is well-defined, bijective and fibrewise linear. Hence, it remains to show that @ is of 


class C*. To see this, choose a local frame {51,..., 5j,}, whose first 1; elements span 
E, over U. Then, the elements s*!, i =/; +1, ..., lo, of the dual local frame span E? 
over U and, according to Remark 2.7.11/2, the dual local frame {[s;,41]*, ..., [s7,]*} 


spans (E2/E1)* over U. By construction, the local representative of ® with respect 
to the local trivializations defined by these local frames does not depend on m and 
is hence of class C*, as asserted. 


To conclude this section, we discuss vector bundle structures induced by sub- 
manifolds. Thus, let V be a C*-manifold and let (M, v) beaC kK submanifold of N. 


Proposition 2.7.14 (TM, 9’) is a vector subbundle of TN. It is initial or embedded 
iff so is (M, 9). 


Proof Recall from Example 2.6.4/5 that the restriction of TN to the submanifold 
(M, ¢) is defined to be the induced vector bundle (TN); m := y*TN. Since ¢ is an 
immersion, the vertical morphism (9’) yer : TM — (TN) tm in the natural decom- 
position (2.6.2) of g’ is injective. Hence, (TM, (9’)vyer) is a vertical subbundle of 
(TN) ;a. Then, Proposition 2.7.4/3 yields that (TM, gy’) is a subbundle of TN and 
that it is initial or embedded if so is (M, y). The converse direction follows from 
Proposition 1.6.14 and the fact that the zero sections of the tangent bundles of M 
and N are embeddings. The details are left to the reader (Exercise 2.7.4). 


Remark 2.7.15 Let V be a finite-dimensional real vector space and let M C V be an 
embedded C*-submanifold. For every u € M, the natural identification of T, V with 
V of Example 1.4.3/2 identifies T, M with a subspace of V, which we denote by the 
same symbol. In particular, in case M is open in V, one has T, M = V; and in case 
M is a level set of a C'-mapping f, one has T, M = ker f’(v). In the general case, 
T,M is just the tangent plane of M at v, shifted by —v to the origin. Thus, together 
with the induced natural identification of TV with V x V, Proposition 2.7.14 yields 
a natural identification of TM with the embedded C‘—!-submanifold 


{(v,X)eMxV:XET,M} 


of M x V and a natural representation of vector fields on M by C*—!-mappings 
X:M — V satisfying X(v) € T,M for all v € M. This generalizes Remarks 2.1.4/2 
and 2.3.4/2. 


A further consequence of the observation that (TM, (@’)ver) is a vertical subbun- 
dle of (TN); is the following. A vector field X on N is said to be tangent to the 
submanifold (M, ¢) if Xgim) € ¢'(TmM) for allm eM. 


Proposition 2.7.16 Let N be a manifold and let (M, ~) be a submanifold of N. For 
every vector field X on N which is tangent to (M, ~), there exists a unique vector 
field X on M such that g' 0 X = X og, that is, X and X are o-related. 
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We will say that X is induced from X and call it the restriction of X to (M, ¢). 


Proof Due to the assumption, the equation ’ o X = X og defines a mapping xX: 
M — TM. X is the restriction in range to TM of the section of (TN) mw = g* TN 
induced from X by @. Since vertical subbundles are embedded, Proposition 1.6.10 
yields that X is differentiable,!” that is, of class Ck~!. 


Finally, we introduce 


Definition 2.7.17 (Normal and conormal bundle) Let N be a manifold and let 
(M, ~) be a submanifold of NV. 


1. The quotient vector bundle NM := (TN);y/TM is called the normal bundle of 
(M, ¢). Its fibres are called the normal spaces of M at m € M. They are denoted 
by NyM. 

2. The dual vector bundle N*M := (NM)* is called the conormal bundle of (M, ¢). 
Its fibres are called the conormal spaces of M at m € M. They are denoted by 
N*M 


m . 


Remark 2.7.18 


1. The normal and the conormal bundle of (M, g) are real vector bundles over M 
of class C‘~! and dimension dim N — dim M. According to Remark 2.7.1 1/3, 
NM is isomorphic to an arbitrary complement of TM in (TN) y, and it is of- 
ten realized in this way. For an example, see Exercise 2.7.6. According to Ex- 
ample 2.7.13, N*M is naturally isomorphic to the annihilator (TM)° of TM in 
(TN) jw. 

2. By acoorientation, or a transversal orientation, of (M, g) one means an orienta- 
tion of NM. Accordingly, (M, @) is said to be coorientable, or transversally ori- 
entable, if the normal bundle NM of (M, @) is orientable. This is consistent with 
the terminology for vector subbundles introduced in Remark 2.7.1 1/4: a coorien- 
tation of (M, ¢) is the same as a coorientation of TM in (TN) ;m. 

3. We discuss local frames in NM and N*M induced by local charts on N adapted 
to M. Denote r := dim M and s := dim N. For simplicity, we consider the case of 
M being a subset of NV. We leave it to the reader to write down the respective local 
frames for the general situation. According to Proposition 1.6.7, for every m € 
M, there exists an open neighbourhood U of m in M and a local chart (V, ¢) on 
N atm such that U C V and (U, pry) is a local chart on M, taking values in the 
subspace R” x {0} C R*. Then, {0;;y :i = 1,..., 5} is a local frame in (TN) pw 
whose first r elements span TM over U. According to Remark 2.7.11/2, then 
{ld;tu]:i=r+1,...,s} is a local frame in NM. This, in turn, induces a dual 
local frame {[0;;y]* :i =r +1,...,5} in N*M, see Example 2.4.1. According 


By construction, X is also the restriction of X in domain to the submanifold (M, g) and in range 
to the subbundle (TM, g’). This does not help for the argument though, because the latter need not 
be embedded, so that Proposition 1.6.10 does not apply here. 
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to Example 2.7.13, the natural isomorphism N*M —> (TM)° maps the latter to 
the local frame in (TM) consisting of the last s — r elements of the local frame 
{(do')ju :i=1,..., 5} in (TN) pm)* = (T*N) pu. 

4. Assume that (M, ~) is embedded. The subset 


Cy (N) ={f €C*(N): ¢* f =0} (2.7.1) 


is an ideal of the associative algebra C*(N), called the vanishing ideal of M. 
By means of this ideal, for m € M, the subspaces T,,M of Tgim)N and N¥ M = 
(Tn M)° of de (N) can be characterized as follows: 


9 (TmM) ={X € Tym N : X(f) =0 for all f ech ny}, (2.7.2) 


Ni M= {é € Too = df(g(m)) for some f € Ci (N)}. (2.7.3) 
The proof is left to the reader (Exercise 2.7.5). Beware that (2.7.2) or (2.7.3) need 
not hold if M is not embedded. A counterexample is provided by the figure eight 
submanifold (R, yi) of Example 1.6.6/2. At the crossing point, the derivative of 
any element of Cyr (N) vanishes. Hence, for the right hand side of (2.7.2) one 
obtains Tyim)N. 


Exercises 

2.7.1 Prove Proposition 2.7.5 by means of Proposition 2.3.15. 

2.7.2 Let (E, M, 2) be a K-vector bundle of class C*. Use a system of local trivial- 
izations and a subordinate partition of unity of M to construct a C*-function 
h: E @ E — K such that h,, := hype, gz,, is a scalar product on E,, for all 
memM. 

2.7.3, Show that every vertical subbundle admits a complement. 

2.7.4 Complete the proof of Proposition 2.7.14. 

2.7.5 Prove Eqs. (2.7.2) and (2.7.3) of Remark 2.7.18, characterizing the tangent 
and the conormal spaces of an embedded submanifold. 

2.7.6 Using the Euclidean metric, construct the normal bundle of the submanifold 
S” of R’*! as a complement of TS” in (TR’*!) sn. Is this bundle trivial? 


m 


Chapter 3 
Vector Fields 


In the first four sections, we discuss elementary aspects of the theory of vector fields. 
In Sect. 3.1 we show that it is fruitful to view vector fields as derivations of the alge- 
bra of functions on the manifold. Next, in Sect. 3.2, we discuss in detail the notions 
of integral curve and flow and, in Sect. 3.3 we introduce the Lie derivative of a 
tensor field with respect to a given vector field. Finally, in Sect. 3.4, we extend the 
notion of an ordinary vector field to that of a time-dependent vector field. Next, 
we pass to more advanced topics. In Sect. 3.5, we give an introduction to the the- 
ory of (geometric) distributions, a notion which generalizes that of a vector field: a 
distribution is a subset of the tangent bundle consisting of linear subspaces of the 
tangent spaces. Following the theory developed by Stefan and Sussmann, we dis- 
cuss the concept of integrability in some detail. The special case of a distribution 
of constant rank is built in here and the classical Frobenius Theorem occurs as a 
special case of a general theorem yielding integrability criteria. In the remaining 
four sections, we give an introduction to the study of the qualitative behaviour of 
the flows of vector fields.' In Sect. 3.6 we collect the basic notions related to crit- 
ical integral curves.” In Sect. 3.7, we introduce the concept of a Poincaré mapping 
and in Sect. 3.8 we pass to the study of elementary aspects of stability. This notion 
comprises a variety of concepts characterizing, in effect, two aspects of the long- 
time behaviour of a flow, namely, returning properties and attraction properties of 
integral curves or, more generally, of invariant subsets. Here, we limit our atten- 
tion to the concept of so-called orbital stability, to which for simplicity we refer 
to merely as stability. At the end of this section we briefly discuss the relation to 
Lyapunov stability, a notion physicists are probably more familiar with. Finally, in 
Sect. 3.9, we present the concept of invariant manifolds which plays a basic role 
in the analysis of the qualitative behaviour of flows near critical integral curves. 
In all of the four final sections, the reader will find a number of illustrative exam- 
ples. 


‘That is, to the theory of dynamical systems. 


Equilibrium points or periodic integral curves. 


G. Rudolph, M. Schmidt, Differential Geometry and Mathematical Physics, 93 
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From now on, we restrict attention to smooth manifolds and smooth mappings. 
Let M be a smooth manifold of dimension n. Recall from Chap. 2 that a vector field 
on M is a section of the tangent bundle TM and that the space of vector fields is 
denoted by X(M). Recall, furthermore, that for every local chart (U, «), the local 
sections 0; form a local frame in TM. 


3.1 Vector Fields as Derivations 


In this section, we will relate vector fields to derivations of the associative alge- 
bra C®(M) and use this to define their commutator. Let X € X(M). Recall from 
Sect. 1.4 that the tangent vectors X,,, m € M, define directional derivatives of func- 
tions and hence mappings X;, : C°°(M) — R. Thus, for f € C°(M), we can define 
a function X(f) on M by 


X(f)(m) = Xm(f), meM. 
This function is smooth, because its local representative with respect to a chart 
(U, k) is 
a(f o«7!) 
ax! ; 
where X! € C®(U) are the coefficient functions of X with respect to the local frame 
{0;}. By assigning to f the function X(f) we obtain a linear mapping 


X:C™~(M) > C™(M), 


X(f)ox! — (x ox‘) 


denoted by the same symbol. According to (1.4.11), it satisfies 
X(fg)=X(f)gt+ fX(g), frgec?(™), 
hence it is a derivation of the algebra C°(M iF 


Proposition 3.1.1 Vector fields on M correspond bijectively to derivations of the 
algebra C®(M). 


Proof It remains to show that every derivation of C°(M) is generated by a vector 
field. Thus, for a given derivation 5 define a mapping X;,:C™(M) > R by 
Xm(f) = ((f))(m) 
for every m € M. X,, is linear and satisfies 
Xm(£8) = (8(F8))(m) = ((A)g + f(g) mr) = Xm (fg) + f ()Xm(g), 


hence it is a derivation of C°(M) at m. Then, according to Proposition 1.4.7, 
Xm € TmM, and so the assignment m +> X,, defines a mapping X : M > TM 


3The derivations of C°(M) are frequently referred to as the first order differential operators on M. 
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with 2 o X =idy. By construction, X(f) = 6(f) € C~(M) for any fe C™(M). 
We show that this implies that X is smooth. In view of Proposition 2.3.10, it suf- 
fices to show that for every m € M there exists a local chart (U, «) at m such that 
the coefficient functions X! of X with respect to the induced local frame {0;} are 
smooth. According to Remark 2.3.2/4, by possibly shrinking U we may assume 
that «' = &{,, for some &' € C*(M), i =1,...,n. Then, Example 2.3.11/1 and 
Proposition 1.4.5/2 imply 


X' = Xju(«') = Xu (ey) = (X(k)) iy = (5(k')) iy 


thus showing that X‘ is smooth, indeed. 


For what follows, let us recall the notion of Lie algebra. 


Definition 3.1.2 A Lie algebra over K = R or C is a vector space g over K to- 
gether with a bilinear mapping [-,-] : g x g — g, called multiplication, satisfying the 
following axioms. 


1. Anticommutativity: [A, A] = 0 for all A € g. 
2. Jacobi identity: [A,[B,C]]+[B,[C, A]] +[C, [A, B]]=0 forall A, B,C eg. 


A homomorphism of Lie algebras g and h is a mapping y : g > § which is linear 
and satisfies p([A, B]) = [g(A), g(B)] for all A, Beg. 


One can check that the commutator [6 1, 62] = 5; o d2 — 52 o 6; of derivations 51, 
52 of an associative algebra 2 is a derivation of 2{ and that the commutator defines a 
Lie algebra structure on the space of all derivations of 2l (Exercise 3.1.1). Therefore, 
Proposition 3.1.1 allows for 


Definition 3.1.3 (Commutator of vector fields) The commutator of the vector fields 
X,Y on M is the vector field [X, Y] on M corresponding to the derivation [X, Y] = 
XoY—-Yox. 


Thus, X(M) with the commutator as a product is a real Lie algebra. 


Remark 3.1.4 


1. Recall that X(M), as a space of sections in a vector bundle over M, is naturally 
a C™®(M)-module. The commutator and the module structure are related by 


[fX, gY]= fglX, Y]+ fX(g)¥ — gY(f)X, (3.1.1) 


with X,Y € X(M) and f, g € C~(M). The proof is left to the reader (Exer- 
cise 3.1.2). 

2. Let (U,«) be a local chart on M and let X, Y € X(M). The coefficient functions 
of [X, Y] with respect to the local frame {0;} are given by (Exercise 3.1.4) 


[X,Y] = X/a;¥' — Y/a;x", (3.1.2) 
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3. Let V be a finite-dimensional real vector space, let M Cc V be an open subset 
and let X, Y € X(M), viewed as mappings M — V, cf. Remark 2.3.4/1. Using 
(2.3.2), we find that the mapping representing [X, Y] is given by 


[X, Y](v) = a, Y(v+tX(v)) X(v+tY(v)), veM. (3.1.3) 
0 


~ dt to 


By writing down this equation in terms of the global chart on M associated with 
a basis in V one recovers (3.1.2) in this particular situation. 

4. For the more general situation of M Cc V being an embedded submanifold, For- 
mula (3.1.3) generalizes to 


d = d = 
[X, Y](v) = a Y(v+tX(v)) - a X(v+tY(v)), veM, (3.14) 
0 0 


where X and Y are arbitrary smooth extensions of X and Y to an open neigh- 
bourhood of M in V. 


There are two further ways to view a vector field X as a mapping, namely, as a 
smooth function, 
X:TM—R, X(n):= (Xm), (3.1.5) 


where 7 € T;, M, and as the morphism of C°°(M)-modules induced by the natural 
pairing (2.5.1), 


X:2'(M) > C®(M), X (a) := (a, X). (3.1.6) 
In calculations involving vector fields one often switches forth and back between 
these viewpoints, as for example in the proof of the next proposition. 
Proposition 3.1.5 Let M and N be manifolds and let p € C°(M, N). Let X, X1, 
X2 be vector fields on M and let Y, Y, Y2 be vector fields on N. 


1. X is g-related to Y iff the corresponding derivations satisfy X 0 g* = 9* oY. 
2. If X; is y-related to Y;,i = 1,2, then [X,, X2] is g-related to [Y,, Y2]. 


If g is a diffeomorphism, then, in addition, for the transport operator ps the 
following holds. 


3. As a derivation of C®(N), xX is given by o,X = (g7!)* 0 X og". 
4. [p.X1, PxeX2] = xl X1, X2]- 


Recall that if g is a diffeomorphism, ¢,.X is the unique vector field on N which 
is g-related to X. Hence, assertions | and 2 hold in particular for X and Y = g,.X. 


Proof 1. X is g-related to Y iff gp’ o X = Y og, hence iff (9 Xm)(f) = Youn) (f) for 
allm € M and f € C®(N). According to (1.5.3), the left hand side can be rewritten 
as (X 0 y*(f))(m), whereas the right hand side yields (y* o Y(f))(m). 
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2. On the level of derivations, one has 
[X1, X2.]0g* = X10 X20g* — X20 X, 0g" 
=p o¥jol—g*oYn0V 
= ¢* o[Y%, Yo]. 


Points 3 and 4 follow from points 1 and 2 by putting Y = g,X and Y; = ¢,.Xj, 
respectively. 


Corollary 3.1.6 Let (N,y) be a submanifold of M. Let X,Y € X(M) be tangent 
to N and let X,Y € X(N) be the restrictions to N, see Proposition 2.7.16. Then, 
[X, Y] is tangent to N and [X,Y] is its restriction. 


Proof By Proposition 2.7.16, xX and Y are g-related to X and Y, respectively. By 
Proposition 3.1.5/2, the vector field [xX , Y]on N is g-related to [X, Y]. This implies 
that [X, Y] is tangent to N and [X, Y] is its restriction. 


Corollary 3.1.6 reproduces the following direct consequence of Proposition 1.4.5: 


LX, Ylpu = [Xyu. Yu] forall X,Y e X(M), U CM open. (3.1.7) 


Exercises 

3.1.1 Let 2 be an associative K-algebra and let Der(2l) denote the K-vector space 
of derivations of 21. Show that 
(a) for Dy, Dz € Der(2l), the commutator [D;, D2] := D, 0 D2 — D2 0 Dy is 

in Der(2l), 

(b) Der(2l) with the commutator as a product is a Lie algebra. 

3.1.2 Prove Formula (3.1.1). 

3.1.3 Compute the commutator of X; = (x tye = - and Xz = (y*+ Ne +25 
on R?. 

3.1.4 Verify Formula (3.1.2). 

3.1.5 Use (2.3.2) to verify (3.1.3). 


3.2 Integral Curves and Flows 


Let M be a manifold and let X be a vector field on M. Consider a smooth curve 
y:I — M, where I C R is some open interval. According to Example 1.5.6, the 
tangent vector y(t) of y at t € J is defined to be the tangent vector of M at y(t) 
given by the curve sth y(t+5). 


Definition 3.2.1 (Integral curve) Let X € X(M). A smooth curve y on M is an 
integral curve of X if 


y(t)=Xyq) forallre Tl. (3.2.1) 
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Less formally, an integral curve of a vector field is a smooth curve whose tangent 
vectors are given by the values of this vector field along the curve. In terms of 
derivations, the defining condition is equivalent to 


d 
Xy(f) = v7 (foy) forall f eC (M). 


By an extension of an integral curve y of X we mean an integral curve y of X which 
extends y in the sense of mappings. 


Definition 3.2.2 (Maximality and completeness) An integral curve of a vector field 
on M is called maximal if there does not exist a proper extension. It is called com- 
plete if it has domain J = R. A vector field is called complete if its maximal integral 
curves are complete. 


We analyse Eq. (3.2.1) in a local chart (U,«) on M. According to (1.5.5), for 
t € I such that y(t) € U, (3.2.1) is equivalent to 


d 
rf ‘ (Koy)(s) = x): (3.2.2) 
Using the notation x(t) := « o y(t) and Xi(x) = seater one may rewrite (3.2.2) 
as 
PVH rED),. Laat (3.2.3) 


This is a system of ordinary first order differential equations with smooth coeffi- 
cients on the open subset «(U) of IR”. The fundamental existence and uniqueness 
theorems for the solutions of such systems state the following. 


(a) For every xo € x(U) there exists a unique solution Ix, 5 f +> x(t; x9) € K(U) 
which is maximal among all solutions defined on open intervals containing 0 
and satisfying the initial condition x(0; xg) = Xo. 

(b) The set J= sient Ix, X {Xo} is open in R x «(U). 

(c) The mapping J > x (U), (ft, X90)  X(f; Xo), is smooth. 


Theorem 3.2.3 (Existence and uniqueness of maximal integral curves) Let M be 
a manifold and let X € X(M). For every m € M there exists a unique maximal 
integral curve Vm: Im —> M of X with ym(0) =m. 


Proof Let m € M be given. According to the points (a)—(c) above, the fundamental 
existence and uniqueness theorems for solutions of (3.2.3) imply that 


1. integral curves through m exist, 
2. if two integral curves through m coincide at t, then they coincide on an open 
interval containing f. 


Uniqueness follows from 2. To prove existence, let Jj, be the union of the do- 
mains of all integral curves through m. Due to | and since the domain of any in- 
tegral curve through m is an open interval containing the common point 0, J, is a 
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nonempty open interval. Thus, in order that all the integral curves through m com- 
bine to a single maximal integral curve y,, with domain /,,,, it remains to show that 
any two of them, say y; : 1; ~ M and y2: In > M, coincide on their common 
domain 1; M Ip. Let J C 1 ty be the set of ¢ such that yj (t) = yo(t). Since J con- 
tains 0, it is nonempty. Due to 2, J is open in 1; N In. J is also closed: if a sequence 
{t,} in J converges to a point ¢ in J; M J» then, by continuity of y;, 


yw) = lim y(t) = lim y2(tn) = y2(t). 
n—->Oo fi OG 


Thus, J is a nonempty open and closed subset of the interval J) M Jy. This implies 
J =1, 1 If and hence the assertion. 


Corollary 3.2.4 Fort € In, ly, ={s € Ris +t € In}. Fors € I, (1), one has 


Yym (t) (5) = Vm (t + S). 


Proof Denote In, —t := {s € R:s +t € In}. The smooth curve y : I, —t > M, de- 
fined by y(s) := Ym (t +5), is an integral curve of X through y (0) = ym (t). Hence, 
In —t C I, (t). Interchanging the roles of m and y(t) and applying the same argu- 
ment with ¢ replaced by —t, one finds Jy, 7) +t C Im. It follows that I, —t = Jy,, (1) 
and that yy,, (1) (s) = v (8) = ym(t + 8) for all s € In — t. 


The maximal integral curves combine to what is called a flow.* 


Definition 3.2.5 (Flow) Let Z be an open neighbourhood of {0} x M in R x M and 
let 6: Y— M be asmooth mapping. For m € M and t € R, denote 


Dm = {t ER: (t,m) € Fj, J, :={meM:(t,m)€ D} 
and let By, : Dn — M and ®, : D; > M denote the induced mappings given by 
@,p(t) = &,(m) = P(t, m). 


The mapping @ is called a (smooth) flow on M if the following holds: 


1. ®-~o9 = idy, 

2. if (s,m) € F and (t,®;,(m)) € Y, then (s + t,m) € ZY and @,(,(m)) = 
P,44(m), 

3. Yn is connected and hence an open interval for all m € M, 


4. &(H) C Z_; for allt eR. 


A flow is called maximal if it does not admit an extension, in the sense of map- 
pings, which itself is a flow. It is called complete if 7 = R x M. 


4This name alludes to the intuitive picture of the points of M moving synchronously along their 
individual maximal integral curves. 
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Now, for a given vector field X, by analogy with the local system (3.2.3), we 
form the subset 


G:= U Im x {m} (3.2.4) 
meM 


of R x M and define the mapping 
@P:J->M, P(t,m) := Yn (t), (3.2.5) 


where Yn : Im — M is the maximal integral curve of X through m provided by 
Theorem 3.2.3. 


Proposition 3.2.6 (Vector fields and flows) The assignment of ® to X given by 
(3.2.4) and (3.2.5) defines a bijection between vector fields and maximal flows on M. 
Complete vector fields thereby correspond to complete flows. 


We will refer to ® given by (3.2.4) and (3.2.5) as the flow of the vector field X. 
If we have to deal with the flows of several vector fields we will occasionally write 
@X* and Y*. The notion of flow extends in an obvious way to local vector fields: by 
definition, the flow of the local vector field X over U C M is the flow of the vector 
field X on the manifold U. 


Proof First, let X ¢ X(M). We must show that @ is a flow. By Theorem 3.2.3, 
we have {0} x M Cc QJ. Recall the consequences (a)—(c) of the fundamental exis- 
tence and uniqueness theorems for solutions of the local system (3.2.3), listed be- 
fore Theorem 3.2.3. Due to (b), Y is open in R x M. Due to (c), ® is smooth. 
Properties 1, 3 and 4 of Definition 3.2.5 hold by construction and property 2 follows 
from Corollary 3.2.4. To check the maximality property, let @ be an open subset 
of R x M containing J and let 6 : Y—> M be a smooth mapping possessing the 
properties 1-4 and satisfying & 1g = ®. Let m € M. Due to property 3, the mapping 
Dm : Dm — M is acurve. Using property 2, for t € Dm one obtains 

d 


s dws d > 
dst, On(s) = a ro Ps Pr(em)) = ds ty D;(;(m)) = Xm)" 


This shows that Dm is an integral curve of X through m. It follows that Dn CDn 
for all m € M. This implies Y= J and b=. 

Conversely, let @ : Y— M beamaximal flow on M. Forme M, Pp, : Dm —>M 
is a smooth curve through m and hence defines a tangent vector 


d - 
Xmi= a oom (t). 


The assignment of X,, to m defines a section X : M — TM. Its local representative 
with respect to a chart (U, «) on M and the induced chart on TM is given by 


Xb (x iG ° &(k—!(x), ‘). 


Hence, it is smooth. Therefore, X is a smooth vector field on M. 
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Fig. 3.1 Domain J of the f 
flow of the vector field 
X(x)=x’d, on M=R 


Finally, we have to check that the above assignments between vector fields and 
flows are inverse to one another. Given a vector field X, the vector field assigned to 
the flow of X obviously coincides with X. Given a flow @ : Q->M,\et®:9> 
M be the flow of the vector field defined by &. Then, & and @ coincide on YN J. 
Since both @ and ® are maximal, it follows that Z = J and & = @. This proves 
the proposition. 


Example 3.2.7 


1. Let M =U CR" be an open subset and let X = 0;. The system (3.2.3) reads 
x = e; and the unique maximal solution with initial condition x(0) = x is given 
by x(t) =x-+ te;, where ¢ € R is such that x + se; € U for all s € [0, t]. Hence, 


= { (t,x) ERxU:x+se; €U foralls € [0, t}}, @, (x) =x-+ te;. 
2. Let M=R and X (x) = x7d,. The system (3.2.3) consists of the single equation 


x = x*. Integration with initial condition x (0) = x yields 


x(t) = 


1—tx 
This is defined for all t < 1/x. Hence, the flow is given by 


—tx! 


1 
={eneRxmir< ot, @,(x) = 
x 1 
The domain ZY is shown in Fig. 3.1. In particular, X is not complete. 


We use Example 3.2.7/2 to show that neither the sum nor the commutator of 
complete vector fields need be complete. Let M = R? with coordinates x and y. For 
fi. f2 € C~(R), the vector fields 


XX, y) = fi(y) dx, X2(x, y) = faQx)dy 
are complete and have the flows 
Ox y=(e+ifi.y) OP? y)=(% y+ th). 


For fi(s) = fa(s) = 58°, the sum X + X2 is tangent to the diagonal A C M defined 
by x = y. Hence, by Proposition 2.7.16, it induces a vector field X on A. Using the 
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global coordinate x on A, one finds X = x7d,, which was shown above to be not 
complete. Hence, X; + X2 is not complete. For fj (s) = s and f(s) =s7, we have 


[X1, Xo](x) = —2xydr + y7dy. 


This vector field is tangent to the y-axis. Using y as a global coordinate on this 
submanifold, for the induced vector field one finds again X = yay. Hence, [X1, X2] 
is not complete. 


Example 3.2.8 (Linear vector fields) Let V be a finite-dimensional real vector space. 
In terms of the natural representation of vector fields on V by smooth mappings 
X:V— V, cf. Remark 2.3.4/1, the system (3.2.1) reads 


b= X(v(). (3.2.6) 


A vector field on V is called linear if the corresponding mapping is linear, that is, 
X(v) = Av for some A € End(V). In this case, X is said to be generated by A. The 
solution of (3.2.6) with initial condition v(0) = vo is 
[o,@) 
tA n 
vit)=el4u, 4a > aes : 


n! 
n=0 


where (t A)” means n-fold composition of the endomorphism tA with itself.> Thus, 
X is complete and its flow @ is given by the one-parameter group of automorphisms 
of V 


®,=e4, teR. (3.2.7) 


Conversely, if ® is a one-parameter group of automorphisms of V, the generating 
vector field is given by Xy = i to @,(v) = (f yp PH (Y)> that is, it is linear and cor- 


io Pe of V. 


For later use, we determine the flow &,; = e!4 explicitly. The necessary calcula- 
tions are left to the reader (Exercise 3.2.2). Choose a basis in V such that the matrix 
associated with A has Jordan normal form with Jordan blocks B,,..., B,. Let 4; 
denote the eigenvalue of A with nonnegative imaginary part corresponding to B; 
and let m; denote its multiplicity. Let N; denote the (k x k)-matrix with entries 1 
on the upper secondary diagonal and 0 elsewhere. For 1 = a + i6 with a, real, 


define 
_|% —-B 
ma =|[ 4 ca 


responds to the endomorphism ¢ 


>The exponential series ae, A. is convergent for all A € End(V). This follows from the fact that 
the operator norm on End(V) induced by an arbitrary norm on V satisfies ||A”|| < || All”. 


®L.e., the dimension of the corresponding Jordan block of the extension of A to the complexification 
of V. 
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For a complex (k x k)-matrix C, let 94(C) be the real (2k x 2k)-matrix obtained by 
replacing each entry C;; by the (2 x 2)-block R(C;;). The Jordan normal form of 
A is given by 


Aim; + Nin; Ki ER, 
diag(B,,...,B-), B= ; ‘ = 3.2.8 
iag( 1 2) i | R(Aj1m, ah Nm;) hie d R. ( ) 
Then, e’4 is represented by the matrix diag(e’#', ..., e’8"), where 
pni-t 
1 t (m;—1)! 
aha tt! O %. (3.2.9) 
ee t 
(a) 1 
for A; € R and 
Daj t Dit aa Dgit 
ef Bi = ert @) Tey — : D,a- cos @ —sing 
: ; F , 2 sing cosd 
: . t Dgit 
0 vee 0 Dgit 


(3.2.10) 


for 4; ¢ R. For a detailed discussion of the linear vector fields on R2, see Exam- 
ple 3.6.13. 


Remark 3.2.9 
1. Successive application of the vector fields X;,..., X; toa function f € C™(M) 
yields 
_d d X xX, 
(X,0-++-0 X1(f))(m)=— ---— f(G,!0---0@"(m)). (3.2.11 
dti}o = dt jo 
In particular, for X,Y e X(M), 
d d d d 
Dee 5 =— — f(o(ox a a FiO (ae ; 
GW) a aig yd We WOE OO) ariel Os (Pe) 


(3.2.12) 


Furthermore, for X € X(M), f € C(M) and m € M, Taylor expansion of the 
smooth function tr f (px (m)) at t = 0 and computation of the derivatives 
using (3.2.11) yields the following Taylor formula for manifolds: 


nk 
F(®X am) = 0 (XA) im) + Ot"), FE Dp, (3.2.13) 


k=1 
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where O(t”*!) is a smooth function on ZX such that O(¢"*!)/1"*! is bounded. 
Repeated application of this formula yields the iterated Taylor formula for man- 
ifolds 


n 


k ol 
f(@*(@¥ (m))) = > —Y (X*(f))Gm) + O(2"*7, s* tt) (3.2.14) 
k,l=1 
which can be easily generalized to an arbitrary number of vector fields. 

2. Let (N,@) be a submanifold of M, let X be a vector field on M which is tan- 
gent to N and let X be the restriction to N, see Proposition 2.7.16. The flow 
®:Y-> N of X is related to the flow ® : J > M of X as follows. The domain 
9 consists of the pairs (t, p) in R x N satisfying (t, p(p)) € Z and ®s(y(p)) € 
v(N) for all s between 0 and t. The mapping @ fulfils (;(p)) = ;((p)). 


In the remainder of this section, we derive the basic properties of flows. 


Proposition 3.2.10 Let X be a vector field on M and let : Y— M be its flow. 


1. For every t € R such that Y, is nonempty, ®, is a diffeomorphism from Y, onto 
Y_, with inverse B_,. 
2. For every t € R and every s between 0 and t, one has J, C Y; and ®s(H) C 
Dis. 
. If ® is complete, then ®, o Bs = Di+5 forall t,s ER. 
4. Fors ER, let 6°* ; JX — M be the flow of the vector field sX. Then, for all 
teR, 


io) 


BX = De, @* — @y,. (3.2.15) 


Since every flow satisfies ®p) = idy, point 3 states that a complete flow defines an 
action of the additive group R on M.’ Complete flows are therefore also called one- 
parameter groups of local diffeomorphisms. Generalizing this terminology, flows 
are sometimes referred to as local one-parameter groups of diffeomorphisms. 


Proof We prove assertion 1. By the defining property 4 of flows, we have ®,(Y,) C 
Q_, and ®_;(Y_+) C Y,. Applying ®; to the second relation and using property 2, 
we obtain Z_; C &,(Y,). Thus, ;(Y,) = D_; and S_;(D_;) = J. By the prop- 
erties | and 2, for every m € Y,, we have ®_;(®,;(m)) = Gy(m) = m. The proof of 
assertions 2 and 4 is left to the reader (Exercise 3.2.3). Assertion 3 is obvious. 


Proposition 3.2.11 Let X be a vector field on M, let ®: Y— M be its flow and let 
meM. 


1. If Xm =0, then Jy, =R and ®,(t) =m for allt ER. 
2. If Xm #0 and ®,, is injective, (Gn, Bm) is a submanifold of M diffeomorphic 
to R. 


7Group actions on manifolds will be treated in detail in Chap. 6. 


3.2 Integral Curves and Flows 105 


3. If Xm #0 and ®,, is not injective, Jy», = R and the image of ®,, is an embedded 
submanifold of M diffeomorphic to S'. 


In particular, the images of the maximal integral curves of X carry natural sub- 
manifold structures. It will follow by a general argument in Section 3.5 that these 
submanifold structures are initial and hence unique, cf. Remark 3.5.16. Of course, 
this is relevant in the situation of point 2 only. 


Proof 1. 1f Xm =0, the curve y : R > M, y(t) :=™m, is a maximal integral curve 
of X through m. Hence, by uniqueness, J, = R and J, = y. 

2. and 3. If X, 40, then Xq,,(r) # 0 for all t € F,,, because otherwise ®,, would 
be constant by point |. It follows that ®,, is an immersion. Now, if ®,, is injective, it 
is a submanifold diffeomorphic to the open interval Y,, and hence to R. Otherwise, 
there exists a minimal positive T € Y, such that (7) =m. Then o,(m) = Bn, 
hence 2T € Y,y, and Yo,7(m) = Ym. Iterating this argument one finds Y,, = R. For 
every t € Randk €Z, 


Pm(t +kT) = Piper (m) = O;(m) = On (Ct). (3.2.16) 


Consider the topological quotient R/ZT of R by the equivalence relation t ~ s iff 
(t — s) =kT for some k € Z. By choosing representatives in the open intervals 
(0, T) or (-4, 5), respectively, one obtains two local charts which are smoothly 
compatible and hence define a smooth structure on R/ZT. With respect to this 
smooth structure, R/ZT is diffeomorphic to the sphere S!. Due to (3.2.16), ®n 
descends to an immersion Diy :R/ZT — M. Since T was chosen to be minimal, 
Pin is injective and hence defines a submanifold of M whose image coincides with 
the image of @,,. By Remark 1.6.13/2, this submanifold is embedded. This yields 
the assertion. 


Proposition 3.2.12 On a compact manifold, every vector field is complete. 


Proof Let 8: Y— M be the flow of a given vector field and let m € M. Assume 
that FY, = (a, b) with b < oo. Choose a sequence {t,} in J,» converging to b. By 
compactness of M, the sequence {y (f,)} has a cluster point m € M. Since F is an 
open neighbourhood of {0} x M in R x M, there exists ¢ > 0 and a neighbourhood 
U of m in M such that (—e, €) x U C Y. By construction of m, there exists tf € By, 
such that |b — t| < € and @,,(t) € U and hence (b — t, &;(m)) € B. Since also 
(t,m) € Y, property 2 of Definition 3.2.5 yields (b — ft) + t= b € Y,, (contradic- 
tion). Hence, b = oo and, analogously, a = —o. 


Proposition 3.2.13 (Flow and transport) Let M and N be manifolds. Let g € 
C°(M,N), X €X(M), Y € X(N) andt eR. 


1. If X and Y are ¢-related, then oD) C go and py OPrgX =PO px, 
2. If y is a diffeomorphism, then gex a o(DX) and 
4 X a 
oP = Qo ox ° (y ") 


we 
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3. Ifin addition M = N and 9,,.X = X, then g(DX) = ar and 


gos =O og ign. 


Proof 1. Let m € 9*. By assumption, for any s € 9X, 
ds p82 Pm (8) = OX ox (5) = YpowX(s)> 
hence @ o ox is an integral curve of Y with initial condition g o ox (0) = g(m). 
Therefore, 
go x (m) =o &* (s) = Dm) (8) = oY og(m), se 9X. 


Since t ¢ YX, this yields the assertion. 
1 


2. Apply point 1 to X and Y = g,.X. Then, replace X by g,.X and g by g~. 
3. This is an immediate consequence of point 2. 


Example 3.2.14 Let (U, «) be a local chart on M. We determine the flow of the local 
vector field 0; on U. Using the diffeomorphism x : U > «(U), Proposition 3.2.13/2 
and Example 3.2.7, we obtain 


QZ ={(t,m) €R x M:k(m) + se €«(U) for all s €[0, ¢]}, 
oii (m) =k! (k(m) + tej). 


That is, the maximal integral curve through m is given by the connected component 
of m of the i-th coordinate line through m, parameterized by the coordinate itself 
minus «'(m). 


Proposition 3.2.15 (Flows of commuting vector fields) Let M be a manifold and 
let X,Y € X(M). We have [X, Y]= 0 iff 


@* o &! (m) = 6Y 0 &X(m) (3.2.17) 


for all t = to,s = 59 € R and m € M such that both sides are defined for all t 
between 0 and to and s between 0 and so.° 


Proof First, assume that @* and ©” commute. For every m € M there exists 
€é > 0 such that both sides of (3.2.17) are defined for t,s € (—e, €). Thus, in view 
of (3.2.12), Formula (3.2.17) implies [X, Y](f) = 0 for all f € C°(M), hence 
[X, Y]m = 0. Conversely, assume that [X, Y] = 0 and let m € M and fo, so € R be 
such that both sides of (3.2.17) are defined for all t between O and fg and s be- 
tween 0 and sg. In the following, t and s are assumed to satisfy these inequalities 
without further notice. To prove (3.2.17) it suffices to show that, for all t, the curve 
s t+ &X ob" (m) is an integral curve of Y through the point & (m). This is equiv- 
alent to 


/ 
(@*) Yor m) = Vox o@¥ (m) (3.2.18) 


8One says that the flows of X and Y commute. 
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for all ¢ and s. Fix ¢ and s and denote p := px fo) BY (m). Equation (3.2.18) is 
equivalent to 


(O.1)'Yox (py =Yp (3.2.19) 


for all u between 0 and t. The left hand side of this equation is smooth in u, because 
its coordinates with respect to the chart on TM induced by a chart (U, «) on M are 
given by «!(p) and 


, d : 
(Pr) Yor p)(e) = ge Pu 2%: 0 P20): 


Since the right hand side is a partial derivative of a smooth function of the two 
variables u and s, it is smooth in uv. Hence, we may differentiate the left hand side 
of (3.2.19) with respect to u, and we can conclude that this equation holds if 


d d 
S (ex) = (S(O) Yex0n) =o 


where py := px (p). Since (OF), is a linear isomorphism from T,, M to T,M, 
the latter equation holds if 
d 
dv ro 
for all gq € M. Applying the left hand side to f € C°(M) and using that the evalu- 


ation of a tangent vector at g on f is a linear mapping from T, M to R, as well as 
the product rule (2.2.8) and Formula (3.2.12), one obtains 


d ; d ; 
(5; _((er) You.) = Gog l(Or) Fox (P) = X1a Pf). 


Since, by assumption, [X, Y] = 0, this yields (3.2.20) and hence proves the propo- 
sition. 


((@2)' Vox q) = (3.2.20) 


To formulate the last of the basic properties of flows to be discussed here, we 
need 


Definition 3.2.16 (Flow box chart) Let M be an n-dimensional manifold and let X 
be a vector field on M. A flow box chart for X is a local chart (U, «) on M such that 
X;}y = 0, and k(U) = (—a, a) x V for some a > 0 and some open neighbourhood 
V of the origin in R’~!. 


Proposition 3.2.17 (Straightening Lemma) Let M be a manifold of dimension n, 
let X € X(M) and let @ be its flow. 


1. A local chart (U, «) on M with k(U) = (—a,a) x V CR x R""! is a flow box 
chart for X iff for all s € (—a,a), y € V andt € R such that |s + t| <a we have 


(k oD, ox ')(s,y)=(s +t, y). (3.2.21) 


2. For everym € M such that Xm #0 there exists a flow box chart for X atm. 
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Proof 1. Let « be such that Eq. (3.2.21) holds. Differentiation at t = 0 yields X! = 1 
and X' = 0 for i =2,...,n, so that X}u = 01, indeed. Conversely, if « is such 
that X}y = 01, then the local system (3.2.3) associated with X by (U, x) is that of 
Example 3.2.14. This yields the assertion. 

2. Since Xj #0, there is a local frame {Xj,..., X,} in TM at m such that 
X, = Xyy. There exists a > 0 such that the mapping 


W :(—a,a)" > M, W(t) := Of! 0---0 OX") 


is well-defined. Since Wo,....0) coincides with the vector space isomorphism R” — 
TM induced by the basis {X1.m,..., Xn,m}, according to the Inverse Mapping 
Theorem, a can be chosen so that U := W((—a,a)") is open in M and W isa 
diffeomorphism onto U. Then, (U, W-!) is a local chart on M at m. Since X lu = 
X1;u, forall s € (—a, a), te (—a, a)"—! and t € R such that |f + 5| <a, we obtain 


xX} 
st 


Xn 
tn-1 


(Wo d, oW)(s,t) =! (GX 0 O72 0-0 OX" (m)) =(s +1, 0). 


Hence, by point 1, (U, yw!) is a flow box chart. 


Exercises 


3.2.1 Along the lines of Example 3.2.8, determine the flow of the linear vector field 
on R? given by the matrix 


Qo t 
A= 0 », p,D>O0. 
Find a mechanical system whose time evolution in phase space is given by 
this flow. 

3.2.2 Determine e4 for a real n x n matrix A of Jordan normal form, see Exam- 
ple 3.2.8. 

3.2.3 Complete the proof of Proposition 3.2.10. 

Hint. For proving assertion 2, use the defining property 2 of flows and the fact 
that Z,, is connected. 

3.2.4 Determine the flow of the vector field X = x'd; on R” and give a geometric 
interpretation. 

3.2.5 Consider the vector fields X, = —x20, + x!@) and Xz = x!d, + x74) on R?. 
Show that X; and X2 commute. Check that X, is invariant under the transport 
by the flow of X2 and vice versa. Relate X; and X> to polar coordinates, see 
Example 1.1.11. 

3.2.6 Determine the flows of the vector fields 


X1 =x7d3 — x7, Xo =x, —x'ds, X3 =x! —x7a 


on R? and find a geometric interpretation. 
3.2.7 Construct flow box charts for the vector fields of the previous exercises. 
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3.3. The Lie Derivative 


The Lie derivative with respect to a vector field X on M extends the directional 
derivative along the integral curves of X from functions to tensor fields T on M. 
The idea behind is to compare the value of T « I” (14M ) at a given point m with 
its values along the integral curve through m, transported back to m by the flow 
®:Y-— M of X and to pass to the differential quotient. More precisely, for every 
(t, m) € Y, one can apply the transport operator associated with the diffeomorphism 
P_;: G+ —> YF, to the tensor field 7}g_, induced on the open subset Z_, and 
evaluate the tensor field on J, so obtained at m. Thus, we obtain a mapping” 


Q>TiM, — (t,m)+> (G47) m. (3.3.1) 

This mapping is smooth: for the variable m, this is obvious. To show smoothness in 

t, due to (2.5.7) and (2.5.8), it suffices to consider the cases where T is a function 

or a vector field. The case of a function is obvious. The case of a vector field was 

explained in the proof of Proposition 3.2.15, cf. Eq. (3.2.19). Thus, we can define a 
mapping 

d 
LxT :M > TEM, (LyT)m = a (D_jxT)m- (3.3.2) 
0 
Since YT is given by the partial derivative!” of (3.3.1) with respect to the first vari- 


able, evaluated on the vector field (4, 0) and restricted to the submanifold {0} x M, 
it is smooth. Thus, “xT is a tensor field of the same type as T. 


Definition 3.3.1 The tensor field Zy T is called the Lie derivative of T with respect 
to X. 


According to (2.5.6), 


d 1\@ e ((®_+)')® To, (m) > Tn 
(LxT)in = dt (le ) To,(m)) = batt t , 


This justifies the intuitive picture given in the beginning. For further use, we note 
that 


d 
a ((®_1)aT),, = ((P-1)x(LxT)),,» G35) 


see Exercise 3.3.1. For functions and vector fields, the Lie derivative can be ex- 
pressed in terms of operations we know already. 


Proposition 3.3.2 For a vector field X on M, one has 


1. Lyf =X(f) forall f ¢C°(M), 
2. LY =[X,Y] forall ¥ ¢ X(M). 


°We suppress the restriction of T to Z_y. 
'0See Remark 2.2.10. 
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Proof 1. This is obvious. 

2. According to Proposition 3.1.1, it suffices to show that (YyY)(f) = 
[X,Y](f) for all smooth functions f on M. Using (2.5.6) and the fact that ap- 
plication of tangent vectors at m to f is a linear mapping from T,,M to R, we 
calculate 


d ray d d a 
(LxY)m(f) = di 1 ((P=1) Yoxim))(f)) = died (fo 4,0 Bf 0 O;*(m)). 
By the product rule (2.2.8) and by (3.2.12), the right hand side equals!! 


d d 
ds fo dt lo 


d d 
(PE (®E(m))) — Fae F (PE (BS) = LX, Ym) 


The next proposition collects the basic properties of the Lie derivative. 


Proposition 3.3.3 Let X be a vector field on M. 


. LY(T @S)=(LxT) @S+T ® (LxS) for all tensor fields T, S. 
. Ly(a A B)=(Lya)A B+ad (LxB) forall a, B € 2*(M). 

. LyoC=Co LX for every contraction operator C of tensor fields. 
. L&y, Ly] = Zx.yy forall Y €X(M). 

. Ox 0 Ly = Lo, x 0 py for every diffeomorphism yg: M > N. 


nA WN eR 


The first two assertions state that the Lie derivative establishes a derivation of the 
algebra of tensor fields and of the exterior algebra of differential forms, respectively. 


Proof 1. By (2.5.8) and the product rule (2.2.6), 


d 
(Lx(T @S)) = ann ((D14T)m ® (B-145)m) 


d 
-(4 @-nDn) @ Sn + Tin @ (= (©-105)n) 
dt fo 


= ((ZxT) @S+T @(&xS)),,. 


A similar calculation yields point 2. 

3. Since y and C are linear and local operations, that is, (ZyT)., and C(T) 
depend on the values of 7 in an arbitrarily small neighbourhood of m only, it suffices 
to prove the assertion for T =a] @---@ap @Y| ®---@ Yq with aj € 2'(M) and 
Y; € X(M), cf. Remark 2.5.2. Then, in view of point 1, it suffices to show that 
Lyla, Y) =(Lxa, Y) + (a, ZyY) for all Y € X(M) and a € Q!(M). Indeed, by 
(2.5.7) and the product rule, 


d d 
Lyla, Y)= a (D_1(a, Y)) = — (®_p,01, By V) = (Lea, Y) + (a, LZrY). 
to dt to 


'I'This is the same calculation as in the last step of the proof of Proposition 3.2.15. 
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4. Again, it suffices to prove the assertion for tensor fields of the form a; ® 
“Qa, @Y| @---@Yy with aj € 92'(M) and Y; € X(M). First, we show that 
if the assertion holds for T and S, then it holds for T @ S. Since “xy and Yy are 
derivations of the algebra of tensor fields and since the commutator of derivations is 
a derivation, we have 


[Lx, LUT @S) = (LLx, ZyIT) ®S+T ® ([-Lx, HIS). 


Since by assumption the assertion holds for T and S individually, and since x,y] 
is a derivation, the right hand side equals “%x,y}(T ® S), indeed. Thus, it suf- 
fices to prove the assertion for vector fields and 1-forms. For vector fields it 
follows from Proposition 3.3.2/2 and from the Jacobi identity for the commu- 
tator. To prove the assertion for 1-forms, we first observe that, due to Proposi- 
tion 3.3.2/1, it holds for functions. Hence, for a € 2!(M) and Z € X(M), we 
have [.2y, Yy](a, Z) = LYix,y}(a, Z). Using point 3, the right hand side can be 
rewritten as (Z[x,yja, Z) + (a, YZfx,y;Z) and the left hand side takes the form 
([Lx, Lyla, Z) + (a, (Ly, Ly]Z). Since the assertion holds for Z, it then follows 
for a. 

5. Using Proposition 3.2.13/2 and (2.5.6), for T ¢ '(T,M) and p € N one ob- 
tains 


d 


((-0T))) = 3 ((¢')°((@-)«T),-1¢p)): 


d 
(Zp,x(YsT)),, = 4 dt fo 


dt} 


Here, (y’)® stands for the linear mapping (T’) g-(p)M > (Tp N. Hence, the right 
hand side equals (g’)®(* jp P—t#T o-l(py) = (Gx LxT)) p- 


Exercises 
3.3.1 Use (2.5.6) to prove Formula (3.3.3). 


3.4 Time-Dependent Vector Fields 
Let M be a manifold and let J C R be an open interval. 


Definition 3.4.1 (Time-dependent vector field) A time-dependent vector field on M 
is a smooth mapping X: J x M — TM such that X(t,m) € T,,M for all (t,m) € 
J x M.A smooth curve y : J > M is an integral curve of X if J Cc J and 


y(t) =X(t,v@) for all t € J. (3.4.1) 


The properties of maximality and completeness of an integral curve and of com- 
pleteness of a time-dependent vector field are defined analogously as for ordinary 
vector fields. By definition, for every t € J, the induced mapping X; : M > TM, 
X;(m) := X(t, m), is a vector field on M. It is, therefore, common to denote time- 
dependent vector fields by X;, with t € J understood. Note that the parameter of an 
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integral curve coincides with the parameter of the family. This means that while a 
point moves along an integral curve, the vector field which this curve is tangent to 
changes synchronously. 

In the sequel, for simplicity we assume J = R. To determine the flow of X one 
decouples the parameters of the curve and of the vector field by passing to the ordi- 
nary vector field X on R x M defined by 


Xom :=((t,),X(@,m)), teR, meM, (3.4.2) 
where we have used the natural identification T(R x M) = (R x R) x TM. Let @: 


QZ — Rx M denote the flow of X and let prz :R x M > Rand pry :RxM—> M 
denote the natural projections. 


Proposition 3.4.2 For every m € M and to € R, there exists a unique maximal in- 
tegral curve y : I — M of X with initial condition y (to) = m. It is given by 


1=Qiymyt+to, — y(t) = pty 0PG—w) (to, m). (3.4.3) 


Proof For a smooth curve 7 : / > R x M, define p® : 7 +t) > Rand yp” :1 +1 
— M by 


pe) :=prpoy(t—to), «94 (t) := pry op (t — 10). 


The curve 7 is an integral curve of X through (to, m) iff 


yYoO=l YM O=x(FFO. 7") 


with initial conditions y® (to) = tg and y M (to) =m, respectively. The equations for 
y™ are solved by y®(t) =t for all t € J + to. The above system is then equivalent 
to 


yhO=X(.7"O), 7) =m. 
First, this shows that the curve y defined by (3.4.3) is an integral curve of X with the 
appropriate initial condition. Second, this shows that if y; : 1; — M is an integral 
curve of X with initial condition y; (f9) =m, the curve y; : 1; +t9 > Rx M defined 
by y(t) :-= (t + fo, v(t + fo)) is an integral curve of X through (to, 7). Therefore, 
y is maximal and unique. 


Like the maximal integral curves of an ordinary vector field combine to an ordi- 
nary flow, the maximal integral curves of a time-dependent vector field will combine 
to a time-dependent flow. For a set A, let A4 C A x A denote the diagonal. 


Definition 3.4.3 (Time-dependent flow) Let Y be an open neighbourhood of 
Ap x M in R x R x M and let 6: YJ > M be a smooth mapping. For m € M 
and ft), f2 € R, denote 


Ds, m = {t2 ER: (t,t,m)€ F\, Dry.t) ‘= {m EM: (to,t1,m)€ gp} 


and let ®;,4, : By, — M denote the induced mapping, given by ®;,+,(m) = 
P(t2,t1,m). The mapping @ is called a time-dependent flow on M if for all 
ti, f2,t3 € R and m € M the following holds: 
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1. Prt, = idy, 
2. if (t2,t,m) and (fs, fo, Pr 1, (m)) € J, then (t3, t1,m) € Z and 


Pi 1 (Pi, (m)) _ Pr, 4, (m), 


3. Yr .m is connected, 
4. Dp 1, (By,t.) C Dyn: 


A time-dependent flow is called maximal if it does not admit an extension, in the 
sense of mappings, which itself is a time-dependent flow. A time-dependent flow is 
called complete if 7=Rx Rx M. 


Now, for a time-dependent vector field X, define 
Q :={(t,t9,m) €Rx Rx M:(t—1, (o,m)) € F} (3.4.4) 
and 
DB:D>M, ®,)(m):= pry o Py—-1((to,m)). (3.4.5) 


Using Proposition 3.4.2 and the fact that @ is an ordinary flow, one can prove the 
following (Exercise 3.4.1). 


Proposition 3.4.4 The assignment of B® to X given by (3.4.4) and (3.4.5) defines a 
bijection between time-dependent vector fields on M and maximal time-dependent 
flows on M. Complete time-dependent vector fields thereby correspond to complete 
time-dependent flows. 


Remark 3.4.5 


1. Let ’: Y— M be a time-dependent flow on M. Then, for all t,t. € R such 
that 1, AS, we have ®,, 44 (Fy.1,.) = But and Ppt, : Dy, > Dry ry i8 a 
diffeomorphism with inverse ®;, 1, : F,,~. > Br,r,. Moreover, connectedness 
of Ym implies that for every t between ft; and f2 one has J, +, C Yr, and 
Pt .1,(Yr,t,) C Ao, The proof of these two statements is left to the reader (Ex- 
ercise 3.4.2). 

2. Time-dependent flows are in bijective correspondence with smooth 1-parameter 
families {®;} of local diffeomorphisms of M: given ®,, +,, one defines ®; := ®; 0. 
Then, ®,,.4, = Pp,0 0 Pot; = Pr, 0 P, and hence 


_ a4 
Pr ,t, = Pro ®,, : 


Conversely, given {®;}, this equation defines local diffeomorphisms ©,, ,, for all 
t; and f2. We leave it to the reader to show that this family forms a time-dependent 
flow on M. 

3. If a time-dependent flow ® is complete, all the mappings ®;, ;, are diffeomor- 
phisms of M and Definition 3.4.3 reduces to the requirement that ® is smooth 
and satisfies 


Dp, 1 fe) Pr 1, => Pi, 4, for all ,%2,8B6€ R, DP, 1 = idy for allt eR. 
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These are the defining properties of an action on M of the groupoid R x R 
with multiplication (a, b) - (b, d) = (a, d) and inverse (a, b)~! = (b, a). There- 
fore, a complete time-dependent flow may also be referred to as a two-parameter 
groupoid of diffeomorphisms. 

4. Ordinary flows embed into time-dependent flows as follows. If ®° is an ordinary 
flow on M with domain 9°, by defining 


={(b,t1,.m)eRxRxM:h-te FD}, Pr 1 = @e 


to—-t? 
we obtain a time-dependent flow on M. The latter has the property that Y,, ;, 
and ®;, ,, depend on the difference t2 — t; only. Conversely, if 6: J—> M isa 
time-dependent flow on M with this property, then 

= {(t,m)€ Rx M:(t,0,m) € J}, D? := P19 


yields an ordinary flow on M. The proof of both assertions is left to the reader 
(Exercise 3.4.3). 


To conclude this section, we briefly discuss the special case of periodically time- 
dependent vector fields. 


Definition 3.4.6 A time-dependent vector field X on M is said to be periodic if 
there exists a minimal T > 0, called the period, such that for all t € R and m € M 
one has 


X(t+T,m) = X(t,m). 


Proposition 3.4.7 The flow ® : Y— M of a periodically time-dependent vector 
field with period T satisfies 


1. Dry +7 ty +7 = Dry t and ay ey = Py 1 for all t,t€ R, 
2. Dero is nonempty and ®xr,.9 = Pi. 9 for allk €Z. 


Point 2 includes the statement that the domain of ok 9 coincides with Koro. 


Proof Let X be the vector field on R x M defined by (3.4.2) and let 6: J> Rx M 
be its flow. Then, FY and G are related by (3.4.4) and @ and @ are related by (3.4.5). 
1. Consider the diffeomorphism 
gr: RxM>RxM, or(t,m):=(t+T,m). 


A brief calculation shows that gr gi = X. According to Proposition 3.2.13/3, then 
QT (F,) = J, and G, 0 vr = ¢r o ®;. First, in view of (3.4.4), this implies Dr = 
r4-7.4,+7- Second, for every m € Y,, +,, (3.4.5) implies 


P47 1.47 (M) = pry © Prt o gr (t,m) = pry °¢rT o Dak (t1,m) = Py, 1, (m). 


2. We prove this by induction on k. For k = 1, the assertion is obvious. Thus, 
assume that it holds for k. According to Remark 3.4.5/1 and the first equality in 
assertion 1, 


Dk y(Dassyr0) = Pero ar yro) C Detyrar = Pro. 
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It follows that ®7 9 can be applied to PF. (DK+17.0); so that One is defined on 
Y+1)7,0- Conversely, if m € M belongs to the domain of oF. then it belongs 
to the domain of Pio and PF o(m) belongs to Ax+4iy7,n7- Since by the induction 
assumption, the domain of De coincides with Yr o, it follows that m € A«x+1)7,0- 
Finally, using the second equality in assertion 1, on Ax+1)7,0 we calculate 


Pr =@7 90 OF 5 = Desyr kr ° Pero = PK+1T7,0- 


Exercises 


3.4.1 Use Proposition 3.4.2 and the fact that PD isan ordinary flow to prove Propo- 
sition 3.4.4. 

3.4.2 Prove the assertions of Remark 3.4.5/1. 

3.4.3 Show that ordinary flows may be viewed as time-dependent flows, cf. Re- 
mark 3.4.5/4. 

3.4.4 Let M =R. Determine the maximal integral curves and the time-dependent 
flow of the time-dependent vector field X on M given by X (t,x) = (x, 74), 
te(-l1, 1). 
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Let M be a manifold. For a subset D of TM, let ?(M) denote the set of vector 
fields on M taking values in D. Correspondingly, let x? (M ) denote the set of local 
vector fields on M taking values in D. 


Definition 3.5.1 (Distribution) A distribution!* on M is a subset D of TM such 
that for all m € M the following holds. 


1. Dn := DOT,,M is a linear subspace of T,,M. 
2. For every Y € Dy, there exists X € X?(M) such that X», = Y. 


The function which assigns to m € M the dimension of D,, is called the rank 
of D. If the rank is constant, D is called regular. Otherwise, it is called singular. 


Concerning distributions and regularity we follow the terminology used e.g. in 
[181].!° Beware that it is quite common to include the constant rank requirement 
into the definition of distribution and to reserve the notion of regularity for addi- 
tional properties. Then, distributions which are not necessarily of constant rank are 
referred to as generalized distributions. 


To distinguish this notion from a distribution in the sense of analysis one should speak, more 
precisely, of a geometric distribution. However, it is common to omit the term geometric. 


'3 According to [181], our distributions should be referred to, more precisely, as smooth distri- 
butions. Since in this book we will meet only smooth distributions, we systematically omit the 
adjective smooth. 
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Remark 3.5.2 


1. According to Remark 2.3.2/4, condition 2 of Definition 3.5.1 is equivalent to the 
requirement that for every Y € D,, there exists X € cn 0 ) such that X,, = Y. 

2. Let me M andr = dim D,,. Condition 2 of Definition 3.5.1 implies that there 
exists a local r-frame {X1,..., X;} in TM at m taking values in D, such that 
Xim,---,Xr,m span Dm (choose a basis in D,, apply property 2 and restrict the 
domains of the vector fields so obtained appropriately). This shows that the rank 
is locally non-decreasing, that is, every m € M has a neighbourhood where the 
rank is greater than or equal to the rank at m. Moreover, if the rank is constant, 
then Xj m,..., X;,m also span Dy for allm € U. Thus, Proposition 2.7.5/2 yields 
that D is regular iff it is a vertical subbundle of TM. Hence, the definition of 
regular distribution given in Example 2.7.6 is equivalent to the one given here. 


Definition 3.5.3 (Integral manifold) Let D be a distribution on M. A connected 
submanifold (N,v) of M is called an integral manifold'* of D through m € M if 
me€w(N) and 


Wp(TpN) = Dyip) (3.5.1) 


for all p € N. D is said to be integrable if for every m € M there exists an integral 
manifold of D through m. 


Along an integral manifold, D has constant rank. If N is given by a subset, For- 
mula (3.5.1) reads 


TN = Dw. (3.5.2) 


Example 3.5.4 


1. Every vector field X on M generates a distribution D by Dy, := RX». The rank 
of D at m is 0 if X,, = 0 and | otherwise. According to Proposition 3.2.11, the 
images of the maximal integral curves of X are submanifolds of M. Obviously, 
they are integral manifolds of D. Thus, D is integrable. 

2. Every subset A C X(M) generates a distribution D, with D,, defined to be the 
linear span of the set {X,, : X € A}, and every distribution can be generated this 
way. A sufficient, but by far not necessary, condition for D to be regular of rank 
r is that A be an r-frame. 

3. Let E be a vector bundle over a manifold M and let ®: E > TM be a vertical 
vector bundle morphism. The image of @ is a distribution, because it is locally 
spanned by @ o 5;, where {5;} is a local frame in E. 

4. Let x: M — P beasubmersion. According to Example 2.7.7, ker z’ is a regular 
distribution on M of rank dim M — dim P. According to the Level Set Theorem 


4Tn part of the literature, for (NV, y) to be an integral manifold, only Vp (TpN) C Dy py is re- 
quired. What we call integral manifold here is referred to there as an integral manifold which is 
everywhere of maximal dimension. 
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for manifolds (Corollary 1.8.3), the connected components of the level sets of 
x are integral manifolds of ker’. Since they make up M, kerz’ is integrable. 
In fact, the connected components of the level sets of 2 are maximal integral 
manifolds, because the composition of a curve in an integral manifold with z is 
an integral curve of the zero vector field on P and is hence constant. 


. Let ¢ > 0 and 0 <r <n. Consider the open cube (—e, ¢)” C R”. The vec- 


tor fields 0),...,0, span a regular distribution D of rank r on this cube. 
This distribution is integrable, with the integral manifolds given by the subsets 
(—e,&)" X {(Xr41,---,Xn)}, where (%-41,.--,%n) € (—8, €)”" is fixed. It will 
be shown below that, locally, every integrable regular distribution of rank r on 
an n-dimensional manifold is of this form. 


. The distribution on R* spanned by the vector fields 4, and ydy is singular, be- 


cause it has rank 1 on the x-axis and rank 2 outside. Similarly, the distribution 
spanned by the vector fields 0, and xdy is singular, because it has rank 1 on the 
y-axis and rank 2 outside. The first of these distributions is integrable, with in- 
tegral manifolds being the x-axis and the two open half-planes. In contrast, the 
reader can easily see that the second one is not integrable. 


Next, we are going to derive criteria for integrability. We need the following 


notions. 


Definition 3.5.5 A distribution D on M is called 


1. 


involutive if ¥?(M) Cc X(M) is a Lie subalgebra, 


2. homogeneous!° if for all X € XP (M) and all (t, m) in the domain of &*, one 


has 


(0%) Dn = Dox (m): 


m 


Note that homogeneity means invariance of D under the flow of an arbitrary local 


vector field taking values in D. 


Remark 3.5.6 Each of the following two conditions is equivalent to involutivity 
(Exercise 3.5.1). 


Ie 


For all X,Y € a. (M) whose domains have nontrivial intersection, [X, Y] € 
xP (M). 


. For all mo € M, there exists an open neighbourhood U of mo in M and local 


vector fields X;,..., X, on U such that 
(a) Xim,.--,Xrm span D» for allm € U, 
(b) [X;, Xj] = ch Xk with smooth functions cf :UO SR. 


'S After Stefan [280]. 
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Example 3.5.7 


1. The distribution D generated by a vector field X on M is involutive: let Y, Z € 
X?(M). If dim D,, = 1, there exists an open neighbourhood V of m such that 
X}y does not have zeros. Hence, there exist unique f, g € C°(V) such that 
Yiv = fXyv and Ziv = gXyv. Then, (3.1.1) and (3.1.7) yield 


LY, Zn = [fX hv, gXivlm = (f (m)Xm(g) = g(m)Xm(f))Xm € Dn. 
If dim D,, = 0, then Y;, = Z;, = 0 and hence 


LY, Z]m(f) = Yn(Z(f)) _ Zm(Y(f)) =0 


for all f € C°(M), so that [Y, Z],, = 0. 

2. The distribution on R? spanned by the vector fields 8, + yd, and (x? + 1)dy is 
regular, but not involutive (Exercise 3.5.2). 

3. The (singular) distribution on R? spanned by the vector fields 3, and ydy is 
involutive, whereas the distribution spanned by 0, and x dy is not. 


Definition 3.5.8 (Adapted chart) Let D be a distribution on M, let m € M, and let 
r=dim Dy, n = dim M. A local chart (U, «) on M is said to be adapted to D at m 
if 


1. k(m) =0 and k(U) = (—¢, €)” for some é > 0, 
2. Dijxs-y 0 €XP_(M), 
3. for all e € (—e, ¢)"~", D has constant rank along Ue := «~!((—e, €)” x {e}). 


Remark 3.5.9 


1. The subsets Ue, ¢ € (—e, €)""", are embedded submanifolds of M. They are 
referred to as the slices of U. By condition 2, the rank of D at points of U 
is greater than or equal to the rank at the central point m, ie., r. If a slice Ug 
contains a point where D has the minimal rank r then, by counting dimensions, 
conditions 2 and 3 imply that U¢ is an integral manifold of D. In particular, Uo 
is an integral manifold of D through m. 

2. If D is regular, condition 3 is redundant. In this case, condition 2 means that 
0,,.--, 0, span D on U and that every slice U, is an integral manifold. Thus, in 
an adapted chart, the integral manifolds are given by the equations 


Xr+1] =C1,-+-5Xn =Cn—-r- 


3. In case D is generated by a vector field X and X,,, 4 0, every flow box chart for 
X at m is adapted to D at m. 


Theorem 3.5.10 (Stefan and Sussmann) Let M be a manifold of dimension n and 
let D be a distribution on M. The following statements are equivalent. 


1. D is integrable. 
2. D is involutive and has constant rank along integral curves of elements of 
XP_(M). 
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3. D is homogeneous. 
4. For every m € M, there exists a local chart adapted to D atm. 


Proof See [280, 283] for the original sources. Our proof follows [181, App. 3]. 

1 > 2: Let X, Y € X?(M). By integrability, every m € M is contained in some 
integral manifold (Nin, Wm) of D. Due to Corollary 3.1.6, since X and Y are tangent 
to (Nm, Wm), So is their commutator. It follows [X, Y]m € Dm for all m € M. Since 
the integral curves through m of the elements of ae (M) are contained in YW», (Nin) 
and since the rank of D is constant along integral manifolds, the second assertion 
follows, too. 

2=> 3: Let X € #2 (M) with domain U, let ®: Y— U be the flow of X and 
let (t, m) € Y. Assume that we are given open neighbourhoods of ®,,(s) for all 
s € [0, t]. Since the preimages under ®,, of these neighbourhoods provide an open 
covering of the compact interval [0, t], finitely many of them already cover the in- 
tegral curve of X from m to ®;(m). This shows that it suffices to prove the asser- 
tion for m being arbitrary but fixed, U being an arbitrarily small neighbourhood 
of m, and all t € Y,,. Let r = dim D,,. We choose U so that there exists a local 
r-frame {Y|,..., Y,} taking values in D and spanning D,,. Define smooth curves 
Zi: Bn > TmM by 


Zi(t) = (®_t%Yi)m, a) errr ct 


where by an abuse of notation we have omitted the restriction of Y; to _;. By 
construction, 


(Oj) 71 = Oem. t= ea h (3.5.3) 


In particular, the Z;(t) are linearly independent. It suffices to show that Z;(t) € Dy 
for all f € Dn, because then (3.5.3) implies (®;)’Dm C De, (m) and by the assump- 
tion on the rank of D there holds equality. According to Formula (3.3.3) and Propo- 
sition 3.3.2/2, 


Zi(t) = (®~14 LX) = (®-selX, ¥i]),, = (G1) ([X, Vile.) 3.5.4) 


m 
Since D is involutive, by Remark 3.5.6, [X, Yi]o,~m) € Do,(m) for all t € Bn. 
Since the rank of D along the integral curve t > ®;(m) is constant, (Y1)o,(m),---> 
(Y-)@,(m) Span D@,(m) for all t € An. Hence there exist unique smooth functions 
ry : Dn — R such that [X, Yi]o,cn) = I (t)(Yj)o,~n) (Summation convention). 
Then, (3.5.4) implies that the curves Z;(t) satisfy the ordinary differential equation 
Zi(t) =A} OZjO) 

so that, due to the initial condition Z;(0) € D»,, they stay in Dm, as asserted. 

3 = 4: The proof generalizes the construction of a flow box chart in the proof 
of Proposition 3.2.17. Let m € M and let r = dim D,,. Choose a local r-frame 
{X1,..., X,-} at m taking values in D. By Proposition 2.3.15, in a neighbourhood 


of m this local r-frame can be complemented to a local frame {X1,..., Xn} in TM. 
There exists ¢ > O such that the mapping 


W:(-6,8)">M,  W(t):= O81 0---0 4") 
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is defined. Since ¥% amounts to the natural vector space isomorphism R” —> T,,M 
induced by the basis {X1.m,..., Xn,m}, according to the Inverse Mapping Theorem, 
é can be chosen so that U := W((—e, €)”) is open in M and W is a diffeomorphism 
onto U. Then (U, xk := W~!) is a local chart on M at m. It remains to check that 
(U, «) satisfies conditions 1-3 of Definition 3.5.8. Condition 1 holds by construc- 
tion. For condition 2, let i € {1,..., 7} and m € U. Then, denoting ®; = pri 


Kki(m)’ We 


obtain 
0 (Ff) = (Gj 0---0 B!_ (Xj) G;0--.08,(m)) (Ff): 


By homogeneity, the vector on the right hand side belongs to D,,, and condition 2 
holds, indeed. Condition 3 follows from homogeneity and the fact that, by construc- 
tion, any element of the subset «—!((—e, €)” x {x}) can be joined to W(0,x) by a 
composition of integral curves of X1,..., X;. 

4 => 1: This follows from Remark 3.5.9/1. 


Remark 3.5.11 


1. The proofs of the implications 2 = 3 and 3 => 4 show that it suffices that the 
constant rank condition of point 2 or the homogeneity condition of point 3 hold 
for a neighbourhood of every point and a family of local vector fields spanning 
D over that neighbourhood. 

2. Consider the proof of the implication 3 = 4. From the formula for OF i it follows 
that 0; = Xj ;v, that is, (U, «) is a flow box chart for X. For the remaining local 
vector fields X;, i =2,...,7, this is not true in general. However, it is true in 
the special case where the X; commute, because then their flows commute, see 
Proposition 3.2.15. For later use, let us formulate this observation as follows. 
Let D» be spanned by the values at m of a local r-frame {X1,..., X;} in TM 
over V. If the X; commute, there exists a local chart (U,«) adapted to D at m 
such that U C V and X;;y = 90;,i=1,...,r. 


Corollary 3.5.12 (Frobenius Theorem) Let M be a manifold and let D be a regular 
distribution on M. If D is involutive, then for every m € M, there exists a local chart 
adapted to D at m. In particular, D is integrable iff it is involutive. 


Example 3.5.13 


1. The distributions discussed in points 1, 3 and 4 of Example 3.5.4 are involutive 
and homogeneous, because they are integrable. The distribution on R* spanned 
by the vector fields 0, + yd, and (x? + 1)dy is neither integrable nor homo- 
geneous, because it is not involutive, see Example 3.5.7/2. For the first of the 
distributions of Example 3.5.4/6, Theorem 3.5.10 yields homogeneity and con- 
firms integrability. On the other hand, the second one is neither homogeneous 
nor integrable. 
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2. In view of Remark 3.5.11/1, point 2 of Theorem 3.5.10 yields that a distribution 
D spanned by commuting vector fields X,,..., X; is always integrable. Indeed, 
since the flows commute, Proposition 3.2.13/2 yields 


Xi(;!(m)) = (#7) (Xilm)) 


for all i and j, hence D has constant rank along the integral curves of the X;. 


Next, we prove that integral manifolds of integrable distributions are initial and 
that there exist maximal integral manifolds. Without loss of generality, for con- 
venience we may assume any integral manifold (NV, @) to be given by its image 
g(N) C M, endowed with the topology and differentiable structure induced from N 
via y. 


Lemma 3.5.14 Let D be a distribution on M. 


1. Let Ny C M and Nz C M be integral manifolds of D. If Ni 1 Nz # @, then 
N11 N2 is open in N, and Nz and the smooth structures on N, ™ N2 induced 
from N, and N»2 coincide. There is a unique smooth structure on N, U Nz such 
that N, and N2 are open submanifolds. With respect to this structure, N, U N2 is 
an integral manifold of D. 

2. Let N CM be an integral manifold of D, let m € M such that dim D, = dim N 
and let (U, x) be a local chart adapted to D atm € M. If a slice Ue of U inter- 
sects N, then Ug is an integral manifold of D and N AQ Ug is open and closed'® 
in NU with respect to the relative topology induced from N. In particular, the 
number of slices of U which intersect N is at most countable. 


The number of slices of U intersected by N in assertion 2 may happen to be 
(countably) infinite, indeed: for M = S! x S! with angle coordinates ¢@; and ¢2 and 
D generated by the vector field X = 0g, + wdg, with o irrational, the intersection 
of any open subset with a maximal integral curve of X has infinitely many slices. 


Proof |. The assertion on Nj U N2 follows from that on Nj M N2, hence it suffices 
to prove the latter. The dimensions of Nj and N>2 coincide with the rank of D on 
N, M N2 and hence are equal. Denote this number by r. For m € Ni M No, choose a 


local r-frame {X1,..., X-} in TM over some neighbourhood U of m taking values 
in D. Since Xj; is tangent to N;, by restriction it induces a local vector field X ® 
on UN N;, j =1,...,r, i= 1,2. By the same argument as in the proof of the 


implication 3 = 4 of Theorem 3.5.10, there exists ¢ > 0 such that the mappings 


: : @ (i) 
WO: (—e,e +UNN, WOW = 2! 0---0 6%" (m) 


are diffeomorphisms onto open subsets V; of N;, i = 1,2. By construction, 
w(t) = w(t) as elements of M for all t € (—e, e)", cf. Remark 3.2.9/2. First, 


'6Tt is in fact a connected component of NM U. We do not need this though. 
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this implies Vj = V2 =: V and hence V C Ni; 1 No, where V is open in both Ny, 
and N2. Second, this implies that with respect to the differentiable structure induced 
from either N;, V is diffeomorphic to (—e, ¢)’. Since the construction works for 
every m € Ni 1 No, the assertion follows. 

2. Let r = dim N = dim D,,. If N intersects a slice Uc, then Ue contains a point 
where D has rank r. By Remark 3.5.9/1, then Ue is an integral manifold of D. 
Now, assertion | implies that N M U¢ is open in N and hence in NM U, where the 
topology of NM U is assumed to be induced from N. Since this holds for all slices 
which intersect N and since 


NNUe=(NN w\(Unn Us), 
ce 

the intersection N M U, is also closed in NOU. It follows that NM U, is a union 
of connected components of N MU. Since the slices are disjoint, this implies that 
the number of slices which intersect N is at most as large as the number of con- 
nected components of N MU. Since the topology of NM U is induced from the 
manifold N, it is second countable, hence the number of connected components is 
at most countable. 


Proposition 3.5.15 Integral manifolds of integrable distributions are initial sub- 
manifolds. 


Proof Let M be a manifold, let D be an integrable distribution and let N be an 
integral manifold of D. To show that N is initial, we must prove that, for every 
smooth manifold P and every smooth mapping yg : P > M such that p(P) CN, the 
restriction in range g!" : P — N is continuous. For that purpose, we will show that 
for every p € P and for every open neighbourhood V of g(p) in N, the preimage 
g!(V) is an open neighbourhood of p in P. 

Choose a local chart (U,«) adapted to D at p(p). Since the slice Uo is an in- 
tegral manifold through g(p) and since g(p) € N, Lemma 3.5.14/1 implies that 
Up MN is an open submanifold of N and of Up. Hence, without loss of general- 
ity we may restrict attention to open neighbourhoods V of g(p) in Up NN. Since 
Uo MN is open in Up and since Up is embedded, there exists an open subset V of 
U such that VM Uy MN = V. Now, let W be the connected component of p in 
yg 1(V). By construction, W is an open neighbourhood of p in P. It remains to 
show that W Cc gy '(V), that is, ge(W) C V. Since W is a connected subset of P 
and since ¢ is continuous as a mapping to M, y(W) is a connected subset of M 
which contains g(p) and which is contained in N M U. Hence, the image of g(W) 
under the mapping («’t!,...,«”) is a connected subset of R”~" which contains 0 
and which is contained in the image of N MU under this mapping. Since the lat- 
ter image labels the slices of U intersecting N, it is countable by Lemma 3.5.14/2. 
Since a nonempty countable connected subset of R”~" consists of a single point, it 
follows that x’+!(g(W)) =---=«"(y(W)) = 0 and hence g(W) C NN Up. Then, 
y(W) CV NUD AN =V. This completes the proof. 
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Remark 3.5.16 It follows, in particular, that the images of the maximal integral 
curves of a vector field on M/ are initial submanifolds of M, cf. the remark after 
Proposition 3.2.11. This can also be proved directly by means of the Straightening 
Lemma 3.2.17 and the above countability argument. 


Theorem 3.5.17 Let M be a manifold and let D be an integrable distribution on M. 
For every m € M there exists a unique integral manifold (Nm, Wm) of D through m 
which is maximal in the following sense. For every integral manifold (N, w) of D 
such that W(N) 0 Wn(Nin) # © there holds w(N) C Wn(Nm), and (N, yt Nm) is 
an open submanifold of Ny. 


Proof As mentioned above, for convenience, in the proof we assume all integral 
manifolds to be given by subsets. Moreover, we will repeatedly use the statements of 
Lemma 3.5.14, mostly without explicitly spelling that out. Let m € M be given and 
let r = dim D,,. Every integral manifold of D through m has dimension r. Define 
a subset N,, C M by taking the union over all integral manifolds of D through m. 
Equip N,, with the topology generated by the open subsets of the integral mani- 
folds through m. Then, the union over the maximal atlases of the integral manifolds 
through m defines an atlas on N,,. By Lemma 3.5.14/1, this atlas is smooth. By 
the same argument as in the case of a regular distribution, see e.g. [302], one can 
show that N,, is second countable. Then, N,, is a manifold of dimension r. By 
construction, the local representatives of the natural inclusion mapping Nj, — M 
are smooth, hence N,, is a smooth submanifold of M. Also by construction, every 
m € Ny» belongs to an integral manifold N through m and N is an open submanifold 
of Nin. Hence, Ty, Nin =T;,N = Dy, so that Nj», is an integral manifold through m. 
It has the maximality property stated in the theorem, because if some integral man- 
ifold N of D intersects N,,, then N U Nj, is an integral manifold through m, hence 
it is contained in N,,. There follows NM N,, = N and hence N is an open subman- 
ifold of Nj. To show uniqueness, let Nn be an integral manifold through m which 
has the above maximality property. Then, Nn = N, as sets and each of them is an 
open submanifold of the other one, hence they are equal as manifolds. 


The properties of the family of maximal integral manifolds of an integrable dis- 
tribution can be conveniently summarized in the notion of foliation. Our definition 
follows [280]. 


Definition 3.5.18 (Foliation) Let M be a manifold of dimension n. A foliation of 
M isa family VY of connected submanifolds of M, called the leaves of the foliation, 
such that 


1. the leaves are pairwise disjoint and yey N = M,!’ 
2. for every m € M there exists a local chart (U, «) of M satisfying 


'7 That is, WV is a partition of M. 


124 3 Vector Fields 


(a) k(m) =O and x(U) = (—é, €)” for some « > 0, 
(b) the leaves are invariant under the flows of the local vector fields 0),..., 0,, 
where r denotes the dimension of the leaf containing m. 


The function which assigns to m € M the dimension of its leaf is called the rank 
or the dimension of VW at m. V is called regular if this function is constant, that is, 
if all leaves have the same dimension. Otherwise, ./ is called singular. 


Concerning terminology, the remark made after Definition 3.5.1 applies accord- 
ingly to foliations. Local charts at m satisfying the two conditions of point 2 will be 
called adapted to .%. 


Remark 3.5.19 


1. If -Y is regular of dimension r, condition 2b is equivalent to the condition that 
for every leaf N, the image «(N MU) is a union of subsets (—e, €)’ x {ce}, where 
c € (—e, €)""". This union is necessarily at most countable. Moreover, in this 
case, if a leaf N is an embedded submanifold of M, it is closed in M. To see 
this, let {m,,} be a sequence in N converging to some m € M. Let N,,, denote the 
leaf of m. For every local chart (U, x) at m adapted to ./, U intersects N. Since 
N is embedded, (U, «) can be chosen so that k(N MU) = (—e, €)" x {ec} for 
some c € (—é, €)"”~". Since k(m,) > k(m) = 0, it follows that ¢ = 0 and hence 
meN. 

2. By condition 2b, the dimension of leaves is locally non-decreasing, that is, for 
every m € M there exists a neighbourhood U such that the dimension of any leaf 
intersecting U is equal to or greater than the dimension of the leaf containing m. 


Example 3.5.20 


1. The family of connected components of a manifold M is a regular foliation of 
dimension n. The family of all one-point subsets is the only regular foliation of 
dimension 0. 

2. Ifa: M — P isasmooth submersion with the property that the preimage 2~!(p) 
is connected for all p € P, the Constant Rank Theorem implies that the preim- 
ages form a regular foliation of dimension dim M — dim P. A foliation of this 
form is called simple. By Remark 1.5.16/4, for a given simple foliation of M, 
the manifold P is unique up to diffeomorphisms. It is referred to as the space 
of leaves of the foliation. Let us add that Theorem 3.5.10/4 says that every inte- 
grable regular foliation is locally simple. 

3. According to Proposition 3.2.17, the images of the integral curves of a vector 
field on M form a foliation. It is regular of dimension 1 if the vector field does 
not have zeros. 

4. The partition of R? consisting of the single points of the x-axis and the two open 
half-planes separated by the x-axis is a smooth foliation, whereas the partition 
consisting of this axis and the single points outside is not. 
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Proposition 3.5.21 The assignment of the family of maximal integral manifolds 
to an integrable distribution defines a bijection between smooth integrable distri- 
butions on M and smooth foliations on M. Regular distributions of dimension r 
thereby correspond to regular foliations of dimension r. 


Proof The family of maximal integral manifolds of an integrable distribution is a 
foliation: Theorem 3.5.17 implies that condition | of Definition 3.5.18 is satisfied. 
Theorem 3.5.10/4 and the invariance of maximal integral manifolds under the flows 
of the elements of x? (M ) imply that condition 2 is satisfied. Conversely, let 
be a smooth foliation. For m € M, let D,, be the tangent space at m of the leaf 
containing m. This defines a subset D C TM. By condition 2(b) of Definition 3.5.18, 
Dy is spanned by the values at m of elements of a. (M). Hence, D is a distribution. 
By construction, every leaf of VW is an integral manifold of D. First, this implies that 
D is integrable. Second, in view of Theorem 3.5.17, this implies that every leaf of 
YW is contained as an open subset in a maximal integral manifold N of D. Thus, NV 
is a union of leaves. Since the leaves are disjoint and since N is connected, N must 
coincide with a single leaf. This shows that ./ is the family of maximal integral 
manifolds of D, and the assignment is bijective, indeed. The assertion about regular 
distributions and foliations then follows by observing that for every m € M, the 
dimension of the maximal integral manifold of an integrable distribution through m 
is equal to the rank of this distribution at m. 


Exercises 


3.5.1 Use Remark 2.3.2/4 and Formula (3.1.1) to prove that the conditions given in 
Remark 3.5.6 are equivalent to involutivity. 

3.5.2 Show that the vector fields X = 4, + yd, and Y = (x7 + 1)dy generate a non- 
involutive regular distribution of rank 2 on R?. 

3.5.3 Show that the distribution on R? \ {0}, generated by the vector fields 


X| = x203 — x302, X27 = x30 — x1 03, X3 = x1 02 — x20], 


is regular of rank 2 and involutive. Find the maximal integral manifolds. How 
are they related to the action of the rotation group SO(3) on R3? 

3.5.4 Show that the distribution on the unit sphere S? C R*, generated by the vector 
fields 


X= (1 x4 xq) a1 X1xX202 — X1x303 +.x1(1 — x4) 04, 


Y = —x x20; + (l—x4— x3) a — x2x303 + x2(1 — x4) 04, 


is integrable. Find the maximal integral manifolds. Is this distribution regular? 
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In the last four sections of this chapter, we discuss qualitative aspects of the flows of 
vector fields. Usually, this is treated as a part of the theory of dynamical systems.!® 
Let M be a smooth manifold, let X be a vector field on M and let ® : Y— M be the 
flow of X. Recall from Proposition 3.2.11 that the image of a maximal integral curve 
of X is a submanifold of M which consists of a single point or is diffeomorphic 
either to S! or toR. 


Definition 3.6.1 (Critical integral curve) A maximal integral curve y of X is called 
critical if its image is compact, that is, if the image consists of a single point or if it 
is diffeomorphic to the sphere S!. In the first case, y is called an equilibrium of X. 
In the second case, it is called a periodic integral curve. 


In what follows, by an abuse of terminology and notation, by an integral curve y 
of X we will mean the mapping itself or its image in M, depending on the context. 

Let y be a periodic integral curve of X and let m € y. The minimal positive real 
number 7 € R satisfying ®r(m) = m is called the period of y and @r is called the 
period mapping. Due to @7 o @,(m) = D; o Sy (m), the period does not depend on 
the choice of m. Since a considerable part of the analysis of the flow near a peri- 
odic integral curve is reduced to the study of fixed points of local diffeomorphisms, 
parallely to periodic integral curves of flows we will discuss fixed points of local 
diffeomorphisms. 

A fundamental tool in the study of critical integral curves is the linearized flow. 
Given a critical integral curve y of X, we consider the normal bundle 


Ny = (TM) y,/Ty 


of the submanifold y of M. Recall that Ny is a real vector bundle over y of dimen- 
sion dim M — dimy. Since y is invariant under the flow @, the tangent mappings 
(@,)', t € R, induce vector bundle automorphisms 


(®;)'":Ny > Ny, (3.6.1) 


which project to the diffeomorphisms of y induced by ;. The family {(®,)’” : 
t € R} is a one-parameter group of diffeomorphisms and hence defines a complete 
flow on Ny. 


Definition 3.6.2 (Linearized flow) The flow on Ny defined by {(@,)’” : t € R} is 
called the linearization of ® along y. 


We are going to determine the linearization explicitly. First, consider the case of 
an equilibrium. Here, y = {m} and Ny = T,, M. The linearized flow (®;)’” is given 
by the one-parameter group of vector space automorphisms (@;)),,:TmM > TnM, 


‘84 (continuous time) dynamical system consists of a manifold M and a vector field on M. We do 
not use this terminology here. 
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t ER. Hence, according to Example 3.2.8, this flow is generated by the linear vector 
field on T,, M which corresponds to the vector space endomorphism 


d 
Hess (X):= — (®;)), (3.6.2) 
dt to 
of T,;,M and one has 
(G,),, =e) teER. (3.6.3) 


Definition 3.6.3 The mapping Hess,,(X) : T,;,M — T,,M is called the Hessian en- 
domorphism of X at m. 


Remark 3.6.4 


1. Let (U,«) be a local chart at m and let X' denote the coefficient functions of 
X with respect to the local frame of TM induced by x. A brief computation 
(Exercise 3.6.1) shows that with respect to the basis {0),m,..-,90n,m} Of TmM, 
the endomorphism Hess,, (X) is represented by the (n x n)-matrix 


a A(X! ox!) 
i Axi 
2. Hess(X) can be characterized by X alone, without using the flow, as follows. 

Let so be the zero section of TM. Its image sg(M) is an embedded submani- 


fold of TM, diffeomorphic to M. Consider the tangent mapping Xj, :TnM > 
Tso(m)(TM). Since so(M) and T,, M intersect transversally in so(m), one has 


[), 0 Hessm(X) 0 («/,) "J («(m)). (3.6.4) 


Tso(m) (TM) = Tso(m) 50 (M) + Tso(m) (TmM). 


By counting dimensions one sees that the sum is direct, hence it defines a projec- 
tion to Ts)(m)(TmM). Composing this projection with the natural identification 
Of Ts)(m)(TmM) with Ti,M, one obtains a linear mapping tm : Tsy¢m)(TM) > 
TmM. Using tm, one can express Hess,,(X) as 


Hess» (X) = Tp 0 X},. (3.6.5) 


The easiest way to prove this is to use (3.6.4). This is left to the reader (Exer- 
cise 3.6.2). 


Next, consider the case of a periodic integral curve. In contrast to equilibria, 
periodic integral curves come in a great variety of shapes. Hence, we cannot expect 
to obtain an explicit formula for the linearized flow along y like (3.6.3) for the case 
of an equilibrium. Rather, we will construct a normal form for the linearized flow 
along y. This requires the following notion. 


Definition 3.6.5 (Conjugacy) Let k > 0. 


1. Flows 6 : 9 — Mj, i =1, 2, as well as the corresponding vector fields, are 
said to be conjugate of class Cé if there exists a diffeomorphism h: M, > M2 of 
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class C* such that (idp xh)(9) = Y™ and, for all (t,m) « 9™, 
h(®? (m)) = & (hin). 


2. Local diffeomorphisms pg :U; > Mj,i = 1, 2, are said to be conjugate of class 
C* if there exists a diffeomorphism h: M, —> Mp of class C* such that Uz = 
h(U,) and 


hog) =9 ohpy,. 


Recall that a diffeomorphism of class C° is just a homeomorphism. In this case, 
one speaks of topological conjugacy. 

As a second input, we need the suspension of a vector space automorphism. Let 
V be a finite-dimensional real vector space, let a €¢ GL(V) and let T > 0. We de- 
fine an equivalence relation on the direct product V x R by (v1, ft) ~ (v2, fo) iff 
there exists k € Z such that (v2, to) = (ak vy, t; — kT). Let V“ denote the set of 
equivalence classes. By constructing local charts in an analogous way as for the 
Mobius strip in Example 1.1.12, one can equip V“ with a smooth structure (Ex- 
ercise 3.6.4). With respect to this structure, the natural projection V x R— V® is 
a local diffeomorphism. In particular, V“ has the same dimension as V x R, 1.e., 
dim V + 1. The flow on V x R induced by the standard vector field (0, ¢) is given 
by (t, (v, 5)) > (v, 5+1). Hence, it maps equivalence classes to equivalence classes 
and thus induces a flow +“ on V“. By definition, the projected flow is given by 


=F ([,s)])=[@s+0], ve, s,teR. (3.6.6) 


Definition 3.6.6 (Suspension) V“ is called the suspension with period T of V rel- 
ative to a. The flow 2% on V“ is called the suspension of a with period 7. The 
integral curve through [(0, 0)] is referred to as the central integral curve of 2”. 


The flow X” is complete, that is, it is a one-parameter group of diffeomorphisms 
of V“. It does not have fixed points. Due to 


Ler ([@, s)]) = [(v, s +kT)| = [(a‘v, s)], vEeV,sEeR, keZ, 


an integral curve of XY“ is periodic iff it passes through a point [(v, s)] satisfying 
a‘(v) = v for some k € Z \ {0}. In this case, the integral curve has period kT. In 
particular, the central integral curve is periodic with period 7. Let us add that V% can 
be viewed as a vector bundle over R/TZ = S! with typical fibre V and transition 
functions with respect to a covering of S! by two open intervals U;, U2 given by the 
constant mappings x +> idy for one of the connected components of U; M U2 and 
x +> a for the other one. 


Example 3.6.7 


1. For V =R and a being multiplication by —1, the suspension V“ with period 
T = 1 is diffeomorphic to the Mobius strip, cf. Examples 1.1.12 and 2.2.4. All 
integral curves of +“ are periodic. The central integral curve has period | and 
the other integral curves have period 2. 
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2. For V = C and a being multiplication by e'”, the suspension with period T = 1 
is diffeomorphic to C x S!. If x is rational, all integral curves are periodic, 
where those apart from the central one have the same period (a certain inte- 
ger). Otherwise, the central integral curve is the only one which is periodic. For 
r > 0, the submanifold {[(z, t)] : |z| = 7} is invariant under the suspended flow 
and the induced flow is conjugate to the rational or irrational torus flow, cf. Ex- 
ample 1.6.6/3. 


Now, we can construct the desired normal form for the linearized flow along a 
periodic integral curve y. For that purpose, we observe that the mapping 


Ph = (®7)n sNny > Nny (3.6.7) 


is a vector space automorphism. It is called the period automorphism of y at m. 


Proposition 3.6.8 Let y be a periodic integral curve of X with period T . For every 
m € y, the linearized flow is smoothly conjugate to the suspension with period T of 
the period automorphism Pj. 


Proof Denote a = P;, and define a mapping 
h:Nny x R= Ny, A((X], t) = (®,)/"[X]. 


For simplicity, we write v = [X]. Due to hiakv, t—kT)= hv, t) for all k € Z, the 
mapping / is constant on equivalence classes and hence induces a mapping 


h:(Nmy)* > Ny, A([(v, t)]) = (@,)/v. 
By construction, 
hoLf= (D,)'" oh. 


It remains to show that h is a diffeomorphism. Since the natural projection 
Nmy X R= (Nmy)* isa local diffeomorphism, it suffices to show that h is bijective 
and that hy, t) is a bijection for all (v, s) EN» y x R. Surjectivity of h follows from 
the fact that the integral curves of the linearized flow project to y and hence each 
of them passes through Nmy. To prove injectivity, let (v1, t1), (v2, f2) ENmy x R 
such that (v4, t1) = A(vo, t2). Then, v2 = (®;,-1,)/" v1. Since v; and Az are both in 
Nmy, there exists k € Z such that t) — t2 = kT. It follows that v2 = a‘ vy, and hence 
[(v2, t2)] = [(v1, t)]. Finally, since Nj, y x R and Ny have the same dimension, bi- 
jectivity of ae + follows from surjectivity. The latter follows by observing that for 
every t € R, the mapping v > h(v,t)isa vector space isomorphism of fibres of Ny, 


and that for every v € N,,y, the mapping tf > h(v, f) is acurve in Ny projecting to 
y and hence intersecting each fibre transversally. 


Remark 3.6.9 Differentiation of the obvious equality @7 = @, 0 Pr o B_,; at D;(m) 
yields 


¥ _ 'y ié 'y 
Leen = (®,)m 0 Py 0 (P_1) 6, (m): 


Hence, all period automorphisms are conjugate under the linearized flow. 
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As a result of the previous discussion, the linearized flow along a critical inte- 
gral curve y is completely determined by a certain characteristic linear mapping. 
In case y is an equilibrium, this is the Hessian endomorphism. Its eigenvalues are 
called the characteristic exponents of y. In case y is periodic this is the period au- 
tomorphism at a chosen point m € y. Its eigenvalues are called the characteristic 
multipliers of y. Extending this terminology, by the characteristic linear mapping 
associated with a fixed point m of a local diffeomorphism g we mean the tangent 
mapping 97,. Its eigenvalues are called the characteristic multipliers of m. Note that, 
being eigenvalues of linear mappings, characteristic exponents and multipliers come 
with geometric and algebraic multiplicities.!? 

Next, we recall the following terminology from linear algebra. An endomorphism 
of a finite-dimensional real vector space V is called non-degenerate if all eigenval- 
ues are nonzero. A non-degenerate endomorphism of V is called elliptic (hyper- 
bolic) if every (no) eigenvalue lies on the imaginary axis. An automorphism of V is 
called non-degenerate if all eigenvalues are distinct from 1. A non-degenerate au- 
tomorphism of V is called elliptic (hyperbolic) if every (no) eigenvalue lies on the 
unit circle. Via the characteristic linear mapping, this terminology carries over to 
critical integral curves of vector fields and to fixed points of local diffeomorphisms. 
Thereby, the characteristic linear mapping of an equilibrium has to be treated as an 
endomorphism, because it is genuinely infinitesimal, whereas the characteristic lin- 
ear mapping of a periodic integral curve or of a fixed point of a local diffeomorphism 
has to be treated as an automorphism, because it is genuinely a transformation. 


Definition 3.6.10 (Ellipticity and hyperbolicity) A critical integral curve of a vector 
field or a fixed point of a local diffeomorphism is called non-degenerate, elliptic 
or hyperbolic if the associated characteristic linear mapping is, respectively, non- 
degenerate, elliptic or hyperbolic. 


From the separate analysis of hyperbolic and elliptic critical integral curves, con- 
clusions on the behaviour of the flow near an arbitrary critical integral curve can be 
drawn. 


Remark 3.6.11 


1. Let y be a periodic integral curve of X. According to Remark 3.6.9, the char- 
acteristic multipliers of X at y do not depend on the choice of the point m € y. 
Moreover, due to 


d 
(Pr)i, Xm = dt to 


Xm is an eigenvector of the automorphism (@r)j,, of TM with eigenvalue 1. It 
follows that the characteristic multipliers of X at y are given by the eigenvalues 


d 
Pr (#;(m)) = dt} ®,(m) = Xm, 
0 


'°The geometric multiplicity of an eigenvalue is the number of linearly independent eigenvectors. 
The algebraic multiplicity is defined as the multiplicity of the corresponding root of the character- 
istic polynomial. 
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of (@r);,, with the multiplicity of the eigenvalue 1 of (®r)/,, reduced by one 
(which may make this eigenvalue disappear). In particular, y is non-degenerate 
iff the eigenvalue 1 of (®7)/,, has multiplicity one. 

2. As a consequence of the Inverse Mapping Theorem, non-degenerate equilibria 
of vector fields and non-degenerate fixed points of local diffeomorphisms are 
isolated (Exercise 3.6.5). Moreover, since X is continuous and periodic integral 
curves are compact, each of them possesses a neighbourhood which is free of 
equilibria. 

3. Let M be an open subset of a finite-dimensional real vector space V and let y be 
a periodic integral curve of a vector field X on M. According to Remark 2.3.4/1, 
under the natural identification of TM with M x V C V x V, X is represented 
by a smooth mapping X : M — V and the tangent mapping (@;)’ can be written 
in the form (@,)!,(v, u) = (@;(v), Ay (tu) with v,u € V and A,(t) € GL(V). 
For every v € y, this defines a smooth curve A, : R > GL(V). Differentiation 
of this curve with respect to t yields the ordinary first order differential equation 


Ay(t) = X"(@,(v)) Av) (3.6.8) 


with initial condition A,(0) = 1. The solution is known as the path-ordered ex- 
ponential?°-7! and is usually written in the form 


t 
A,(t) = Texp | X'(®,(v)) ds. 
0 


The linearized flow (@,)/” : Ny > Ny along y is thus represented by the family 
of mappings 


V/RX(v) > V/RX(@,(v)), vey, teR, 
given by 
(®,)Y (u+ RX(v)) = Ayu t+ RX(G(v)), we. 
Hence, the characteristic multipliers of y coincide with the eigenvalues of A,(T) 


with the multiplicity of the eigenvalue 1 reduced by 1 (so that this eigenvalue may 
disappear). 


Example 3.6.12 The vector field X = 2yd, + (4x — 4x3)d, on R? has the equilib- 
rium points m, = (—1,0), mz = (1,0) and m3 = (0, 0). According to (3.6.4), with 
respect to the standard chart, the Hessian endomorphism of X at mj; is represented 


by the matrix 
ax! 0 2 
dxi | 4—12x? Of" 


20For negative f, te X'(®;(v)) ds has to be replaced by — i X'(®,(v)) ds. 
21 Since (3.6.8) has T-periodic coefficients, by a theorem of Floquet, the solution A,(t) can be 


written as A,(t) = Ay o(t)e!B» with T-periodic smooth Ayo : R > GL(V) and constant By € 
End(V). 
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The corresponding eigenvalues are 14 = A24 = +41 and A34 = +2,/2. Hence, mM, 
and mz are elliptic, whereas m3 is hyperbolic. In particular, all equilibria are non- 
degenerate. Consider the function f(x, y) = y? — 2x? + x4. Due to X(f) =0, the 
level sets of f are invariant under the flow of X. A function with this property is 
called a first integral or a constant of motion of X. Since we are in two dimensions, 
it follows that the level set components of f coincide with the images of the integral 
curves. We leave it to the reader (Exercise 3.6.7) to visualize the flow of X using 
this observation. 


Example 3.6.13 (Linear vector fields) Let V be a finite-dimensional real vector 
space and let X be a linear vector field on V, defined by the endomorphism A. 
Then, 0 is an equilibrium point of X. Under the natural identification of ToV with 
V, the linearized flow on ToV coincides with the flow of X on V. According to 
Example 3.2.8, the latter is given by ®, = e'4. There follows Hesso(X) = A. Hence, 


— the characteristic exponents of X at 0 are given by the eigenvalues of A, 
— 0 is non-degenerate iff A is invertible; if so, it is the only equilibrium point, 
— 0 is hyperbolic or elliptic iff so is A. 


For illustration, we determine the flow in the neighbourhood of the equilibrium 
point 0 for the case dim V = 2. We choose a basis such that the matrix representing 
A has Jordan normal form and use the results of Example 3.2.8. Up to the choice of 
basis, one can distinguish the following types.77 


1. A has a zero eigenvalue. 
(a) A=0, e!4 =idy. Every point is an equilibrium point. 


0 1 1 ¢ 
w) A=) Hegel Al 


The x1-axis consists of equilibria. The other maximal integral curves are par- 
allel to the x2-axis. Their velocity is constant in time and given by the x2- 
coordinate. 


© a= eel a eo. 


The x2-axis consists of equilibria. The other maximal integral curves are par- 
allel to the x-axis. Their velocity grows exponentially in time. 


_ Xn O tA _ eM 0 
2. a=|4 fe ak elt A, MER, AFZOF w. 
The integral curves are given by the graphs of the functions 


x2 = f(x1)=+C|x|4", 1140, C>0, (3.6.9) 


and the positive and negative coordinate semiaxes. Three cases can be distin- 
guished. 


221n the literature, linear vector fields X,, X> defined by Aj, Az are said to be linearly equivalent if 
Az =aA,a7! for some a € GL(V). In this terminology, our choice of basis amounts to discussing 
the linear vector fields on R? up to linear equivalence. 
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(a) A> w>O0ordA < pw <0. The curves (3.6.9) are branches of parabolas. In 
the first case, all integral curves run outwards, hence 0 is called an unstable 
knot. In the second case, they run inwards, hence 0 is called a stable knot. 

(b) 4 = uw. The curves (3.6.9) are semi-infinite line segments. As in case 2(a), 
the running direction is outwards for A > O and inwards for A < 0. Corre- 
spondingly, 0 is called a degenerate unstable or stable knot. 

(c) A>0O> uw. The curves (3.6.9) are branches of hyperbolas. The running di- 
rections are shown in the figure below. The equilibrium point 0 is called a 
saddle here. 


\ Z A 2 


AS ZI S\ 


2(a) knot 2(b) degenerate knot 2(c) saddle 


1A Ly sa car} lt 
3 a=|f ie =e" lg pp reR A440. 


For 4 < 0 (case 3(a)), the maximal integral curves run inwards, whereas for A > 0 
(case 3(b)) they run outwards. Correspondingly, 0 is called a stable or unstable 
improper knot. 


OE S ’ f ra a ; 
ey ~ \ f f gO ee 
“a RY He XN f jf _ ye 
ae SNA < NC ( BO ee 

Se ye TSN ee _ 
sa INN 
a f \ oS a va J } se 
\ i Se SS ee Pa / i 
\ ~ ~ - oO 
Ke SK ge. ie f 
3(a) stable improper knot 3(b) unstable improper knot 


a cos(Bt) —sin(6r) 
4.A= E ae =e ee a jae ER, B>0. 
This is the case of complex eigenvalues A+ = a + if. If a < 0 (case 4(a)), 
the maximal integral curves are logarithmic spirals approaching 0. If a > 0 
(case 4(b)), they are spirals running away from 0. If w = 0 (case 4(c)), they form 
circles. Correspondingly, 0 is called a stable or an unstable spiral or a centre. 


Fen, Pepe, tg 
é ee . Paine. \ \ 
aN / EN \ | 
tL LUO A i (hays 34 
YAMA i (CUsmai 
VX SEY) { \ \QeZZ / yi 
Es geet sf Ff \ (ee P 
ey, ae, 
a ~ 


4(a) stable spiral A(b) unstable spiral A(c) centre 
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In the cases 2, 3, 4(a) and 4(b), the endomorphism A and hence the equilibrium 
{0} is hyperbolic. In the case 4(c), A and hence {0} is elliptic. Further critical integral 
curves besides {0} are present in the cases 1(a)—1(c), where they form additional 
equilibria, and 4(c), where they form periodic integral curves. The latter have the 
common period T = ea Due to e74 = 1 there holds (@7),, = (e7 4), = 11 for all 
v #0. Hence, the single characteristic multiplier is 1, so that all periodic integral 
curves are degenerate. 

Examples for types 2(a), 2(b), 4(a) and 4(c) are provided by the harmonic oscil- 
lator in one dimension with linear friction, see Exercise 3.6.9. 


Example 3.6.14 (The planar pendulum) Let ¢, 6 denote the coordinates induced on 
TS! =S! x R by the standard angle coordinate ¢ on S!. The vector fields dg and dg 
provide a global frame in T(TS!). The planar pendulum with length /, gravitational 
acceleration g, elongation @ and angular velocity 6 is modelled by the (nonlinear) 
vector field? 


X = Bog — wo sin ddp, w* = ce 


on TS!. The equilibrium points are given by ms = (0,0) and m, = (1, 0). Accord- 
ing to Remark 3.6.4/1, with respect to the basis {(0g)m;,(0g)m;} in Tm,M,i=s,u, 
the Hessian endomorphisms are represented by the matrices 


0 1 0 1 
Hess, (X) = f 2 | : Hess, (X) = E il . 


(22) 


The eigenvalues are tiw for ms and tw for m,, hence my is elliptic and m, is 
hyperbolic. For the linearized flow on Tin,M, the origin is a centre (type 4(c)), 
whereas for the linearized flow on T,,,,M, it is a saddle (type 2(c)).2* The flow 
of X is shown in Fig. 3.2(a), where the left and right boundaries of the picture must 
be glued. There are contractible and non-contractible periodic integral curves. The 
first ones correspond to motions where the pendulum swings forth and back. The 
latter ones correspond to motions where the pendulum rotates. Altogether they form 
three 1-parameter families, each of which is diffeomorphic to a cylinder S' x R.?° 
Finally, there are two maximal integral curves which are not critical, represented 
by the fat curves in the picture. They approach m, for t — too and correspond 
to the two motions where the pendulum asymptotically approaches the upward po- 
sition from the left or the right, respectively. Together with m,,, these two integral 
curves divide M into three domains each of which is filled with periodic integral 
curves. Since periodic integral curves of different domains cannot be deformed into 


3 This is an example of a Hamiltonian system with one degree of freedom. 

4We will see later that it is exactly these two types of equilibria that can occur in a Hamiltonian 
system with one degree of freedom. 

°5In Chap. 9 we will learn that periodic integral curves of Hamiltonian systems typically come in 
such “orbit cylinders”. 
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Fig. 3.2. The flows of (a) the planar pendulum and (b) the modified harmonic oscillator 


one another through periodic integral curves, the subset~° of M made up by the two 
non-critical integral curves and m, is called the separatrix. 


Example 3.6.15 (The modified harmonic oscillator) As a last example, consider the 
vector field 


2 2 
X= (v-+ox(1 =—2° 5 2) Jac (-c%x +oy(1 SS a 5) )a 
a) a) : 


on R?. This system is usually referred to as the modified harmonic oscillator. In the 
sequel we state results about the critical integral curves of this system and leave it 
to the reader to provide arguments or to carry out calculations. 

X possesses the single equilibrium {0}. The characteristic exponents are w(1+i), 
hence 0 is hyperbolic and the linearized flow about 0 is an unstable spiral (type 4(b)), 
see Example 3.6.13. Outside the equilibrium, i.e., on R* \ {0}, one has the global 
chart x = (r, @), defined by 


x=rcos@¢, y=ar sing. 


We use it to identify R? \ {0} with R, x S! and to move with our discussion to the 
latter. This way, the equations for the integral curves of X separate, 
#=or(1—r’), b=o, 


and for the flow we obtain 


Ko, ok '(r,6) = ((1+(r-7 - le) 2 pt at), teR, (3.6.10) 


see Fig. 3.2(b). In particular, X is complete. From (3.6.10) one reads off that X 
possesses the single periodic integral curve y, given in the coordinates r, ¢ by 


?6Tn fact, this subset is a figure eight submanifold, see Example 1.6.6/2. 
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tr y(t) = C1, of). It has period T = an The tangent mapping of the flow at the 
point (1, d) of y is 


= e 20! 0 
(ko Dp oK Deh 0 il 


In particular, the characteristic multiplier of y is e~*”, hence y is hyperbolic, too. 
The normal bundle Ny admits the global section (1, 6) +> [(0;)(1,¢)]. With respect 
to the corresponding global trivialization x : Ny > y x R, the linearized flow along 
y is given by 


x 0(D,)” ox! ((1, 9), v) = (1, o + ot), 270). (3.6.11) 


In particular, the period automorphism Po ) amounts to multiplication by e~4”. 


Identifying Nq¢)y with R via x, for the suspension of Ni1,g)y relative to Pi ¢) 
one obtains 


—4r 
(Nuov)’  =(RxR)/~, 


where (s1, 01) ~ (s2, v2) iff 5} — so = kT and vy = e*ky, for some k € Z. The 
mapping R x R— y x R defined by (s, v) & ((1, ws), e~4”*v) descends to a dif- 
feomorphism t : Nav) — y x R which satisfies 


ro Ee or (1,4), v) =((1,¢ +r), ev). 


Thus, x~! ot conjugates the linearized flow along y to the suspension of the period 
automorphism. In fact, it yields the diffeomorphism h constructed in the proof of 
Proposition 3.6.8. 


Exercises 

3.6.1 Prove Eq. (3.6.4). 

3.6.2 Use (3.6.4) to prove Formula (3.6.5). 

3.6.3 Let m be an equilibrium point of a vector field X on M and let (U,«) bea 
local chart at m. Convince yourself by an explicit calculation that the eigen- 
values of the matrix 0;(X ‘o«—!)(k(m)) are independent of the choice of x. 

3.6.4 Along the lines of Example 1.1.12, construct an atlas on the suspension V“ 
with period 7 of a finite-dimensional real vector space V relative to an au- 
tomorphism a € GL(V), see Definition 3.6.6. 

3.6.5 Show that non-degenerate equilibria of vector fields and non-degenerate 
fixed points of local diffeomorphisms are isolated, cf. Remark 3.6.11. 

3.6.6 Consider the Mobius strip as a real one-dimensional vector bundle over S! 
and let @ denote the angle coordinate on S!. Show that every maximal inte- 
gral curve of the vector field dg is periodic and determine the characteristic 
multiplier for each of them. 

3.6.7 Draw a picture of the flow of the vector field of Example 3.6.12. Compare the 
behaviour of the flow near the elliptic equilibria with that near the hyperbolic 
equilibrium. 

3.6.8 Prove Eq. (3.6.9). 
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3.6.9 The harmonic oscillator in one dimension with frequency @, linear friction 
coefficient a > 0, elongation x and velocity y is modelled in phase space by 
the linear vector field 


X = ydy — (wx + 2ay)ay (3.6.12) 


on R?. Determine the type of flow about the equilibrium point 0 according 
to Example 3.6.13 for the cases a = 0,0 <a <a, a =a anda > wo, and 
discuss how the integral curves change with increasing a. 

3.6.10 The index of a non-degenerate equilibrium {m} of a vector field X is de- 
fined to be the sign of the determinant of Hess,, (X). Determine the index of 
the equilibrium {0} for the vector field X = —x!@; (summation convention) 
on R”. 


3.7 The Poincaré Mapping 


Let M be asmooth manifold and let X be a vector field on M with flow ®@: J—> M. 
Let y be a periodic integral curve of X with period 7. As mentioned before, a 
large part of the analysis of the flow of X near y can be reduced to the study of a 
certain local diffeomorphism. This will be constructed now. The idea is that instead 
of continuously watching the flow along the integral curves near y it suffices to 
record where these integral curves hit a certain transversal submanifold. This leads 
to the notion of Poincaré mapping. 

Let mg € y and let Y be a submanifold of codimension 1 which is transversa 
to the integral curves of X and which satisfies AM y = {mo}. For dimensional 
reasons, transversality implies 


Xm¢TmF forallme J. (3.7.1) 


27 


The subset of points m € Y for which there exists t > 0 such that &,;(m) € FY will 
be called the returning subset of Y and will be denoted by Aret. Due to (3.7.1), 
integral curves of X intersect Y at isolated values of t. Thus, for every m € Aret, 
there exists a minimal flow parameter value t(m) > 0 such that ©;(,)(m) € A. The 
assignment m +> t(m) defines a function tT : Aree > R+, called the first return time 
function. The corresponding mapping 


O:Pme—> P, — O(m):= Gym) (m), O72) 


is called the first return mapping. Obviously, t (m9) = T and © (mo) = mo. Neither 
t nor © need be continuous: for example, view the Mobius strip as the suspension 
of the automorphism s +> —s of R, cf. Example 3.6.7/1. Let y be the central integral 
curve of the corresponding flow and let Y be given by the equivalence classes of 
pairs (x, 0) with b < x <a, where b < 0 and a > 0. Unless b = —a, the mappings 
t and © are not continuous. 


?7Recall that, in particular, is transversal to all integral curves it does not intersect. 
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Lemma 3.7.1 


1. Aret is an open neighbourhood of mo in Y. 
2. © is smooth iff t is smooth. In this case, for all Y € Tiny FY, 


O'(Y) =1t'(Y)Xmp + (@r)' (VY) = (Pr) (t'(Y)Xmg +). (3.7.3) 


Proof 1.Letm € Pyet. Due to (3.7.1), Xoim) # 0. According to Proposition 3.2.17, 
there exists a flow box chart (U,«) for X at O(m). Since # is transversal to the 
integral curves of X, k(U MN #) is transversal to the maximal integral curves of 
the local representative of X. Thus, by possibly shrinking U transversally to the 
direction of the flow we can achieve that «(U M #) intersects all of these integral 
curves. Then, pry, (®~!(U MN Y) \ ({0} x M)) = pry (®~!(U) \ ({0} x M)). Since 
pry is an open mapping, this is an open subset of M. Hence, intersection with Y 
yields an open subset of Y which contains m and which is contained in Pret. 

2. Obviously, if t is smooth then so is ©. Conversely, assume that © is smooth 
and let m € Aer be arbitrary. As before, there exists a flow box chart (U, «) for X 
at ©(m). By possibly shrinking U transversally to the direction of the flow we can 
achieve that for every m € O~'!(UN F) one has t(m) € G;, and Prim) (m) € U. 
Let pr, : R” — R denote the projection to the first coordinate (the one which corre- 
sponds to the flow parameter). Since O(m) = ®,(j)(m) and ®,(,)(m) are both in 
UN &, the difference t(m) — t(m) is given by 


t(m) — t(m) = pr; ok(O(M)) — pry ok (Prim) (M)). 


Since the right hand side is smooth in the variable m, t is smooth on the open neigh- 
bourhood @7!(U MN Y) of m in Are. Finally, (3.7.3) follows by a straightforward 
calculation using the product rule. 


Definition 3.7.2 (Poincaré mapping) Let y be a periodic integral curve and let 
mo € y. A Poincaré mapping for y at mo is a triple (Y, W, ©), where 


1. P is an embedded submanifold of M of codimension | which is transversal to 
every integral curve of X and which satisfies AN y = {mo}, 

2. W is an open neighbourhood of mo in Ye, and O : W — F is the restriction”® 
to W of the first return mapping associated with Y, 


such that © is a diffeomorphism onto its image. 


Theorem 3.7.3 (Existence) For every periodic integral curve y of a vector field and 
for every mo € y, there exists a Poincaré mapping at mo. 


Proof As before, denote the manifold by M, the vector field under consideration 
by X and let d: Y— M be the flow of X. Since y is periodic, Xm . 4 0, hence 
there exists f € C°(M) such that X,.(f) 40. Choose f such that f (mo) = 0. 


8Denoted by the same letter. 
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According to the Level Set Theorem, the subset N C f~!(0) of regular points is 
an embedded submanifold of M of codimension | containing mg. There exists an 
open neighbourhood ¥ of mo in N such that Xm(f) 40 for all m € Y, that is, 
is transversal to the integral curves of X. Consequently, (3.7.1) holds and the 
integral curves intersect Y in isolated points. Thus, by possibly shrinking Y we 
can achieve that YM y = {mo}. Due to the existence of flow box charts at mg, we 
may furthermore assume that there exists ¢; > 0 such that (—¢1,€;) x AC and 
®,(m) ¢ F for all m € F and 0 <|t| < 1. Since Gp». = R, there exists an open 
neighbourhood W of mo in Y and a > 0 such that (0, T + a) x W Cc Y. Consider 
the smooth function f o ® on (0, T +a) x W. We have 


d 
fo®(T,mo) =0, ag 
Hence, after possibly shrinking WY, the Implicit Mapping Theorem 1.5.10 yields 
0 < €2 <a and a smooth function tT: W — (T — €2, T + €2) such that for every 
méW andt € (T —&2, T +2) there holds f o &(t, m) = 0 iff t = Z(m). Define”? 


O:WoN, O(m) := Dz(m)(m). 


Since F is open in N and since © is continuous, by possibly further shrinking W 
we can achieve O(W) C Y. Then, © is asmooth mapping from W to FY satisfying 
(3.7.3). Since Xing ¢ Tmo Y and (Pr), is injective, the latter implies that 6}, 
is injective and hence bijective. According to the Inverse Mapping Theorem, by 
possibly shrinking YW once more we can achieve that © is a diffeomorphism onto 
an open subset of FY. 

Finally, we show that Y and Y can be shrunk so that tT becomes the first return 
time mapping t and © becomes the restriction to W of the first return mapping 
© associated with Y. Assume, on the contrary, that this is impossible. Then, ev- 
ery neighbourhood V of mo in # contains a point m such that ;(m) € V for 
some t € (0, T(m)). By our choice of Y and by the uniqueness property of 7, then 
é€<t<T—e, where ¢ = min{e1, €2}. Thus, we can find sequences {m,} in Y and 
{tr} in [e, T — e] such that my, — mo and ®;, (mn) > mo. On the other hand, the 
sequence {t,} has a cluster point fo in [e, T — e] and, by passing to a subsequence 
converging to fo, we obtain ®,, (11n) > ®,.(mo). Since Y is Hausdorff, the two 
limits coincide, that is, there holds ®;,,(mo) = mo (contradiction). This completes 
the proof. 


Remark 3.7.4 Let (Y,W, ©) be a Poincaré mapping for y at mo. One can show 
that, by possibly shrinking Y and WY, one may always assume that there exists 
é > 0 such that 


2°Note that 7 need not be the first return time mapping and © need not be the first return mapping 
associated with Y. 
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. (-e,T+8e)xWcY, 

. the first return time function t takes values in (T — ¢,T + 8), 

. if &:(m) € FY for some (t,m) € (—e, T + €) x W, thent =0 ort =t(m), 

. for any neighbourhood U of mo in YW, ®((—«, T + €) x U) is a neighbourhood 
of y in M. 


BRWN re 


The proof is left to the reader (Exercise 3.7.1). 
Next, we study the relation between Poincaré mappings at different points of y. 


Proposition 3.7.5 (Uniqueness) Any two Poincaré mappings (Yi, HW, ©;) for y at 
m; € y, i = 1,2, are equivalent in the Polos sense.° There exist open neigh- 
bourhoods Y; of mj in PY; and Wee 0; UD, ) of m; in WG as well as a smooth 


mapping i: PY, — R such that the assignment mt> Bim) (m) defines a diffeomor- 
phism 9 : Pi > P Satisfying pW) = = Wy and 


@2,09(m)=~oO\(m) forallme%. (3.7.4) 


Proof There exists a unique t € [0, 7) such that ®;, (m1) = m2. According to Re- 
mark 1.8.4, there exists f2 € C°(M) such that Ay = i (0). Use f2 and the initial 
solution (t,, m1) of the equation f) o @(t,m) =0 to construct the smooth function 
4: YA, > Rand the diffeomorphism 9¢ : Py > Prin exactly the same way as T 
and ©, respectively, in the proof of Theorem 3.7.3. The resulting open neighbour- 
hoods Y; of m; in Y; can be shrunk ina 1 compatible way such that there exists « > 0 
satisfying (—e, © x Py CGand © (P»)N Yo = @ for all 0 < |t| < ©, as well as 
|A(m) — t\| < & for all m € Y\. Then, choose Y so that m; « WH c oO; \(P,), 
(WM) = = Wy aaa |ti)(m) — T| < 5 5 for all m € Y. After these adjustments, (3.7.4) 
holds. Indeed, for any m € W,, beh @2 0 y(m) and gy o ©;(m) are elements of Py 
and ©2 0 g(m) = By o go Oi (m), where 


A =(m) — A(@1(m)) + 12(y(m)) — t1(m). 
Due to 
|A| < |A(m) — 2(O1(m))| + |rx(gGm)) — T| + |t10m) — T| <e, 
it follows A = 0 and hence (3.7.4). This proves the proposition. 


Finally, we examine the relation between Poincaré mappings and the period au- 
tomorphisms. 


Proposition 3.7.6 Let y be a periodic integral curve of X and let mo € y. Let 
(P,W, ©) be a Poincaré mapping for y at mo. By restriction, the natural projec- 
tion pr: TmM — Nmy induces an isomorphism x :TmY—> Nmy which satisfies 


x0 @! =Phox. (3.7.5) 


30Briefly, their germs at m; are conjugate in the sense of Definition 3.6.5. 
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In particular, the eigenvalues of ©}, coincide with the characteristic multipliers of 
Xaty. 


Proof For dimensional reasons, (3.7.1) implies that x is bijective and hence an iso- 
morphism. Since pr(Xj,.) = 0, (3.7.5) follows from (3.7.3). 


Example 3.7.7 


1. Consider the modified harmonic oscillator of Example 3.6.15. As a submanifold 
transversal to the integral curves of X we may choose, in the coordinates x and y, 
FP = {(x,0): x > 0}. Denote the points of Y by x. From (3.6.10) we read off 
the first return time function t and the first return mapping ©, 


t(a)=T = =, O(x) = (14 (x2 = Ne)? 


Obviously, © is a diffeomorphism of Y onto itself, hence, if we choose W = Y, 
then (Y, W, ©) is a Poincaré mapping for y. Let us add that the iterates of O 
are given by 


OF (x) = (14+ (x? - Le ark) 3, (3.7.6) 


which for large k is given asymptotically by O* (x) ~ 1 — 5 (x? =e **, 

2. Let V be a finite-dimensional real vector space, let a € GL(V) and let T € R,. 
Consider the flow on the suspension V“ of V relative to a with period T which is 
given by the suspension 47“ of a with period T. Let mo := [(0,0)] € V“% and let 
y be the central integral curve of “ (which passes through mg). Under the iden- 
tification of V with the embedded submanifold {[(v, 0)]: v € V} of V“, the triple 
(F,W,O)=(V, V, a) is a Poincaré mapping for y at mo (Exercise 3.7.4). 


Exercises 

3.7.1 Show that every Poincaré mapping can be shrunk so that it satisfies condi- 
tions 1-4 of Remark 3.7.4. 

3.7.2 Provide the details for the construction of the smooth function A and the dif- 
feomorphism ¢ in the proof of Proposition 3.7.5. 

3.7.3 Verify Formula (3.7.6) of Example 3.7.7. 

3.7.4 Prove that the mapping given in Example 3.7.7/2 is a Poincaré mapping for 
the central integral curve of the suspension of a vector space automorphism. 


3.8 Stability 


As already mentioned in the introduction, in the study of stability of a dynamical 
system one is concerned with the long-time behaviour of a flow, with emphasis 
both on returning properties and on attraction properties of integral curves. In this 
context a variety of concepts and a lot of subtle techniques exist. In our elementary 


142 3 Vector Fields 


introduction to this field, we limit our attention exclusively to orbital stability, which 
we merely call stability here. Only at the end of this section we will make some 
remarks on other concepts. 

We give parallel definitions of stability for flows and for local diffeomorphisms. 
First, let X be a vector field on M and let ® : J— M be its flow. A subset A of M 
is said to be invariant under ® if ®,(m) € A for all m € A and t € Y,,. Of course, 
critical integral curves provide important examples for invariant subsets. 


Definition 3.8.1 (Stability for flows) A ®-invariant subset A C M is called 


1. stable under ® if for every neighbourhood U of A in M there exists a neigh- 
bourhood V of A in M such that R, x V C Zand @;(m) € U forall m € V and 
te Ri, 

2. asymptotically stable under @ if it is stable under @ and if there exists a neigh- 
bourhood V of A in M with the following properties: 

(a) Ry x VCY, 

(b) for every m € V and every neighbourhood U of A there exists tg € R+ such 
that };(m) € U for all t > fo. 

In this case, V is called a basin of attraction?! for A under @. 


Now, let g be a local diffeomorphism of M. A subset A of M is said to be 
invariant under g if g(m) € A for all m € A which are in the domain of @. For a 
point m in the domain of gy and k € Z, we say that y*(m) is defined if, successively, 
g'(m), wees g*—l(m) is in the domain of ¢. 


Definition 3.8.2 (Stability for local diffeomorphisms) A g-invariant subset A C M 
is called 


1. stable under ¢ if for every neighbourhood U of A in M there exists a neighbour- 
hood V of A in M such that for allm € V andk € Zi, gk (m) is defined and lies 
in U, 

2. asymptotically stable under ¢ if it is stable under ¢ and if there exists a neigh- 
bourhood V of A in M with the following properties: 
(a) gk (m) is defined for all m € V andallk € Z,, 
(b) for every m € V and every neighbourhood U of A there exists kp € Z such 

that o<(m) € U for all k > ko. 

In this case, V is called a basin of attraction for A under ¢. 


Remark 3.8.3 


1. Stability and asymptotic stability are local properties. They do not change if the 
vector field or the local diffeomorphism is modified outside a neighbourhood 
of A. As a consequence, if A is compact, without loss of generality one may 
assume that X is complete. Similarly, in the case of a local diffeomorphism ¢, 


3! Beware that in the literature it is common to use this name for the largest such set. 
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if A is compact and contained in the domain of ¢, without loss of generality one 
may assume that ¢ is a global diffeomorphism (Exercise 3.8.1). 

2. If A is compact, hence in particular if A is a critical integral curve, the definition 
of (asymptotic) stability may be formulated in terms of a metric p compatible 
with the topology of M. The proof is left to the reader (Exercise 3.8.3). For 
BC M, define 


(x, B):= inf{ p(x, y): y € B}. 


A @-invariant subset A of M is 

(a) stable if for every ¢ > 0 there is 5 > 0 such that p(m, A) < 6 implies Ry C 
Qn and p(®;(m), A) <e« forall tf € Ry. 

(b) asymptotically stable if there exists 6 > 0 such that p(m, A) < 6 implies 
R+ C By and p(®;(m), A) > 0 for t > co. 


The primary aim of this section is to derive stability criteria for critical integral 
curves. This includes 


(a) stability criteria for periodic integral curves in terms of the period mapping ®r 
and a Poincaré mapping, 

(b) stability criteria for the linearized flow and the discussion of how stability under 
the linearized flow is related to stability under the flow itself, 

(c) a stability criterion in terms of a Lyapunov function. 


We start with deriving stability criteria for periodic integral curves in terms of 
the period mapping and a Poincaré mapping. For that purpose we need 


Lemma 3.8.4 Let A be a ®-invariant subset and let t > 0 such that [0,t] x A C &. 
For every neighbourhood U of A in M there exists a neighbourhood V of A in M 
such that [0,t] x V C and ®,(V) CU for all s € [0, t]. 


Proof Since Y is open in R x M, for every m € A, there exists an open neigh- 
bourhood Vo, of m in M such that {t} x Vom C Y. Then, Vo = Lye A Vom is an 
open neighbourhood of A in M satisfying {t} x Vo C Z and Proposition 3.2.10/2 
implies [0,t] x Vo C J. By restriction, ® induces a continuous mapping @ : 
(0, t] x Vo > M. Let m € A. Due to &([0,t] x {m}) CA CU, &71(U) is a 
neighbourhood of [0,f] x {m} in [0,t] x Vo. Since [0,7] is compact, the Tube 
Lemma of elementary topology yields a neighbourhood V,, of m in Vo such that 
[0,t] x Vin C (VU). Then, V,, is a neighbourhood of m in M and for s € [0, ft] 
there holds ®,(V,.) = P({s} x Vin) CU. Then, V :=U,,c4 Vn has the desired 
properties. 


me 


Proposition 3.8.5 (Stability of periodic integral curves) Let y be a periodic integral 
curve with period T. Let mo € y and let (Y,W, ©) be a Poincaré mapping for y 
at mo. The following statements are equivalent. 
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1. y is stable under the flow of X. 
2. y is stable under the local diffeomorphism ®r of M. 
3. mg is stable under the local diffeomorphism © of F. 


The equivalence remains valid if stable is replaced by asymptotically stable. 


Proof Since stability is a local concept, we may shrink Y and YW so that there 
exists 0 < e < T satisfying conditions 1-4 of Remark 3.7.4. For U c W, denote 
U* := @_¢,r+e)(U). Since for m € W and k € Z+ such that ©*(m) is defined 
there holds @!(m),..., 0*—'(m) € W, one can put 


T}(m) = t(m) + t(O(m)) +--+ 7(O*|(m)). 


Then, 0*(m) = Pr (m)(m) and t;(m) => k(T — €). 
1 = 3: Let y be stable under ®. Let U be a neighbourhood of mo in Y. Define 
Uo :=W NU and U, := Up N O~!(Up). Condition 3 of Remark 3.7.4 implies 


UiNFACUo. (3.8.1) 


According to condition 4 of this remark, Uj is a neighbourhood of y in M. Hence, 
by assumption, there exists a neighbourhood V; of y such that Ry x V; C @ and 
;(m) € U; for all m € V; and t € Ry. Define V:= WN V, and let m € V. We 
show by induction that for all k € Z,, @*(m) is defined and lies in Up. Due to 
Uo CU, this implies that 79 is stable under ©. For k = 1, the assertion holds due to 
V CU, and O(U,) C Up. Thus, assume that @*(m) is defined and lies in Up. Due 
to Up C W, then ot!) is defined, hence so is t,+. (mm). Since t%+1(m) > 0, we 
have OF+1 (m) = ®z,, (m)(m) € U?, hence O*+!(m) € Up by (3.8.1). 

3 = 2: Assume that mo is stable under ©. Let U be a neighbourhood of y in M. 
According to Lemma 3.8.4, U contains a neighbourhood Up of y in M such that 
®,(Ug) C U for all s € [0, T + €]. Then, Uj) = Up N A is a neighbourhood of 
mo in &. By assumption, there exists a neighbourhood V; of mo in # such that 
for all me Vj and/le Zi, oe! (m) is defined and lies in U,. By condition 4 of 
Remark 3.7.4, V := Vy is aneighbourhood of y in M. Since @! is defined on V, for 
all/ € Z,, there holds Ry x V; C and hence R+ x V C J, so that or is defined 
on V for all k € Z,. Let m € V and k € Z,. Write m = &,(m) with m € V; and 
t € (—e, T +e). There exist / € Z, ands € [0, T +e] suchthatkT +t=t(m)+s. 
Then, Pk (m) = @,(@!(m)) and hence ok (m) € U. Thus, y is stable under @r. 

2 => 1: Let y be stable under @7. Let U be a neighbourhood of y in M. Accord- 
ing to Lemma 3.8.4, there is a neighbourhood Uo of y in M such that ®,(Uo) C U 
for all s € [0, T]. By assumption, for Up there exists a neighbourhood V of y in M 
such that for allm € V andk € Z,, ok (m) is defined and lies in Ug. That ok is de- 
fined on V for all k € Zs implies, in particular, Ry x V C J. Letm € V andt € Ry. 
Write t= kT +s with k € Z, and s € [0,7]. Then, ®,(m) = b,(Dk(m)) EU. 
Hence, y is stable under @. 

The proof for asymptotic stability is analogous and, therefore, left to the 
reader. 


Next, we derive stability criteria for the linearized flow along a critical integral 
curve. 
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Definition 3.8.6 (Linear stability) 


1. A critical integral curve y of a vector field X is called linearly stable or lin- 
early asymptotically stable if the zero section*” of Ny is stable or asymptotically 
stable, respectively, under the linearized flow of X. 

2. A fixed point m of a local diffeomorphism of M is called linearly stable or 
linearly asymptotically stable if the origin of T,,M is stable or asymptotically 
stable, respectively, under j,. 


First, we study the stability of the origin of an abstract finite-dimensional real 
vector space V under the flow of a linear vector field and under a vector space 
automorphism. The key ingredient is an appropriate decomposition of the corre- 
sponding spectrum. For a linear mapping L on V, let specg (L), specg (L) and 
specy(L) denote, respectively, the subsets of eigenvalues with positive, negative 
and zero real part. Analogously, let spec} (L), spec; (L) and spec, (L) denote, re- 
spectively, the subsets of eigenvalues with absolute value larger, smaller and equal 
to 1. In addition, for i = 0, 1, let specé (L) denote the subset of spec;(L) of eigen- 
values whose geometric and algebraic multiplicities coincide** and let spec? (L) = 
spec; (L) \ spec4 (L). For i =0, 1 anda =+,-—,d,n, let E?(L) and E;(L) denote 
the subspace of V spanned by the algebraic eigenspaces** of the eigenvalues in 
speci (L) and spec; (L), respectively. Thus, one has the disjoint decompositions 


spec(L) = specy (L) U specg(L) U specy (L), (3.8.2) 
specg(L) = spec (L) U specp(L), ~ 
spec(L) = spec, (L) U spec, (L) U spec] (L), 

p pec, pec; Pec, (3.8.3) 


spec, (L) = spec4 (L) U spec} (L), 
and the corresponding direct sum decompositions 
V=ES(L)@ EL) GEL),  Eo(L) = EAL) PERL), (G.84) 
V = E; (L)@ E\(L) @ Ej (L), E\(L)=E{(L)@ EX(L), (3.8.5) 
where all subspaces in (3.8.4) and (3.8.5) are invariant under L. We need the fol- 
lowing two facts from Euclidean geometry. 
Lemma 3.8.7 Let V be a finite-dimensional real vector space. 


1. Let A € End(V) and let c,d € R such that c < Re(A) <d for all X € spec(A). 
There exists a scalar product (-,-) on V such that c(v, v) < (Av, v) <d(v, v) for 
all v € V \ {O}. 


32Recall that for an equilibrium y = {m}, the zero section of Ny reduces to the origin of T,,M. 
33This means that L is diagonalizable on the corresponding algebraic eigenspace. 


That is, Lpy, — Aidy, is nilpotent. 
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2. Let a € GL(V) and let c,d > 0 such that c < |X| <d for all . € spec(a). There 
exists a scalar product on V whose norm satisfies c|\v|| < ||av|| < d||v|| for all 


veV\ {0}. 


Proof 1. It suffices to show that each algebraic eigenspace of A admits a scalar 
product with the desired property and to take the orthogonal direct sum of these 
scalar products. Thus, we may assume that A has a single Jordan block with eigen- 
value 2. Choose a basis in V such that the corresponding matrix of A has Jordan 
normal form, cf. Example 3.2.8. Let (-, -)g denote the scalar product on V making 
this basis orthonormal. Let r = dim V in case d is real and r = dim otherwise. 
Choose 0 < ¢ < min{d — Re(A), Re(A) — c} and let T denote the ippet triangular 
(r x r)-matrix with entries Ty := ¢”—! for k <1. Then, 
(Tv, Tw)o AER, 
(U,W) i= 2 
eet AER 
defines a scalar product on V with the desired property (Exercise 3.8.4). Here, R(T) 
denotes the realification of complex matrices defined in Example 3.2.8. 

2. Let p(a) = max{|A| : A € spec(a)} denote the spectral radius of a. Due to 
pia ty! = min{|A| : 4 € spec(a)}, there exist co,do € R such that c < cg < 
play"! and p(a) < dy <d. Choose some scalar product (-,-)y on V and define 

[o,@) 
(v, Ww) = Sia “a, cha *w ot 3) (dy kaky, dy” Pies 
k=1 
To see that the series on the right hand side converge, choose c;, d} € R such that 
co <ci < pay" and p(a) < d, < dp. Use the Cauchy-Schwarz inequality and 
the fact that for the operator norm associated with the scalar product (-,-)gy one has 


. aR : é . . : : 
p(a—!) = limg_, 00 ||a~*||F. to estimate these series by geometric series with ratios 
dc = A and qq = as respectively. Finally, renaming summation indices one finds 


lav? =a > Ie “fp ed +B las ‘aku 


Replacing do by cg or co . do, one obtains a lower estimate or an upper estimate, 
respectively. Thus, ||av||? > co|lul|? > cllul|? and ||av||? < do|lull? < d|lv||*. 


Lemma 3.8.8 Let V be a finite-dimensional real vector space. 


1. For every A € End(V) there exists an adapted scalar product on Ey (A) whose 
norm satisfies, for all v € Eg (A) \ {0}, 


eH v], < lula, eR, 
lim |je’v||_, =00, lim |je'4v], =0. (3.8.6) 
t—-b0o = t—>=0o se 


2. For every a € GL(V) there exists an adapted scalar product on Ey (a) whose 
norm satisfies, for v € Ey (a) \ {0}, 


1 : k _ ‘ k — 
Ja" vf, <tols, lim atvli,=00, im Jatul,=0. 8.8.7 
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Proof 1. First, consider Ep (A). There is d < 0 such that Rea <d for all A € 
specy (A). According to Lemma 3.8.7/1, there is a scalar product (-,-) on Eq (A) 
such that (Av, v) <d|lv||? for all v 4 0. Then, 


J fle! vl! _ (Ae'4y, ev) 
2||e"v| lle" u| 


There follows |e" v|| < e“'||v|| for all t > 0 and |je’4v|| > e“||v|] for all t < 0. Due 
to d < 0, this yields (3.8.6) with the lower signs. The assertion for ES (A) follows 
by replacing A by —A. 

2. First, consider EF, (a). There is d < 1 such that |A| <d for all 4 € spec, (a). 
According to Lemma 3.8.7/2, there exists a scalar product on FE, (a) whose norm 
satisfies ||av|| < d|lv|| for all v 40. Then, ||a*v|| < d*|\v|) for all k € Z, and 
\|a*v|| > d*||v]| for all k € Z_. Since d < 1, this implies (3.8.7). The assertion for 


Ey (a) follows by replacing a by a7!. 


{ |el4v] = <dleMv|. 


Lemma 3.8.8 yields the following stability criterion for the origin under a linear 
flow or a vector space automorphism. 


Proposition 3.8.9 (Stability under linear mappings) Let V be a finite-dimensional 
real vector space. 


1. Let A € End(V). The origin of V is asymptotically stable under e!4 if and only if 
spec(A) = spec, (A). It is stable under efA iff spec(A) = specg (A) U spec4 (A). 

2. Let a € GL(V). The origin of V is asymptotically stable under a if and only if 
spec(a) = spec (a). It is stable under a iff spec(a) = spec, (a) U spec? (a). 


Proof 1. Choose scalar products on E5 (A) according to Lemma 3.8.8. Choose a 
basis in Eg(A) such that the restriction of A to Eo(A) has Jordan normal form and 
define the scalar product on Eo(A) such that this basis is orthonormal. Using (3.2.9) 
and (3.2.10) one can check that 


(a) e4 acts isometrically on ES (A), 
(b) if Ej (A) 40, there exists a nonzero v € V such that lle" ul] = JT 4+ #2 Iv]. 


These two observations and (3.8.6) imply the following. 


For all v € Eg (A), lim;+o0 l|e’4 || = 0. Hence, if V = Ep (A), 0 is asymptoti- 

cally stable. 

For all v € Eg (A) ® E4(A), |e’ v|] < || vl]. Hence, if V = Ey (A) ®@ E4(A), O is 

stable. 

— If V AE, (A), there is v € V with \je’4 v|| = |||]. Hence, 0 is not asymptotically 
stable. 

-IfVAE, (AG E4(A), there is v € V with lim,_,o9 ||e’4 || = o0. Hence, 0 is 

not stable. 


2. In the argument for point 1, replace the lower index 0 by 1,t ER, byk €Z,, 
eA by a* and the reference to (3.8.6) by that to (3.8.7). We have to check that (a) 
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and (b) remain valid under these replacements. For (a), this is obvious. For (b), it 
follows by observing that a* = diag(B*,..., Bk ), where B; are the Jordan blocks 
of a, and by calculating the k-th power of a Jordan block of complex dimension 
m; = 2. If the corresponding eigenvalue A; is real, this yields 


k k-1 k k—mj+1 
Ai kA; oe (nj) i 
Bk=| 
: Ss ae 
0 sue 0 rk 


Otherwise, one has to replace the numbers 0 and A; by the 2 x 2-matrices 0 and 
M(A;), respectively; see Example 3.2.8 for the notation. In either case, (b) holds 
true. 


Corollary 3.8.10 (Linear stability) 


1. An equilibrium of a vector field is linearly asymptotically stable if and only if all 
characteristic exponents have negative real part. It is linearly stable if and only 
if all characteristic exponents have nonpositive real part and for those with zero 
real part, the geometric and algebraic multiplicities coincide. 

2. A periodic integral curve of a vector field or a fixed point of a local diffeomor- 
phism is linearly asymptotically stable if and only if all characteristic multipli- 
ers lie in the open unit disk. It is linearly stable if and only if all characteristic 
multipliers lie in the closed unit disk and for those lying on the unit circle, the 
geometric and algebraic multiplicities coincide. 


Proof For equilibria of vector fields and fixed points of local diffeomorphisms, the 
assertion follows directly from Proposition 3.8.9. To prove it for a periodic inte- 
gral curve y of a vector field X, choose m € y and consider the suspension of 
Nmy relative to the period automorphism P,,. According to Proposition 3.6.8, the 
(asymptotic) stability of the zero section of Ny under the linearized flow is equiv- 
alent to that of the central integral curve of the suspension of P?,. According to 
Example 3.7.7/2 and Proposition 3.8.5, the latter is equivalent to the (asymptotic) 
stability of the origin of Nmy under P),. Hence, the assertion follows from Propo- 
sition 3.8.9/2 by setting V =N,,y anda = Py. 


Next, we discuss how linear stability is related to stability. A partial answer fol- 
lows from the Grobman-Hartman Theorem. We cite versions for flows and for local 
diffeomorphisms. 


Theorem 3.8.11 (Grobman-Hartman) 


1. Let X be a vector field on M and let m be an equilibrium point. There exist open 
neighbourhoods U of m in M and V of the origin in TM _ such that the flow 
of X}y is topologically conjugate to the restriction of the linearized flow (®;);,, 
toV. 
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2. Let g be a local diffeomorphism of M and letm € M be a hyperbolic fixed point 
of v. There exist open neighbourhoods U of m in M and V of the origin in T,M 
such that the restrictions of p to U and of y' to V are topologically conjugate. 


Proof See [236] or [248]. Detailed proofs of assertion | can also be found in [123, 
§IX.7] and [207, §3.6]. 


Let us add that under certain additional conditions on the characteristic exponents 
or multipliers, respectively, the conjugacy is of class C’ for some r. See [257, §5.8] 
for examples and further references. The Grobman-Hartman Theorem does not ex- 
tend beyond hyperbolic critical integral curves, see Example 3.8.14 for a counterex- 
ample. It has, however, a generalization to arbitrary critical integral curves, known 
as the principle of reduction to the centre manifold, see Remark 3.9.13 in the next 
section. 


Corollary 3.8.12 (Stability in the hyperbolic case) A hyperbolic critical integral 
curve of a vector field or a hyperbolic fixed point of a local diffeomorphism is stable 
if and only if it is linearly stable. If it is stable, it is asymptotically stable. 


Proof Since stability is a local property, for equilibria of vector fields and fixed 
points of local diffeomorphisms, the assertion follows immediately from Theo- 
rem 3.8.11. To prove it for a hyperbolic periodic integral curve y of a vector field X, 
let m € y and let us choose a Poincaré mapping (Y7, YW, ©) for y at m. Accord- 
ing to Proposition 3.8.5, the stability and asymptotic stability of y under the flow 
of X is equivalent to the stability and asymptotic stability, respectively, of m un- 
der ©. According to Proposition 3.7.6, the critical multipliers of y coincide with 
the eigenvalues of ©/,. In particular, m is a hyperbolic fixed point of © and the 
Grobman-Hartman Theorem 3.8.11 implies that m is stable or asymptotically stable 
under © iff so is the origin of T,, Y under ©/,. Then, linear stability follows from 
Proposition 3.8.9/2 and Corollary 3.8.10/2. 


Remark 3.8.13 


1. The Grobman-Hartman Theorem holds for periodic integral curves as well, pro- 
vided one generalizes the notion of conjugacy by allowing for a reparameteriza- 
tion of the flow parameter. 

2. Besides the study of the stability of hyperbolic critical integral curves, there are 
many other important applications of the Grobman-Hartman Theorem, so that it 
is actually not quite adequate to treat it in a section on stability. For example, 
combining Theorem 3.8.11/1 with the classification of linear vector fields gener- 
ated by a hyperbolic endomorphism up to topological conjugacy, one finds that 
in the vicinity of a hyperbolic equilibrium point m, the flow of a vector field 
X is topologically conjugate to the linear flow on R” generated by the matrix 

In,.n_ with nz = dim Ef (Hess (X)); see [21, §22]. Similarly, combining the 

Grobman-Hartman Theorem for periodic integral curves with Proposition 3.6.8 


150 3 Vector Fields 


and the classification of hyperbolic linear automorphisms up to topological con- 
jugacy, one finds that in the vicinity of a hyperbolic periodic integral curve y 
with period 7, up to a rescaling of the time parameter, the flow ® of a vector 
field is topologically conjugate to the suspension with period | of the diago- 
nal real (n — 1) x (nm — 1)-matrix with n, entries e and n_ entries e—!, where 
n4=dimE 1 (PT )in)s m € y, and where the first entry carries the sign of the 
determinant of (@r)/,,; see [148]. These results provide normal forms for the 
flow near a hyperbolic critical integral curve. 


As a consequence of Corollary 3.8.12, for a critical integral curve of a vector 
field, linear asymptotic stability implies asymptotic stability, because it implies both 
linear stability and hyperbolicity. However, neither does linear stability imply sta- 
bility, nor does asymptotic stability imply linear asymptotic stability. Indeed, while 
a critical integral curve which is linearly stable but not linearly asymptotically sta- 
ble (and hence not hyperbolic) may be stable, as in the case of the equilibrium and 
the periodic integral curves of the frictionless harmonic oscillator, it may as well be 
asymptotically stable or unstable, as is shown by the following example. 


Example 3.8.14 Let M = R? and 

X= (-y+af(x° +y°))dx +(e + yf (x? +9") dy, 
where f : R > R is a smooth function satisfying f(0) = f(1) = f’(1) =0 and 
(s — 1) f(s?) £0 for s £0, 1.*° The calculations necessary for the following are 
left to the reader (Exercise 3.8.5). 

There exists one equilibrium, the origin, and one periodic integral curve, given 
by y(t) = (cos(f), sin(t)). The origin has characteristic exponents +i (hence it is 
elliptic) and y has characteristic multiplier 1. In case (s — 1) f (s*) <0 for all 
s #0, 1, the origin is unstable and y is asymptotically stable. Vice versa, in case 
(s — 1) f(s?) > 0 for all s £0, 1, the origin is asymptotically stable and y is un- 
stable. Let us add that in either case, according to the characteristic exponents, the 
linearized flow on ToM is that of a centre (type 4(c) of Example 3.6.13) and hence 
consists of periodic integral curves. Therefore, it cannot be topologically conjugate 
to the flow itself. 


This completes the discussion of linear stability criteria for critical integral 
curves. Since these criteria are formulated in terms of characteristic exponents and 
multipliers, that is, in terms of the spectrum of the associated characteristic linear 
mapping, they are often referred to as spectral stability criteria. In contrast to that, in 
the rest of this section, we discuss a functional stability criterion, that is, a criterion 
which relates stability to the existence of certain functions. Let X be a vector field 
on M, let 8: Y= M be its flow and let y be a critical integral curve of X. 


Definition 3.8.15 (Lyapunov function) A Lyapunov function for y consists of an 
open neighbourhood W of y and a continuous function f: W — R such that 


35To compare with Example 3.6.15, replace x by —x and set w = | there. 
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1. RixWcY, 
2. f(m)=0 forallm ey and f(m) >0forallmeW\y, 
3. f(&:(m)) < f(m) for all m € W \ y and all t € Ry. 


If the inequality in condition 3 holds strictly, (W, f) is called a strong Lyapunov 
function. 


If f is differentiable, for condition 3 to hold it suffices that X,,(f) <0 for a 
Lyapunov function and that Xj, (f) <0 for a strong Lyapunov function. 


Theorem 3.8.16 (Lyapunov) Let y be a critical integral curve of a vector field. 
If there exists a Lyapunov function for y, then y is stable. If there exists a strong 
Lyapunov function for y, then y is asymptotically stable. 


In the case of an asymptotically stable critical integral curve, the converse holds, 
too: if y is asymptotically stable, then there exists a strong Lyapunov function, see 
[44, Thm. V.2.2]. 


Proof Let f:W — R be a Lyapunov function. Let U be a neighbourhood of y. 
Since y is compact, it possesses an open neighbourhood Uo whose closure Uo is 
compact and satisfies Up C WU. Let 


B:=min{ f(m):m € Uo \ Uo}, V := {me Uo: f(m) < B}. 


Since Up \ Uo is compact and since f does not vanish there, 6 is nonzero. There- 
fore, y C V, so that, by continuity of f, V is an open neighbourhood of y. Since 
V CW, by point | of Definition 3.8.15, Ry x V C J. Letm e€ V andt € Ry. In 
order to show that ®;(m) € U, it suffices to show that ®;(m) € V. Assume, on the 
contrary, that ®,(m) ¢ V. Then, ®;(m) ¢ Uo, because f(®;(m)) < B by point 3 
of the definition. Let fo = inf{s € R+ : ®s(m) ¢ Up}. Since Uo is open, there holds 
to > O and ®,)(m) € Uo \ Uo. Then, f(®,,(m)) = B > f(m), in contradiction to 
point 3. Hence, &;(m) € V and y is stable. 

Next, assume that f is a strong Lyapunov function. As was just shown, then y is 
stable. Construct Uo, B and V as above, choosing U = W. We show that V is a basin 
of attraction for y. As noted above, Ri x V C Y. Let m € V. Since ®;(m) € V 
for all t € Ry and since V is compact, the sequence {®,(m)}, n = 1,2,..., has 
a cluster point mo in V. Hence, there exists a strictly increasing sequence {nx}, 
k=1,2,..., in N such that ®,,(m) — mo for k > oo. By continuity of f, then 
Ff (Pn, (m)) > f (mo). Moreover, for t € (0, 1), ®4n, (m0) > ®; (mo) and, accord- 
ingly, f(®r4n,(m)) > f(@;(mo)). Since ng < t + ny <ng4 for all k, property 3 
implies 

im f(Pn,(m)) = lim f(Pr4n(m)) = lim f (Pn, (m)) 
and hence f (mo) = f(®:(mo)). By the strict inequality in property 3, this im- 


plies mo € y. Thus, for any neighbourhood U of y, there exists fo € R+ such that 
@,(m) € U for all t > fo. 
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Further functional criteria for stability can be found in [44, Ch. V]. In a similar 
way, functional criteria for instability can be formulated. Functions whose existence 
implies instability are usually referred to as Chetaev functions. 


Example 3.8.17 


1. Consider the harmonic oscillator in one dimension with frequency @, linear fric- 
tion coefficient a > 0, elongation x and velocity y, modelled by the vector field 
(3.6.12) on R?, see Exercise 3.6.9. Let 


fy) i= 50" + wx’), 


This corresponds to the mechanical energy per unit mass of the oscillator. There 
holds f (0,0) =0 and f(x, y) > 0 for all (x, y) 4 (0, 0). Moreover, 


Xx, (f) = —2ay”. 


Thus, in case a = 0, f is constant along the integral curves of X and hence a 
Lyapunov function. In case a > 0, f is strictly decreasing and hence a strong 
Lyapunov function. Hence, the origin is stable for aw = 0 and asymptotically sta- 
ble for a > 0. 

2. Consider the flow given by the suspension with period T of an automorphism 
a of a finite-dimensional real vector space V. Choose a scalar product on V 
adapted to a as in the proof of Proposition 3.8.9 and let || - || denote the corre- 
sponding norm. Define a function f : V x R— R by 

fod := (T= (rE) Ja @| +(e Ea) fa @], 
where [t] denotes the largest integer multiple of T which is smaller than or equal 
to t. One can show that f descends to a Lyapunov function for the central integral 
curve y of the suspension if spec(a) = spec, (a) U spec4 (a) and to a strong Lya- 
punov function if spec(a) = spec; (a) (Exercise 3.8.6). Then, Theorem 3.8.16 
implies that y is stable in the first case and asymptotically stable in the second 


case. This result was used in the proof of Corollary 3.8.10/2, where it was ob- 
tained from Proposition 3.8.5 and Example 3.7.7/2. 


Remark 3.8.18 Let us discuss the relation between the concept of orbital stability 
presented here and the concept of pointwise stability which is usually referred to as 
Lyapunov stability. Let X be a vector field on M with flow ®: YJ — M and let p 
be a metric on M compatible with the topology. A point mo € M is called 


1. Lyapunov stable under @ if for every ¢ > 0 there exists 5 > 0 such that, for all 
mé€M andt €R+, o(m, mo) < 6 implies p(®;(m), B,(mo)) < €, 

2. asymptotically Lyapunov stable if there exists 6 > 0 such that, for all m € M, 
p(m,mo) <6 implies p(®;(m), ®;(mg)) > 0 for t > oo. 


While orbital stability is a topological concept, Lyapunov stability, in general, 
depends on the choice of the metric o. For the points of a critical integral curve, 
however, it is independent of this choice (Exercise 3.8.8). Using the characterization 
of stability in terms of p in Remark 3.8.3/2 one finds that 
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1. an equilibrium is (asymptotically) stable iff the corresponding equilibrium point 
is (asymptotically) Lyapunov stable, 

2. a periodic integral curve y is stable if all of its points are Lyapunov stable (Ex- 
ercise 3.8.9). 


The converse of the second statement does not hold in general. A counterexam- 
ple is given by the planar pendulum, where every periodic integral curve has another 
period, so that points which are arbitrarily close will separate under time evolution. 
Moreover, the second statement does not carry over to asymptotic (Lyapunov) sta- 
bility. In fact, a point of a periodic integral curve cannot be asymptotically Lyapunov 
stable, because the distance to any other point of this curve is the same after one pe- 
riod. However, here the following converse version of the second statement holds. If 
a periodic integral curve y is asymptotically stable and if V is a basin of attraction 
for y, then for every m € V there exists mg € y such that p(®,(m), ®;(mo)) > 0 
for t > o~, see [9, §1.5.4]. 


Remark 3.8.19 The concept of stability of a subset of M@ under the flow of a vector 
field should be clearly distinguished from the concept of structural stability. A vector 
field X is said to be structurally stable within a family 2° of vector fields and relative 
to a certain property (P) if there is a neighbourhood*® of X in .2” whose members 
have property (P). The use of this concept is that property (P) remains valid under 
small perturbations of the vector field. Analogously, there is a concept of structural 
stability for local diffeomorphisms. 


Exercises 

3.8.1 Prove the following. 

(a) If X isa vector field on M with flow @: Y— M andif A C M isacom- 
pact subset, then R x A C Y and there exists an open neighbourhood U 
of A in M and a complete vector field X on M such that X tu = Xu. 

(b) If g is a local diffeomorphism of M and if A is a compact subset of the 
domain of g, there exists an open neighbourhood U of A and a (global) 
diffeomorphism @ of M such that gry = gu. 

3.8.2 Show that a subset A of M which is stable or asymptotically stable under 
the flow of a vector field on M remains stable if the vector field is modi- 
fied outside a neighbourhood of A. Prove the analogous assertion for local 
diffeomorphisms. 

3.8.3 Prove the stability criteria in terms of a metric stated in Remark 3.8.3/2. 

3.8.4 Complete the proof of Lemma 3.8.7/1. 

3.8.5 Carry out the necessary calculations for Example 3.8.14. 

Hints. Use Remark 3.6.1 1/3 to determine the characteristic multipliers of the 

periodic integral curve y. Use polar coordinates to show that, for the initial 

condition r(0) > 0, r(t) converges to the following limits when t > oo. 


36This requires to have a topology on .2”. Most often, the family 2° is defined by certain parame- 
ters, and the topology is thus inherited from the space of parameters. 
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(a) If (s — 1) f(s?) <0 for all s £0, 1, then r(t) > 1. 
(b) If (s — 1) f(s?) > 0 for all s 4 0, 1, then r(t) > 0 for r(0) < 1 and 
r(t) > oo for r(O) > 1. 

3.8.6 Show that the function f defined in Example 3.8.17/2 descends to a (strong) 
Lyapunov function on the suspension. 

3.8.7 Let M =R? and X = yd, —x(1+xy)dy. Use the radius function f(x, y) = 
x? + y* to show that the origin is asymptotically stable. In addition, deter- 
mine the characteristic exponents. (This illustrates once more that asymp- 
totic stability does not imply linear asymptotic stability, see also Exam- 
ple 3.8.14.) 

3.8.8 Show that for the points of a critical integral curve, due to compactness, the 
definition of (asymptotic) Lyapunov stability does not depend on the choice 
of metric. 

3.8.9 Show that a periodic integral curve is stable provided all of its points are 
Lyapunov stable. 

3.8.10 Prove that linear vector fields on IR* generated by endomorphisms with 
purely imaginary eigenvalues are topologically conjugate if and only if the 
eigenvalues coincide. Use this and Remark 3.8.13/2 to classify the linear 
vector fields on R* by topological equivalence. As an application, assign to 
each of the linear vector fields of Example 3.6.13 its topological equivalence 
class. 


3.9 Invariant Manifolds 


In the preceding section we have studied the stability properties of a critical integral 
curve y with respect to the flow on a full neighbourhood U of y. However, in the 
case where y is unstable, it is not quite satisfactory to merely know that there exist 
points in U escaping from U under time evolution. There might as well be points 
staying in U or approaching y. Therefore, in this section, we are going to refine the 
preceding analysis by constructing submanifolds about y whose points behave in a 
certain distinguished way under the flow. The starting point of the construction is 
the decomposition of the tangent space at m € y, given by (3.8.4) and (3.8.5): 


TinM = Ep (Hessm(X)) ® Eo(Hessm(X)) @ Eg (Hessm(X)) (3.9.1) 
if y = {m} is an equilibrium and 
TnM = E; ((@r),,) ® E1((@r)\,) ® Ef (Prin) (3.9.2) 


if y is periodic. One first establishes the corresponding submanifold structure lo- 
cally around y and then extends it globally by means of the flow. Since the local part 
is intricate and would go beyond the scope of this book, we refer to the literature 
here. In this section, submanifolds are viewed as subsets, irrespective of whether 
they are embedded or not, and are allowed to be of class C with k = 1,2,...,00. 

Let X be a vector field on M with flow ®: Y— M and let y be acritical integral 
curve of X. Recall that a submanifold S is said to be invariant under ® if };(m) € S$ 
for allm € S andt € Dy. 
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Definition 3.9.1 (Invariant and local invariant manifold) A C*-submanifold of M 
is called an invariant manifold for y if it contains y and if it is invariant under @. 
It is called a local invariant manifold for y if it is embedded, contains y and if X is 
tangent to it. 


Remark 3.9.2 An invariant manifold need not be a local invariant manifold, as 
it need not be embedded. However, every invariant manifold for y possesses an 
open submanifold which is a local invariant manifold for y. This follows from Re- 
mark 1.6.13/1. 


Proposition 3.9.3 (Generating invariant manifolds) For every local invariant man- 
ifold So of class C* for y, there exists a unique invariant manifold S of class C* for 
y such that 


1. So is an open submanifold of S, 
2. for every m € S there exists t € Dy such that ®,(m) € So. 


We say that the invariant manifold S is generated by the local invariant man- 
ifold Sg. Note that S need not be embedded; this is the price one has to pay for 
achieving invariance under @. 


Proof Define S := {m € M : there exists t € Y, such that ;(m) € So}. By con- 
struction, S is invariant under ® and satisfies condition 2. To prove the existence 
of a C*-submanifold structure, we show that S satisfies condition (S) of Proposi- 
tion 1.7.1. First, assume that X is complete. Since So satisfies condition (S), there 
exists a countable family {So;} of subsets of Sg covering So and a corresponding 
family of local C k charts {(U;, k;)} on M such that conditions (S1) and (S2) of 
Proposition 1.7.1 hold. For every i and every rational t, denote S; ;:= ®;(So,;) and 
Uj.r := ®,(U;) and define the mapping 


kit: Ui > R", Ki(m) := Kj 0 B_;(m). 


The family {5;,;} is countable. Since So C S and since S is invariant under ®, there 
holds S;,, C S. The subsets S; + cover S: to see this, it suffices to check that for every 
m € § there exists a rational number t € Y,, such that ®,(m) € So. Now, since X 
is tangent to So, Remark 3.2.9/2 implies that if ®;(m) € So, then };(m) € So for 
Ss in some open interval containing ¢. This yields the assertion. Finally, every pair 
(U;i.1, Ki,t) 1S a local C*-chart on M, because Uj, is open and, by restriction, ®_,; 
induces a diffeomorphism U;,, — U;. It remains to check that conditions (S1) and 
(S2) of Proposition 1.7.1 hold for the families S$; and (Uj,r, «i,.). Condition (S1) 
catries over immediately. For condition (S2), let i, 7 and t, s be given. Then, 


Kit Ui, VU j,5) = Ki (Ui N Bs—1(Uj)). 
Since @,_;(U;) is open in M and hence U; 1 ®,_;(U;) is open in U;, the right hand 
side is open in «;(U;). This implies condition (S2). 
To remove the assumption that X be complete, first enlarge the original atlas of 
So by choosing a countable basis of the topology for each chart domain and adding 
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the local charts obtained by restricting the original chart mapping to the elements of 
this basis. Then, combine each local chart (Ug, kg) of this new atlas with rational 
te Olneer, Ym to define subsets Sy of S and local charts (U1, Kar) of M as 
before. We leave it to the reader to check that these data are well defined, cover S$ 
and satisfy conditions (S1) and (S2) (Exercise 3.9.1). This completes the proof of 
existence of the submanifold structure announced. 

To prove uniqueness, let S“, §@ be submanifolds of M satisfying the con- 
ditions 1 and 2. Due to condition 2, S“) = §@ as subsets of M. Thus, to prove 
that S and S®@) coincide as submanifolds, it suffices to show that the identical 
mapping id: S“ + §® is of class C*. By Proposition 1.6.10, it suffices to prove 
continuity, which is equivalent to showing that a neighbourhood in S“!) is also a 
neighbourhood in S$). Thus, let m € S“ and let W be a neighbourhood of m in 
S. Since S is invariant under ©, Remark 3.2.9/2 implies that the restriction of 
@ to (R x S) N F is the flow of the vector field on S induced by X. Hence, the 
local diffeomorphisms ®, of M, t € R, restrict to local diffeomorphisms of S“. Us- 
ing this and condition 1, we find that W may be shrunk so that there exists ¢ € IR with 
®,(W) C So. Since, by condition 1, So carries the relative topology induced from 
SY, @,(W) isa neighbourhood of @;(m) in Sg. Now, the same arguments, applied 
in the converse order to S®), yield that W = @_,(®;(W)) is a neighbourhood of m 
in S®, 


The tangent space of a (local) invariant manifold at m € y is a subspace of T,, M 
which is invariant under the Hessian endomorphism Hess,, (X) in case y is an equi- 
librium or under (@7)/,, in case y is periodic of period T. On the other hand, the 
corresponding decomposition (3.9.1) or (3.9.2) is invariant. Thus, it is natural to dis- 
tinguish (local) invariant manifolds for y whose tangent spaces at m € y correspond 
to a factor or a combination of factors in (3.9.1) or (3.9.2), respectively. 


Definition 3.9.4 A (local) invariant manifold S for y is called a (local) stable, un- 
stable, centre, centre-stable or centre-unstable manifold for y if the following holds. 
In case y = {m} is an equilibrium, T,,,S coincides with, respectively, 


Ej) (A), Eg(A), Eo(A), Ep (A)® Eo(A), Eg (A) @ Eo(A), 


where A = Hess,,(X). In case y is periodic with period 7, for all m € y, TS 
coincides with, respectively,*’ 


E,(a)+Tmy, E{@)+Tmy, Ea), E,(@)®E\(a), Ef (a) ® ia), 
where a = (®7)/,,. 
The critical integral curve y of X is also a critical integral curve of the vector 


field —X and a (local) invariant manifold for y relative to the flow of X is also a 
(local) invariant manifold for y relative to the flow of —X. Since on the level of the 


37Since in the periodic case, the subspaces E 1 (Pr)),) do not contain T,,y, the latter has to be 
added here. 
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factors in (3.9.1) and (3.9.2), passing from X to —X amounts to interchanging the 
subspaces E~ and E™, a (local) stable or centre-stable manifold for y relative to 
the flow of X is a (local) unstable or centre-unstable manifold for y relative to the 
flow of —X, and vice versa. Thus, statements about (local) stable and centre-stable 
manifolds for y carry over to (local) unstable and centre-unstable manifolds for y 
by passing to the vector field —X. 


Theorem 3.9.5 (Local existence) For every critical integral curve y, there exist 


1. smooth local stable and unstable manifolds; they are locally unique in the sense 
that the intersection of any two local stable (unstable) manifolds is an open sub- 
manifold of both, 

2. local centre, centre-stable and centre-unstable manifolds of class C* for every 
finite k. 


Proof A complete down-to-earth proof can be found in Appendix C of [2] by 
A. Kelley. It uses a local normal form of X, referred to there as a local pseudochart, 
which is constructed in §25 and §26 of this book. The standard reference nowadays 
is [132], which treats invariant manifolds from a more general perspective though 
and goes far beyond what is needed here. Detailed expositions for the cases of equi- 
libria and hyperbolic periodic integral curves can be found, for example, in [123, 
§IX.6] or [207, Ch. 4]. 


Let us add that from the proof of Theorem 3.9.5 there follow several intersection 
properties of the local invariant manifolds, the most important of which is that the 
local stable, the local unstable and the local centre manifold can be chosen so that 
their mutual intersections coincide with y. 

Extending the local invariant manifolds of Theorem 3.9.5 by means of Propo- 
sition 3.9.3, one obtains the following corollary. We say that m € M converges to 
y under @ if Ry C Y» and if for every neighbourhood U of y in M there exists 
to € R+ such that };(m) € U for all t > fo. 


Corollary 3.9.6 (Existence) For every critical integral curve y, there exists 


1. aunique stable manifold S~ (y) and a unique unstable manifold S* (y) of class 
C® such that every point of S*(y) converges to y under the flow of £X. 

2. a centre, a centre-stable and a centre-unstable manifold of class C* for every 
finite k. 


Proof The second assertion follows immediately from Proposition 3.9.3 and Theo- 
rem 3.9.5. For the first one it suffices to consider the stable manifold. The assertion 
about the unstable manifold follows by passing to —X. First, we prove existence. 
By Theorem 3.9.5, there exists a smooth local stable manifold So for y. Since X 
is tangent to Sg, by Proposition 2.7.16, it induces a vector field Xo on Sg. Let 
©: Z® — Sp denote the flow of Xo. In case y = {m} is an equilibrium of X, it is 
an equilibrium of Xo and Hess, (Xo) is given by the restriction of Hess, (X) to the 
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invariant subspace E, (Hess,,(X)). In case y is periodic of period T under 9, it is 
so under ©° and for every m € y, (ory is given by the restriction of (®7)/,, to the 
invariant subspace Ey ((®r),,). By definition of local stable manifold, this implies 
that y is a hyperbolic critical integral curve of XQ and that it is linearly asymptoti- 
cally stable under °, according to Corollary 3.8.10. Then, Corollary 3.8.12 yields 
that y is asymptotically stable under ©°. This means that there exists an open neigh- 
bourhood So of y in So whose points converge to y under ®°. Since, then, they 
converge to y under @ in M, So is a local stable manifold for y. Define S~(y) 
to be the invariant manifold generated by So in the sense of Proposition 3.9.3. By 
construction, every point of S~ (y) converges to y under @. 

To prove uniqueness, let S~ be a smooth stable manifold for y whose points con- 
verge to y under ®. According to Remark 3.9.2, S~ contains an open submanifold 
S~ which is a local invariant manifold for y. Since ‘t8 ~=T,S” forallmey, 
S~ is a local stable manifold for y. The local uniqueness of local stable manifolds, 
see Theorem 3.9.5/1, implies that Ka So is a local stable manifold for y and an 
open submanifold of S~ and So. Then, it is an open submanifold of S~ and S~ (vy). 
In view of Proposition 3.9.3, this implies S~ = S~ (vy) as submanifolds of M. 


The stable and unstable manifolds S*(yv) are minimal in the sense that every 
stable manifold for y contains S~ (y) and every unstable manifold for y contains 
St (y). As mentioned before, they need not be embedded and also not initial. This 
is illustrated by Example 3.9.9. Moreover, it is clear from Proposition 3.9.3 that any 
two centre manifolds for y coincide if there exists a local centre manifold generating 
both of them in the sense of that proposition. The same is true for centre-stable and 
centre-unstable manifolds. 

The convergence condition of Corollary 3.9.6/1 defines the subsets S¥(y) of 
M only in combination with the property to be a stable or an unstable manifold 
for y. Points outside SF(y) may converge to y under the flow of +X as well, as 
is shown by Examples 3.9.10 and 3.9.12 below. If, however, y is hyperbolic, the 
subsets S*(y) of M can be characterized by this property alone: 


Proposition 3.9.7 [If y is hyperbolic, S*(y) consists of the points of M converging 
to y under the flow of +X. 


The characterization given in this proposition is often taken as the definition of 
the stable and unstable manifolds in the hyperbolic case. Beware that this character- 
izes S¥(y) as subsets of M only, and not as submanifolds. To fix the submanifold 
structure one still has to add the information about the tangent spaces at the points 
of y. This is again illustrated by Example 3.9.9, where S~(y) and St (y) coincide 
as subsets, so that it is only the differentiable structure which distinguishes between 
them. 


Proof As before, it suffices to give the proof for S~ (v7). Denote the set of points of 
M which converge to y under ® by S~. According to Corollary 3.9.6/1, S~(v) C 
S~. To prove the converse inclusion, we first consider the case that y = {m} is 
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an equilibrium. Denote A = Hess,, (X). According to the Grobman-Hartman Theo- 
rem 3.8.11/1, there exist open neighbourhoods U of m in M and V of the origin in 
TmM and a homeomorphism  : U — V such that the flow YU: QY +U of Xtu 
satisfies 


(®,),, oh(m) =ho BY (i) (3.9.3) 


for all (t,m) € ZY. Application of Corollary 3.9.6/1 and Remark 3.9.2 to the flow 
®” implies that there exists a local stable manifold § y for y relative to &” whose 
points converge to m under ©”. That is, they converge to m under ® and their 
integral curves stay in U for t = 0. In particular, $7, C S (y). 

We show that the subset /(S/,) of T;,M is a neighbourhood of 0 in the subspace 
Eo (A). Due to (3.9.3), the points of h(S7,;) converge to 0 under (®,);,,. Using the 


scalar products on Eg (A) provided by Lemma 3.8.8 and invariance of Ej (A) under 
(D;)),, = e’4 one can check that, in the hyperbolic case, a nonzero element of T,, M 
converges to 0 under (@;)/,, iff it is contained in Eo (A). Hence, h(S7,) C Eo (A). 
Since S;, is an embedded submanifold of U, h induces a homeomorphism from 
Sy onto the subset h(S7,) of Eq (A), equipped with the relative topology. Since S7, 
and EF, (A) have the same dimension, the theorem on invariance of domain, in its 
manifold form,°® implies that A(S7,) is open in Ey (A) and hence a neighbourhood 
of 0 in Ep (A), as asserted. 

Now, let m,; € S~. There exists t; such that };(m,) € U for all t > t,. That is, 
mz := ®,,(m 1) converges to m under @”. Then, (3.9.3) implies that h(m2) con- 
verges to 0 under (@,)/,,. As was just noticed, then h(mz) € E (A). Hence, there 
is fg € Ry such that (®,,)/,, (h(mz2)) € h(S7). Then, by (3.9.3) again, Of (m2) — 
Py, 41,(m1) € Sy. Since Sy, C S” (y), it follows that m; € S” (vy). Thus, S™ Cc 
S~ (v). This proves the proposition in the equilibrium case. 

For the periodic case, we extend the terminology concerning convergence to local 
diffeomorphisms. We say that m € M converges to mo under the local diffeomor- 
phism ¢ of M if gk (m) is defined for all k € Z4 and gk (m) — mo for k + oo. Now, 
assume that y is periodic of period T. Choose m € y. According to Theorem 3.7.3, 
there exists a Poincaré mapping (Y, YW, ©) for y at m. Since m is a fixed point 
of the local diffeomorphism © of # which is hyperbolic by Proposition 3.7.6, 
the Grobman-Hartman Theorem 3.8.11/2 yields a homeomorphism / from an open 
neighbourhood of m in Y onto an open neighbourhood V of the origin in Ty, Y 
which conjugates © to @/,. By possibly shrinking Y and W one may assume that 
the domain of h coincides with Y and hence there holds 


Ol, ohtw =ho®. (3.9.4) 


38Let M, N be smooth manifolds of the same dimension and let A C M and B CN be subsets 
which are homeomorphic with respect to the relative topologies induced from M and N, respec- 
tively. Then, if A is open in M, B is open in N. See e.g. [199], Chapter VIII, Theorem 6.6 and 
Exercise 6.5. The reader is encouraged to work out that exercise. 
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Next, choose a neighbourhood U of y in M such that UNA CW.Let @Y : GY > 
U denote the flow of X;y. Let S;, be constructed from U as in the equilibrium case 
and denote Sa = Sy P= Sy W . Ne show that h(Sy) is a neighbourhood 
of 0 in E; (@,,). Due to the following two observations, the argument is completely 
analogous to the equilibrium case. 


1. Repeated application of (3.9.4) yields that the points of h(Sy) converge to 0 

under ©), . 
2. By the definition of Poincaré mapping, “ is transversal to all integral curves 

of X. Since X is tangent to S;,, Y and S7, are transversal. According to Corol- 

lary 1.8.5, then Sa is a submanifold of M; it is embedded, because so are 

FH and Sy, and the dimension formula yields dim Se = dimE 1 (Pry) = 

dim E, (©7,). 

Now let m, € S~. There is t; € R + such that };(m,) € U for all t > t;. One 
may choose ¢; such that m2 := ®;,(m;) € W. Then, m2 converges to m under the 
local diffeomorphism © of Y. Hence, by (3.9.4), h(mz2) converges to 0 under OF. 
Then, h(m2) € E; (@;,) and there is k € Z; such that (0!) (h(m2)) € nS). 
By (.9.4), we obtain OF (m2) € Sis To & there corresponds t2 € R+ such that 
O* (m2) = P1, (m2) = P4412, (m1). Since Si Cc Sy CS (y), we get m, € S“(y). 
Thus, S~ Cc S~ (7) holds also in the periodic case. 


Next, we give examples and discuss some of the phenomena related to invariant 
manifolds. We start with linear vector fields. 


Example 3.9.8 (Linear vector fields) Let V be a finite-dimensional real vector space, 
let A € End(V) and let X be the linear vector field on V generated by A. Then, 
S*({0}) = Ef (A). The subspaces Eo(A), Ey (A) ® Eo(A) and EG (A) ® Eo(A) 
yield, respectively, a centre, a centre-stable and a centre-unstable manifold for the 
origin. 


Let us continue with the hyperbolic case. 


Example 3.9.9 (Planar pendulum) As an example of the stable and unstable mani- 
folds of a hyperbolic equilibrium, consider the upper equilibrium y = {m,,} of the 
planar pendulum of Example 3.6.14. Let ~5 and yz) denote the two non-critical 
integral curves, see Fig. 3.2(a). Both S*(y) and S~(y) consist of y, Yo and yy. 
Hence, as subsets of M, they coincide with one another and with the separatrix, 
which forms a figure eight. The submanifold structures, on the other hand, are dis- 
tinct, because S*(y) has tangent space Et Hess, (X)) at m,. Thus, S~ (yv) and 
S* (y) are given by the separatrix, equipped with either one of the two inequivalent 
figure eight submanifold structures, cf. Example 1.6.6/2. This illustrates that S¥(y) 
need neither be embedded nor initial and that their differentiable structure is quite 
important, because it is the latter which distinguishes between S~ (vy) and ST (vy) 
here. 
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As Example 3.9.9 shows, the intersection S* (vy) S~ (vy) may contain more than 
just y. By invariance under ®, if the complement (S*(yv)MS7 (y)) \ y is nonempty, 
it consists of maximal integral curves. These are called homoclinic. Thus, the in- 
tegral curves y and ye) of Example 3.9.9 are homoclinic. Similarly, if y, and 
y2 are two distinct critical integral curves the intersections St (1) M S~ (y2) and 
S~ (v1) A S* (2) are invariant under @ as well. If these intersections are nonempty, 
the maximal integral curves contained are called heteroclinic. Examples for het- 
eroclinic integral curves can be obtained from 5 and ye) of Example 3.9.9 by 
replacing the angle variable of the planar pendulum by an ordinary Cartesian coor- 
dinate, thus producing a 27r-periodic vector field in the plane. Physically, this can 
be realized for example by adding a counter of rotations to the planar pendulum. To 
summarize, homoclinic integral curves connect a critical integral curve with itself, 
whereas heteroclinic integral curves connect different critical integral curves with 
one another. 


Example 3.9.10 (Modified harmonic oscillator) As an example of the stable and 
unstable manifolds of a hyperbolic periodic integral curve, consider 


x= (y+x(1 a y?)) ay + (—x + y(1 a y?)) ay + zd, 


on M =R’. This is just the modified harmonic oscillator of Example 3.6.15 with 
@ = | in the x—y-plane, combined with a one-dimensional linear vector field in 
the z-direction. The equations for the integral curves separate into the equation of 
the modified harmonic oscillator in the variables x and y and the equation z = z. 
Taking the solution of the first one from (3.6.10) we find that X is complete and 
that, in cylindrical coordinates r, @, z, its integral curves are given by 


,(r,$,z)=((l+(r?- le2)-2 g +t, ze’). (3.9.5) 


We read off that the critical integral curves are given by the origin 0 and by the single 
periodic integral curve y, given in coordinates by y(t) = (1, t, 0). Let E denote the 
x—y-plane and let C., C,, C. denote the subsets defined by r < 1,r = 1 andr > 1 
(that is, the open cylinder of radius | centered around the z-axis, its boundary and 
the rest). Analyzing the behaviour of (3.9.5) for t — oo one finds 


S~ ({0}) = {0}, S*({0}) =C., S”(y) = E \ {0}, St(y)=C.. 


All four invariant manifolds are embedded. The whole of E is a stable manifold as 
well, but it does not satisfy the convergence condition of Corollary 3.9.6/1. Since 
S~ ({0}) N St ({0}) = {0} and S~(v) N St(y) = y, there are no homoclinic integral 
curves. On the other hand, while S~ ({0}) N St (y) is empty, we get 


S~(v) 1 S*({0}) = (ENC) \ {0}, 


that is, the open disk of radius 1 in E with the origin removed. Thus, there is a 
continuum of heteroclinic integral curves joining the origin to the periodic integral 
curve y. 


Next, we turn to non-hyperbolic examples and the discussion of centre manifolds. 
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Example 3.9.11 Consider the vector field 
X = yoy — xdy — 20, 


on M = R?. This is the harmonic oscillator (3.6.12) with w = 1 and a = 0 in the 
x—y-plane, combined with a one-dimensional linear vector field in the z-direction. 
The integral curves are given in cylindrical coordinates r, ¢, z by 


G1 (r, 6,2) =(r.o+t, ze“). 


The set of critical integral curves consists of the origin and of the periodic integral 
curves y;, r > 0, given in coordinates by y;(t) = (7, wt, 0). Let C; denote the sur- 
face of the cylinder of radius r centered around the z-axis. The stable and unstable 
manifolds are 


S~ ({0}) = z-axis, S* ({0}) = {0}, S™ (Yr) = Cr, St) = yr. 


There are neither homoclinic nor heteroclinic integral curves. The x—y-plane pro- 
vides a centre manifold for all critical integral curves. Similarly, any open disk cen- 
tered at the origin in this plane is a centre manifold for the origin and any open 
annulus containing y, is a centre manifold for y,. 


While Example 3.9.11 shows that a centre manifold cannot be made unique by 
requiring convergence properties like for ST (y), any two centre manifolds still in- 
tersect in an open submanifold there, that is, they coincide locally. The next example 
illustrates that it may happen as well that there exist centre manifolds which do not 
coincide locally. 


Example 3.9.12 Consider the vector field 
X =x7d, — ydy 


on M = R’. This is Example 3.2.7/2, combined with a linear vector field in one 
dimension. According to that example, X is not complete and the flow is given by 


Q= {( (x,y))ERx M:t< “|. @,(x, y) = ( ye"), (3.9.6) 


x 
1—tx 


see Fig. 3.3. The only critical integral curve is the origin. The Hessian is 
0 0 
Hesso(X) = E ai 


hence Ep, (Hesso(X)) is given by the y-axis, Eg(Hesso(X)) coincides with the 
x-axis and Eg (Hesso(X)) = {0}. It follows that S~ ({0}) is given by the y-axis, 
whereas S* ({0}) = {0}. Since all the points of the closed half-plane x < 0 converge 
to the origin under @, this illustrates that Proposition 3.9.7 fails to hold in the non- 
hyperbolic case. Next, let us look for centre manifolds. In the open half-plane x > 0, 
any centre manifold must coincide with the positive x-axis. In the open half-plane 
x <0, by eliminating ¢ either from the differential equations for the flow or from 
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Fig. 3.3. The flow of 
Example 3.9.12 
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(3.9.6) one finds that the images of integral curves coincide with the graphs of the 
functions 


1 
y=fu@)=mer, MER. 
Since derivatives of f,, with respect to x to arbitrary order tend to zero as x > 0 
from below, f,, can be extended to a smooth function on R by letting f,,(x) = 0 
for x > 0. According to Example 1.6.12/2, the graph of this function is a smooth 
submanifold. Due to Lie (0) = 0, the tangent space of this submanifold at x = 0 
coincides with the x-axis and hence with Eg(Hessg(X)). Thus, all of these subman- 
ifolds are center manifolds, but neither two of them coincide locally, that is, the 
equilibrium is not an inner point of the intersection. 


While the centre manifolds of Examples 3.9.11 and 3.9.12 are all smooth, this 
need not be so in general. For counter-examples, see e.g. [296] or [285]. Of course, 
the above statements about non-uniqueness and non-smoothness of (local) centre 
manifolds carry over to centre-stable and centre-unstable manifolds. 

Let us, furthermore, mention that there exist approximation methods for a sys- 
tematic construction of centre manifolds, see for example [59] or the second part of 
Sect. 3.2 in [114]. 


Remark 3.9.13 (Reduction to the centre manifold) Using centre manifolds, the 
Grobman-Hartman Theorem can be generalized to arbitrary equilibria as follows. 
Let y = {m} be an equilibrium of X. Let X* denote the linear vector fields in- 
duced by Hess,,(X) on Eg (Hess (X)). For every k = 1,2,3,..., there exists a 
centre manifold S° for y of class C*, as well as open neighbourhoods U of m in 


M, V* of the origin in Ei (Hessm(X)) and V®° of m in S° such that the flow of 


X jy is topologically conjugate to the flow of the vector field (X hwo? xX yo? x ee 


on V~ x V° x V+, where X° denotes the vector field on S° induced by restriction 
of X. See [250, 271] or [17, §32] for a proof and [207, $4.2] for a more detailed 
discussion. There holds an analogous statement for fixed points of local diffeomor- 
phisms [131, Thm. 7.3] and for periodic integral curves [250]. 

Thus, for the qualitative analysis of a flow in the vicinity of a critical integral 
curve it suffices to study the linearization of the flows induced on the minimal sta- 
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ble and unstable manifolds S*(y) and the flow induced on an appropriate centre 
manifold. As already mentioned in Example 3.8.14, the latter flow is in general not 
topologically conjugate to its linearization. 


To conclude this section, let us mention that invariant manifolds for fixed points 
of local diffeomorphisms can be treated in a completely analogous way. For general- 
izations of the concept of stable, unstable and centre manifold from critical integral 
curves to certain classes of submanifolds, see [132]. A well-organized guide to all 
of this, with hints for further reading, is [257]. 


Exercises 

3.9.1 Complete the proof of the existence of a smooth submanifold structure on 
the subset generated by a local invariant manifold in Proposition 3.9.3 for the 
case of a non-complete vector field. 

3.9.2 Let y be a periodic integral curve of period T of the flow ®. Show that the 
subspaces 


EF (ry) Eo((@ry) EF (rq) ® Eo((@rIy)» me y. 


combine to smooth vector subbundles of Ty which are invariant under the 
linearized flow. Reformulate the definition of stable, unstable, centre, centre- 
stable and centre-unstable manifold for y in terms of these subbundles. 

3.9.3 Determine the flow of the vector field X = —xd, + (y+ x)dy on R? and use 
this to find the stable and unstable manifolds of the origin. 

3.9.4 Consider the vector field X = yd, + (x — x)ay on R*. This models a one- 
dimensional quartic oscillator (Mexican-hat potential). The associated second 
order differential equation is a special version of the autonomous Duffing 
equation. 

(a) Find the equilibrium points and their characteristic exponents. 
(b) Show that H(x, y) = sy" _ 5x7 + qx is a first integral of X, that is, 
there holds X(H) = 0. 
(c) Use H to determine the stable and unstable manifolds of the origin. Com- 
pare the result with that for the upper equilibrium of the planar pendulum 
in Example 3.9.9. 
3.9.5 Carry out an analysis similar to Exercise 3.9.4 for 


2 1 1 2 
a H(x, y)=—y* + —(V¥14 x2 —2)°. 
re ) » @ Was + 5(vi+x?—2) 


This models a particle which moves without friction along a rod and is at- 
tached to a fixed point outside the rod by a spring, whose equilibrium length 
is twice the distance of the fixed point from the rod. 


Kaya tx( 


Chapter 4 
Differential Forms 


We first present the elementary calculus of differential forms, including the calculus 
of integration and a discussion of integral invariants. Then, in Sect. 4.3, we give 
an introduction to de Rham cohomology. Next, in Sects. 4.4 and 4.5, we present 
some elements of Riemannian geometry, discuss Hodge duality in detail and show 
how classical vector analysis can be understood in a coordinate-free way using the 
language of differential forms. In Sect. 4.6, we apply this framework to classical 
Maxwell electrodynamics. In Sect. 4.7, we give an introduction to the theory of 
Pfaffian systems and differential ideals. In particular, we derive an equivalent for- 
mulation of the classical Frobenius Theorem. Finally, we apply these notions to 
classical mechanics with constraints. 


4.1 Basics 


Recall the following notions from Sect. 2.5. A differential k-form on a manifold 
M is a section in the vector bundle A‘T*M. We write 2*(M) for the space of 
differential k-forms and 


2*(M) = r(A TM) - @ 2*(M) 
k=0 


for the exterior algebra. Obviously, 2°(M ) = C%(M) and 2*(M )=0 fork > 
dim M. The exterior product A: Q*(M) x 2!(M) > 2**'(M) is given by 


(a A B)(Xq,..., Xk40) 


1 : 
= on > sign(7)a(Xz(1),---, Xa )B Xana ys «+s Xa(key)s 


TESK4I 


where w € 2*(M), B € Q'(M) and X; € X(M), cf. (2.4.17). The natural pairing 
of k-forms with k-vectors from multilinear algebra induces a C®-valued pairing of 
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k-differential forms with k-vector fields, given by 
(ay A+++ Aa, X1 A+++ A Xx) = det(aj (Xj) (4.1.1) 
for all a; € 2!(M) and X; € X(M). Then, by the definition of the exterior product, 
a(X1,..., Xn) = (a, X1 A+++ A XK) 


for alla é Q*(M) and X; € X(M). For k > r, we define the operation of inner 
multiplication of an r-vector field X with a k-differential form a by 


(Xia,Y):=(a,XAY), YexX"(M). (4.1.2) 
This operation is C°(M)-linear in both arguments. 


Let us describe the above structures in a local chart (U,«) on M. (U, x) induces 
local frames {0;} and {dx'} of TM and T*M, respectively, and by Example 2.4.5, 
the induced local frames in /\’ TM and /\’ T*M consist, respectively, of the local 
sections 


Oi, A+++ A O;,, dk" A---Adk", 1<i, <::: <i, <n. 


It is common to use the following condensed notation. For a subset J C {1,...,n} 
of r elements define 


Oy = 8), A+ AG, dk! = dk A--- Adk', 


where i;,...,4, denote the elements of J, ordered by magnitude, that is, J = 
{i,,...,i,-} and 1; <--- <i,. Thus, the local frames under consideration consist, 
respectively, of the local sections 0; and d«!, where J runs through the subsets of 
{1,...,} of cardinality r. In particular, by an extension of the summation conven- 
tion, pairs of capital indices 7 are summed over those subsets. We have 


(dic? , a4) = 44, (4.1.3) 
dx! A dk! = pyz,sdx! 4 Bing, (4.1.4) 
df adie! = py py sde! Byus1, (4.1.5) 


where 55 = 1if J =J and 0 otherwise, and o;, 7 = (—1)4%, where q is the number 
of pairs (i, 7) € J x J with i > 7 (Exercise 4.1.1). Then, locally, a € 2"(M) is 
represented by 


ary ade’ = So aiid A--- Ade’, i,j, = O84... 94,), (4.1.6) 
ij <:+<i; 
cf. (2.4.21), and r-vector fields X are represented by 
X=) ) XH = » MG. Aer Awin (4.1.7) 
J 


i] <:++<i, 
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As a consequence, the operations of exterior product and inner multiplication take 
the local form 


(@AB)u =a ABlU= >>) ar Bypr,sdx'. (4.1.8) 
I JnM=2 
(Xae) ty = Xpu sep = >) 0 aX! py,nsdx!. (4.1.9) 
I Jcl 
Remark 4.1.1 
1. Equation (4.1.6) can be used to define functions qa;,.;, for all sequences 
ij,...,7,, and it is sometimes more convenient to sum over all these sequences 
rather than just the increasing ones. Since both q;,__;, and dk’! A--- A dk’ are 
antisymmetric under a permutation of i,,...,7,, this produces a factor r!. Hence, 
1 : 
ary = = Mi,...j, dK" A+++ A dk", (4.1.10) 
r! 


(summation convention).! Let m € U. By an appropriate extension of the func- 


tions a;,..i, and ct from a smaller neighbourhood U c U of m to smooth func- 
tions @j,..i, and x’ on M which vanish outside U, we obtain 
L pe ~ i ~ ip 
mg = 5 (Hii de A+ Adk') 5. (4.1.11) 


Thus, locally, every k-form is a sum of k-forms of the type fod/fi A---Adfx 
with f; Ee C°(M). 

2. We determine the transformation laws for the local frames and for the corre- 
sponding coefficient functions under a change of local chart. Thus, let (V, p) be 
another local chart on M and let us denote 


A= [(e on!) ox], Ai = [(<op)' op], 
and 
Aly:= So signr)Ait Ale, A= sign(r)A'* ey ear 
weS; mweES, 


for 7 = {i,,...,i-} and J ={ji,..., j-}. Then, over UN V the following formu- 
lae hold (Exercise 4.1.2):? 


ar = Ava, do? = Aldc’, (4.1.12) 


'We caution the reader that for r > 2 this sum runs over a linearly dependent system of sections. 
In particular, the left hand side does not determine the coefficient functions on the right hand side 
uniquely. There is however a unique choice if we limit our attention to functions which are totally 
antisymmetric in the indices i;,...,i,, and this choice is given by (4.1.6). 

2One may also write, for example, do!! A --- A do’ = Ai - A dk/l A--» A dk, keeping in 
mind that on the right hand side the sum runs over a linearly dependent system rather than a basis. 
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and (4.1.6) and (4.1.7) imply 

a = Ajak, xed = Ales, (4.1.13) 

Next, let M and N be manifolds and let g : M — N be a smooth mapping. 


According to Example 2.4.5, g induces a linear mapping g* : Q*(N) > Q*(M), 
called the pull-back, by 


(pa) (X1,---, Xr) = agen) (G(X), +++ Gy (Xr) (4.1.14) 


where a € (2"(N) and X; € T,,M. This mapping is an algebra homomorphism, that 
1S, 


y*(a A B)= (y*a) A (v*B) (4.1.15) 
for all a, B € 2*(M). If g is a diffeomorphism, (4.1.14) entails 


(y*a)(X1,..., Xr) = 9" (@(PeX1,..., Oe Xr) (4.1.16) 


for all X; € X(M). In local charts (U,«) on M and (V, pe) on N, the pull-back is 
represented by 


K(m)), (4.1.17) 


P dyil dyJé 
(oa); (rm) = a... (YOM) (elm)... = 


where y/ = p/ ogox~! (Exercise 4.1.3). 


After these introductory remarks, we turn to the discussion of the exterior deriva- 
tive. 


Definition 4.1.2 (Exterior derivative) A system of linear mappings 
d: 2*(M) > 2**1(M), k=0,1,..., 


is called an exterior derivative on M if 


1. dod=0, 
2. d(a A B)=da A B + (—1ka A d£ for all a € 2*(M) and B € 2*(M), 
3. (df, X) = X(f) for all f € C©(M) and X € X(M). 


A linear mapping of £2*(M) satisfying condition 2 is called an anti-derivation. 
Condition 3 means that the exterior derivative of a function f coincides with the or- 
dinary differential of f, defined in Example 2.5.1. This yields existence and unique- 
ness for the exterior derivative of functions. Moreover, this implies that taking the 
exterior derivative of a function is a local operation in the following sense: 


(df)tu =d(fpu) (4.1.18) 
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for all f € C°(M) and all open subsets U of M, where on the right hand side, d 
stands for the exterior derivative of functions on the manifold U. Indeed, for ev- 
ery X € X(M) we have X;y € X(U) and (X(f)) tu = X;uCftu), and condition 3 
implies 


(df)tu, Xtu) = (df, X)pu = (X(f)) tu = Xtu (fru) = (d( fru), Xu). 


Lemma 4.1.3 An exterior derivative is a local operator, that is, if differential forms 
a and B coincide on a neighbourhood of a point m € M, then da(m) = dB(m). 


Proof Let U be an open neighbourhood of m such that a;y = yy. There exists 
a smaller neighbourhood U CU of m and a smooth function f on M such that 
fa = land ftm\v =0. Then, fa = fB on M and hence, by property 2, 


df Aa+ fda=df A B+ fap. (4.1.19) 


Since df (m) = 0 (by property 3) and f(m) = 1, (4.1.19) yields the assertion. 


Theorem 4.1.4 (Existence and uniqueness) On every manifold M there exists a 
unique exterior derivative. It satisfies 


(de) ty =) (akorr) de? A de! (4.1.20) 
I,p 


forallae 92*(M) and all local charts (U,«) on M. 


Proof First, we show existence. Let a € 9Q*(M). We choose a local chart (U, «) 
and define da pointwise on U by (4.1.20). We must show that this definition does 
not depend on the choice of the local chart. Thus, let (U, «) be another chart. We 


may assume U = U. A simple calculation using the transformation laws (4.1.12) 
and (4.1.13) yield 


S\(aker)dk? A de’ = S“(dSar) de? A dk’. (4.1.21) 
I,p Ip 


We leave it to the reader to check that the system of linear mappings d so defined 
is an exterior derivative (Exercise 4.1.5). Next, we prove uniqueness. Let d and 
d’ be two exterior derivatives on M and let a € 2*(M ). We show that (da), = 
(d'a)m for all m € M. By Formula (4.1.11), there exists a neighbourhood of m 
where a coincides with a sum of terms of the form fod fi A--- A dfx. Hence, by 
Lemma 4.1.3, it suffices to show that 


d(fod fi A--- Ad fx)(m) = d'(fodfi A ++» Ad fx)(m). 


This follows from the uniqueness of the exterior derivative on functions and from 
the defining properties | and 2. 
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Remark 4.1.5 By Lemma 4.1.3 and by the uniqueness of d, for every open subset 
U CM and every k-form a, we have 


(da) uy = d(a@ju). 
The exterior derivative can be characterized in a coordinate-free way as follows. 


Proposition 4.1.6 For a € 2*(M) and X,,...,X, € X(M), 


k 


' Xj 
da(Xo,..., Xx) = Ge. (a(Xo, .Y., Xe)) 
i=0 
oe X;Xj 
+ GD Ma([Xi, Xj], Xo... Xk), (4.1.22) 
i<j 


Xj 
where Y means that Xj; is omitted. 


Proof First, we show that both sides of the above equation are f-linear with respect 
to every X;. For the left hand side, this is obvious. Denoting the two terms on the 
right hand side by 7;(Xo,..., Xx), i = 1,2, and using the derivation property we 
obtain 


; Xj 
Ty (Xo, ..-5 fX1,-.«, Xz) =D (-1)' Xi (a(Xo, ..., $1. Xe) 
iAl 
1 Bu 


: Xj 
=) \[(-1)'Xi(f)a(Xo, ..., Xr, .1., Xe) 
iAl 


, Xj 

+ (-1)! f Xi (a(Xo,...,X1,.%., X0)] 
1 a 

+ (-1)' fXi(a(Xo, -Y., Xx) 


: Xj 
= fT) (Xo,..., Xe) + ))(-1)' Xi(P)a(Xo, 7, Xe), 
iAl 


and, using (3.1.1), 
(Xp, sf Xin = Y) Clo, B5), Moy FX lho Me) 


i<j 


i,jAl 
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Xj X) 


+ (Di Ha((X:, fX11, Xo, 1%, Xx) 
i<l 
, XX; 
+d! a([fX1, Xj], Xo, .1.%., Xx) 
l<j 
X; X; 
=f Yo CDi a([X;, Xf], X00 0-5 Xt Le Xe) 
ee 
: X; X) 
+ Sd" fo([Xi, Xi], Xo, Ys Xk) 
i<l 
; X; XI 
+ EDIT Xi (fa(Xr, Xo, 1%, Xt) 
i<l 
ver X1 Xj 
+S °1!4) fo ([X1, Xj], Xo, 1. %., Xe) 
l<j 
ee X1 Xj 
— Sen! X;(fya(X1, Xo, 1... Xx). 
l<j 


The first, the second and the fourth term on the right hand side give 
fTo(Xo,..., Xx). In the remaining two terms we bring the vector fields X; to their 
original position. This yields a factor (—1)!~! in the third term and a factor (—1)! 
in the fifth term. Thus, 


. Xj 
T2(Xo,..., fX1,..., Xk) = fT2(Xo,..., Xe) — ) (DXi (P)a(Xo, .., Xe), 
iAl 
and the right hand side of (4.1.22) is f-linear in every vector field X ;, indeed. 
Now, since d, the derivations X; of C°°(M) and the commutator of vector fields 
are local operations, it is enough to show that the assertion holds for the case X; = 


di , where (U, «) is an arbitrary local chart and jo <--- < jx. In this case, the right 
hand side reads 


aj, 


xe Diat (a(a%,.¥., 38) Eye Ce ae a nee 


i=0 


and for the left hand side we obtain, using (4.1.20), 


de oe090e) = (Daparae’ Ade! (8-59, 


pl 
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i=0 


This completes the proof. 


The next proposition shows that the exterior derivative commutes with the pull- 
back operation. 


Proposition 4.1.7 For a smooth mapping 9: M > N, 
d(g*a) = g* (da), we Q*(N). (4.1.23) 


Proof Due to Remarks 4.1.1/1 and 4.1.5, and since y* (@;y) = (y*a@) to!) for all 
open subsets U of N, it suffices to prove the assertion fora = fodf, A---Adfx, 
where fo,..., f% are arbitrary smooth functions on M. Formula (4.1.23) holds for 
functions, because 


(40°F) a» Xm) = Xm (G"F) = (9 Xm) FY = (CAP ecm), @ Xm) = ((P"AF) ys Xm) 


for all m € M and Xy, € TM. Using this and (4.1.15), we obtain 


d(y* (fod fi A--- Ad fx)) = 9" (d(fod fi A--- Ad fx))- 
Finally, we derive relations for the Lie derivative of differential forms. 


Proposition 4.1.8 For a € 2*(M), f € C~(M) and X € X(M), 


La = X sda + d(X), (4.1.24) 

Lx (da) = d( Lye), (4.1.25) 
Lx (X10) = X (Lye), (4.1.26) 
Lyya = f Lyat+df A (Xa). (4.1.27) 


Using the notation X 1 = ix, the identity (4.1.24) can be written in the form 
Ly =ixod+doiy. 


Proof We evaluate both sides of (4.1.24) on a k-vector field Yj A --- A Yg. For the 
two terms on the right hand side, Proposition 4.1.6 yields 


4.1 Basics 173 


(X ida, ¥y A+++ A Ye) = (da, X AY, A-++ A Yx) 


= X((a, Yi A+++ A Ye)) 
k 


; Yj 
+ S\(-DIY;((a, XA. A ¥x)) 


i=1 
+e 1)'(a, [X,YIANA. Y A Yx) 


Y;Y; 
+> (Cv a, 1, WIA XAYLA YY AY), 


i<l 
k y; 
(d(Xs0), ¥p A+++ A Ye) = YC DitY; ((Xs0, ¥y A... A Ye) 
t=). 


f Y; Yi 
a Ye) { Xa, [Vi MAYA YY AY:) 


i<l 


k y; 
= d- DItHY;((a, XA YLA.Y.A Ye) 


; Y; Yi 
+ edi a, X ALK, YA YA YY AY). 


i<l 


According to Propositions 3.3.2 and 3.3.3, the sum of these terms gives 


k 
X((a, Yy A+++ A Ye) - > (a, Yp A+++ A[X, Yi] A+++ A Ye) = (Lea, YpA-+- AY a). 
l=1 


The proofs of (4.1.25)-(4.1.27) are left to the reader (Exercise 4.1.6). 


For later use, we note the following consequence of (4.1.25). Let X be a time- 
dependent vector field on M with flow  : Y— M. Then, for all w € 2*(M) and 
all (t, t9,m) € J, 

d 

Gy (Pinto) = (Pinto LX) (4.1.28) 
where .Zy, is the Lie derivative with respect to the ordinary vector field X; with 
fixed t. Indeed, in the case aw = f € C“(M), we find 


d d 
a (ote In = af (Pr.t0(m)) = (X1(f))(Gr,1 (m)) = (2; t,to ° Ly, f )(m). 


Since both &F,, and “x, commute with the exterior derivative on M and since both 
sides of (4.1.28) are derivations with respect to the wedge product, the assertion 
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follows. Note that (4.1.28) also holds for time-independent vector fields. In this 
form, it is a special case of (3.3.3). 


We conclude this section by two remarks. 


Remark 4.1.9 (Structure of 2*(M)) 


1. 


By the defining properties 1 and 2 of the exterior derivative, (2*(M), +, A, d) is 
an associative graded commutative differential algebra, called the Cartan algebra. 


. Since d? = 0, the sequence of linear mappings 


0> 2°(m) $ 2k) $..-$ 2"(mM) $0 (4.1.29) 


is a chain complex, called the de Rham complex of M. It contains important 
information about the topological structure of M, see Sect. 4.3. 


. The following generalization of the de Rham complex is an important element 


of noncommutative geometry. Let 2( be a unital associative «-algebra over C. An 
involutive graded differential algebra over 2 is a tuple 


{Ag+ d}, 


where Ax is an 2(-bimodule of the form Ax = Broo Aes with AS. =A, - is 
an 2-bilinear multiplication in A}, fulfilling As, : Ay Cc Aa and d: Ar > 
a is a derivation fulfilling d(A*) = (—1)*(dA)* for any A € Ag Moreover, 
for given e € Endy (2?) with e? = e, € := eA? is a right projective module 
over 2l. In view of the Serre-Swan Theorem [156], finitely generated projective 
modules naturally generalize vector bundles. 


The second remark introduces smooth families of differential forms and differ- 


ential forms with values in a vector space. Both concepts will occasionally be used 
later on. 


Remark 4.1.10 


ils 


A one-parameter family of r-forms {a; : t € [a, b]} on M is said to be smooth if 
the mapping 


r 


[a,b]}x M> [A\T*M, (t,m) a(n), 


is smooth. Given such a family, by pointwise integration is a;(m) dt in TSM, 
m € M, we obtain an r-form on M, denoted by { a;dt. Similarly, by pointwise 
differentiation 4 1, Os (m) in T*, M, we obtain a smooth family of r-forms on M, 
denoted by oly. In this sense, one may omit the point m in the Formulae (3.3.3) 


and (4.1.28). Application of the operations of exterior derivative, pull-back, in- 
ner product or Lie derivative to a smooth family of differential forms yields a 
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smooth family again. Using Proposition 4.1.6, one can show that these opera- 
tions commute with the operations of integration or differentiation with respect 
to the parameter. That is, 


d d 
af at = f dave da ot = —da; (4.1.30) 


and similar formulae with d replaced by y* for any smooth mapping gy: N > M 
or by X or “y for any vector field X on M (Exercise 4.1.7). 

2. Let V be a finite-dimensional K-vector space. A differential r-form on M with 
values in V is a section in the vector bundle whose fibres are the r-forms on 
TinM with values in V, that is, the antisymmetric r-linear mappings 


TmM x---x TnwM—- V. 


We denote the vector space of these forms by 92"(M,V) and we define 
2*(M,V):= @Br=0 92"(M, V). The bundle of r-forms on T,,M with values 
in V can be identified with the tensor product NN’ T*M @ (M x V). Hence, 
92"(M, V) can be identified with the space of sections of this bundle, which 
in turn can be identified with 2"(M) ® V. Accordingly, if {e,} is a basis in V, 
every differential r-form @ with values in V can be written in the form 


a=a" Qe, 


(summation convention), where w“ are uniquely determined ordinary differential 
r-forms and the e, can be interpreted either as elements of V, in which case a is 
viewed as an element of §2’(M) @ V, or as global sections in M x V, in which 
case @ is viewed as a section of N’ T*M @ (M x V). As a consequence, with 
respect to a local chart (U, «) on M, a@ has the local representation 

del A+: A dk" @ eg (4.1.31) 


pees 


with smooth functions ae - gl R. The statements of this section about 
ordinary differential forms carry over in an obvious way to differential forms 
with values in V, except for those which have to do with the exterior product. An 
exterior product exists on 22*(M, V) only if V carries in addition the structure 
of an algebra. If so, the exterior product is defined by (2.4.17). 


Exercises 

4.1.1 Prove Formulae (4.1.4) and (4.1.5). 

4.1.2 Prove the transformation laws given in (4.1.12). 

4.1.3 Prove the local Formula (4.1.17) for the pull-back of differential forms. 

4.1.4 Prove that the operation of inner multiplication is f-linear. 

4.1.5 Complete the proof of Theorem 4.1.4 by verifying that the system of linear 
mappings d defined by (4.1.21) is an exterior derivative. 

4.1.6 Prove Eqs. (4.1.25)-(4.1.27). 

4.1.7 Use Proposition 4.1.6 to prove Formula (4.1.30) as well as analogous formu- 
lae for the operations g*, X41 and Ly. 
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4.2 Integration and Integral Invariants 


The idea of integration of a differential n-form @ on an n-dimensional manifold M@ 
is as follows. We choose an atlas and take the pull-backs of a, weighted with the 
elements of a subordinate partition of unity, by the local chart mappings. This yields 
a family of differential n-forms on open subsets of IR”, which we integrate and sum 


up. 


Definition 4.2.1 (Orientation) Let M be a manifold. An atlas .%y of M is called 
oriented, or an orientation of M, if for every pair of charts (Uj, «,) and (U2, x2) 
belonging to </y and such that U; N U2 # @ one has 


det(k) ok; |)’ >0. (4.2.1) 


A manifold is called orientable if it admits an orientation. It is called oriented if an 
orientation has been chosen. 


Let M be oriented by the atlas ./. A local chart on M is said to be oriented if 
f remains oriented when this chart is added to it. A local frame in TM is said to 
be oriented if the matrix transforming this frame at some point to the frame {0;} of 
some chart in & has positive determinant. 


Remark 4.2.2 The set of ordered bases of a vector space V decomposes into the 
following two equivalence classes: (e€1,...,@,) and (e\, ...,,) are equivalent iff 
the transition matrix A, defined by e; = A;/e;, has positive determinant. Each of 
these two equivalence classes defines an orientation of V. This definition immedi- 
ately carries over to manifolds: choosing an orientation means choosing orientations 
in each tangent space in a smooth way. Indeed, the local frame {d/,..., 0°} induced 
by the local chart (U,«) provides an orientation of the open subset U and (4.2.1) 
yields the consistency condition for the orientations of all charts belonging to the 
chosen atlas. 


Definition 4.2.3 (Volume form) A nowhere-vanishing n-form on an n-dimensional 
manifold is called a volume form. 


Proposition 4.2.4 A manifold is orientable iff it admits a volume form. 


Proof First, assume that M is orientable. Let {(U;, «;)} be a countable, locally finite, 
oriented atlas and let { f;} be a subordinate partition of unity. We define the following 
family of n-forms on M: 


m € Uj; 
fi(m)dk} A+++ Adk mej. 


(Vim t= 
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Since {supp(f;)} is locally finite, v:= )°; v; is a well-defined smooth n-form on M. 
It remains to show that vm 4 0 for all m € M. Choose ig such that m € Uj,. By the 
transformation law (4.1.12) for local frames, in the chart (Uj,, Kig) we have 


a (fio (m) + > film) det(k; 0 ")' (Kip (m)) dr, Av Adkit, 


where the sum runs over all i such that m € Uj. Since fj; > 0 and fj,(m) + 
>= fi(m) = 1, and since the determinants are positive, we conclude v,, 4 0. Con- 
versely, assume that there exists a volume form v. In a local chart (U, «) the n-form 
v is given by 


viu =hdk! A---Adk". 
Since vj, 4 0 for all m € U, we have either h > 0 or h < 0. If h > 0, we leave k as 
it is. If h < 0, we define a new chart by kK = (—x!,«2,...,«"). By this procedure, 
we obtain an atlas with h > 0 in every chart. Then, for any two charts (U,«) and 
(U, x) of this atlas, we have h = det(K ox—!)/h and thus det(K ox!) > 0. 


Example 4.2.5 


1. The vector space R” is orientable. One possible orientation, called the standard 
orientation, is given by the identity chart mapping. A volume form corresponding 
to this orientation is dx! A--- A dx” with x!,..., x” being the standard coordi- 
nates. 

2. The Mobius strip is not orientable (Exercise 4.2.1). 

3. The odd-dimensional real projective spaces are orientable, the even-dimensional 
real projective spaces are not orientable (Exercise 4.2.1). 

4. The level set of a regular value of a differentiable mapping f: R” — R” is 
orientable (Exercise 4.2.1). 

5. Every parallelizable manifold is orientable, because if {X1,..., X,} is a global 
frame in TM and {é,..., &,} is the dual frame in T*M, then | A--- A & isa 
volume form. In particular, Lie groups are orientable, see Chap. 5. 


Let {x'} denote the standard coordinates on R” and let there be chosen the vol- 
ume form dx! A ---A dx". Then, for every n-form a € 2” (IR”) there exists a unique 
function f : R” > R such that a = fdx! A--- A dx”. We define 


/ a= fd’x, (4.2.2) 


provided the integral on the right hand side exists. Here, d”x denotes the Lebesgue 
measure® on R”. 


3For our purposes, the Riemann integral would do as well. 
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Definition 4.2.6 (Integral) Let M be an oriented manifold of dimension n. Let 
{(U;, «;)} be a countable, locally finite and oriented atlas and let {f;} be a sub- 
ordinate partition of lek A differential n-form a@ € §2”(M) is called integrable 
if the integrals i ‘es yk; i (fia) exist and the family of their absolute values is 
summable. If a is integrable, we define 


= —1)* (fia). 4.2.3 
[ie ye )* (fier) (4.2.3) 


L 


One easily shows that the right hand side of Eq. (4.2.3) neither depends on the 
choice of the atlas nor on the choice of the partition of unity (Exercise 4.2.2). Obvi- 
ously, the integral (4.2.3) always exists if M is compact or if a has compact support. 
Sometimes, one wishes to integrate a differential k-form over a k-dimensional sub- 
manifold: 


Definition 4.2.7 (Integral over a submanifold) Let (NV, gy) be a k-dimensional ori- 
ented submanifold of M and let 6 € Q*(M). The integral of 6 over N is defined 


by 
i p= i o*B. (4.2.4) 
N N 


Next, we extend the famous Stokes Theorem of classical calculus to the case 
of manifolds. For that purpose, we need the notion of manifold with boundary. 
Consider the closed half-space R” := {x € R” : x; <0}, equipped with the rela- 
tive topology induced from R”. The boundary of R” is defined to be the closed 
subset JR” = {0} x R”~!. The boundary of an open subset U C R” is defined to 
be JU :=U dR". This is an open subset of dR” =R"~!. Note that, in general, 
aU does not coincide with the topological boundary of U as a subset of R”. 


Definition 4.2.8 Let U Cc R” be open and let x e U. A mapping 6: U > R” 
is smooth at x if there exists an open neighbourhood U of x in R” and a smooth 
mapping ® : U > R” such that Pung = Puna: A diffeomorphism between two 
open subsets of R” is a smooth bijective mapping whose inverse is also smooth. 


Remark 4.2.9 


1. If 6: U > V isa diffeomorphism of open subsets of R” , then (dU) = dV and 
the restriction ®gy : IU — AV is a diffeomorphism of open subsets of Re), 
For x € OU, the first derivative ®’(x) : R" — R" is of the form 


ad) 
aa O «. 0 
dD | AD2 a®2 AYO) 
2 aes 1 
@' (x)= | 9% | 9%2 Xn | ae (4.2.5) 
2]: : 1 
IPDy | IDy dDn 


Ox, | 0x2 “9 OXn 
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Hence, ®'(x) maps the subspace {0} x R”~! and the half-spaces Rt to them- 
selves, respectively. 

2. While a differentiable mapping © : U > R” has many distinct extensions @ : 
U — R” at a boundary point x € U, its derivative ©’ (x) at x is unique: since the 
derivative exists, it suffices to check uniqueness of the directional derivatives. 
The latter follows from the fact that for arbitrary h € R” either h or —h (or both) 
are in R” . 


Now, we carry over the notions of Sect. 1.1 to the present situation by replacing 
the linear space R” in Definitions 1.1.1—1.1.4 by the half-space R” . This way, we 
obtain the notions of 


— topological manifold with boundary, including its dimension, 

— local chart, compatibility of local charts and atlas for a topological manifold with 
boundary, 

— manifold with boundary and differentiable mapping, 

— orientation for a manifold with boundary. 


In addition, by replacing the term manifold by the term manifold with boundary in 
the respective definition, we obtain the notion of partition of unity for a manifold 
with boundary and the notion of vector bundle with boundary, including the notions 
of morphism and section. Next, we construct the tangent space at a point m of a 
manifold M with boundary in the same way as for ordinary manifolds, cf. Sect. 1.4, 
with the following changes. 


— Instead of curves through m and ordinary derivatives in (1.4.2), we take curves 
y : [0,1] — M starting at m and curves y : [—1,0] — M ending at m and use 
one-sided derivatives. 

— For x € R", we define the curve y* in (1.4.3) as a curve starting at x (that is, 
t € [0, 1]) if x € R” and as a curve ending at x (that is, t € [—1, 0]) otherwise. 


This way, the tangent spaces become linear spaces. The construction of the tangent 
bundle and the tangent mapping carries over without change, rendering vector bun- 
dles with boundary and morphisms thereof. The same is true for the various tensor 
bundles. Based on this, the calculus on manifolds carries over to manifolds with 
boundary in an obvious way, including the notions of vector field and flow, tensor 
field, transport and Lie derivative, differential form, pull-back, exterior derivative 
and integral, as well as immersion and submersion, submanifold and vector subbun- 
dle. 

Next, we define the interior and the boundary of a manifold with boundary. As 
a consequence of Remark 4.2.9/1, the transition mappings of local charts of a topo- 
logical manifold with boundary map points in JR” to points in JR” . That is, if for 
m € M there holds «(m) € dR” for some local chart « at m, then this holds for any 
local chart at m. 


Definition 4.2.10 (Boundary and interior) Let M be a manifold with boundary. 
A point m € M is called a boundary point if there exists a local chart (U,«) at m 
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with «(m) € OR”. Otherwise, m is called an interior or inner point. The subset of 
boundary points is denoted by 0M and is called the boundary of M. The subset of 
interior points is denoted by Int(M) and is called the interior of M. 


Example 4.2.11 The closed unit disk D” of IR” is a smooth manifold with bound- 
ary. The interior is the open unit ball and the boundary coincides with the smooth 
manifold S”~! (Exercise 4.2.4).4 


Example 4.2.12 An ordinary manifold is a manifold with boundary. To see this, 
replace the coordinate x; by —e*! in each local chart. For an ordinary manifold, 
0M = © and Int(M) = M. Hence any statement for manifolds with boundary holds, 
in particular, for ordinary manifolds. 


Proposition 4.2.13 Let M be a manifold with boundary and let n = dim M. 


1. Int(M) and 0M are embedded submanifolds of M of dimension n and n — 1, 
respectively. 

2. Int(M) and 0M are ordinary manifolds, that is, 0 Int(M) = @ and 0(0M) = ©. 

3. Every orientation on M induces orientations on Int(M) and on 0M. 


Proof For the first assertion, we construct atlases on Int(M) and 0M by restriction 
of appropriate local charts of an atlas for M (Exercise 4.2.6). Then, the second 
assertion is obvious. For Int(M), the third assertion is obvious as well. For 0M, this 
assertion follows from the fact that the derivative of the transition mapping of two 
induced charts is given by the lower right block of the restriction of the original 
transition mapping to boundary points, given in (4.2.5). 


Now we can extend Stokes’ Theorem of classical integral calculus to manifolds. 


Theorem 4.2.14 (Stokes) Let M be an n-dimensional compact and oriented mani- 
fold with boundary and let 0M be endowed with the induced orientation. Then, for 
every (n — 1)-form B on M, 


i; dp=| B. (4.2.6) 
M aM 


We note that all the integral theorems of classical vector analysis follow from 
Stokes’ Theorem, see Sect. 4.5. 


Proof We choose a finite atlas {(U;, «;)} and a subordinate partition of unity { /;}. 
By finiteness, it is enough to show that ta d( fib) = fang fi B for alli. Thus, we may 
assume that 8 has support in one chart domain U; and that, therefore, the partition 


4Beware that the interior and the boundary of a manifold with boundary are defined by the manifold 
alone, whereas the interior and the boundary of a subset of a topological space are defined with 
respect to the ambient space. By chance, in this example, these two notions coincide. 
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of unity {fj} can be chosen so that fx }suppcs) = 1 and fj psuppcp) = 0 for all i # k. 
Under these assumptions, the calculation of i M df reduces to 


[w= ye B= ‘ pl)’ 


We expand 
(Kc ')"B = do bide! A+. A dx! 1 A dx!th A. A dx” (4.2.7) 


with smooth? functions b; on kx (U;). Then, 
” ab 
d((«-!)*p) = Y\(—y! Sax! a... a dx” 
((k; ) B) D ) ax! x 


and hence by (4.2.2) 


i, dp = rev f Ok ai (4.2.8) 
M l=1 K, 


Ug) Ox! 


Since supp(b;) C «,(supp(B)), the functions b; have compact support in «x, (Ux). 
First, this implies that we can extend them by zero to R” . Second, this implies that 
there exists R > O such that supp(d;) is contained in the interior of the half-cube 
[—R, 0] x [—R, R]"~!, so that we can replace the range of integration on the right 
hand side of (4.2.8) by this half-cube. Integration over x! in the /-th term yields 


/ dp =f (b1(0, x, ...,x") —bi(—R, x7, ...,x"))dx?.. dx” 
M [—R,R]"-! 


+ cot [ Dilx! , .ccpxtt, Rox ns x 
d, eer ( ) 
_ bi(x', hag eRe ae ae”) de dx! dx tl dx” 


-| Dia axe eda ade 
[—R,R]"-! 


By extending the range of integration to R’~! we finally arrive at 


[ w= [ DOs secon" da sda (4.2.9) 
M Rr! 


5In the sense of Definition 4.2.8. 
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Next, we calculate y B- For that purpose, we need the natural inclusion mapping 
t: 0M — M and the mappings 


p: Rk? >R™, D(X], --+5Xn) = (X2,---, Xn); 


j:R™!'SR’, j(X2,...5Xn) = (0, 2, ... Xn). 
To evaluate the integral we use the induced atlas {(U; , k;)} on 0M, defined by U; i= 


U; 10M and k; ‘= p ok; 01, and the induced subordinate partition of unity { fi} of 
aM, defined by fj := fj ov. Since j okj =; 04, Formula (4.2.4) yields 


[s=f ee=f wene=f iter'y'e. 
aM aM kx (Ux) kx Uk) 


Plugging in the expansion (4.2.7) for (Kj, ')*B and using that, due to j*dx! =0, 
only the contribution of / = | survives, we obtain 


/ p= [ Ose czas de 3s eda" 
dM &(Ux) 


As before, b; can be extended by zero to R’~! and the range of integration can be 
replaced by R”~!. Then, comparison with (4.2.9) yields the assertion. 


In the remaining part of this section we discuss integral invariants. They play a 
role in symplectic geometry and in the theory of Hamiltonian systems, see Sect. 9.3. 
For an exhaustive presentation we refer to [181], see also [60]. 


Definition 4.2.15 Let X « X(M). A differential k-form a on M is called 


1. invariant with respect to X if Zyra = 0, 
2. relatively invariant with respect to X if Xda =0, 
3. absolutely invariant with respect to X if Xa =0 and Xda =0. 


Equivalently, w is invariant with respect to X iff the flow ® of X fulfils Oa =a 
for all t. Due to 
L xa = fLyat+df A (Xia), 
a is absolutely invariant with respect to X iff it is invariant with respect to fX for 
all f € C°(M). If a is relatively invariant with respect to X, then “ya is closed. 
Proposition 4.2.16 (Poincaré-Cartan) Let X be a complete vector field on M with 
flow ® and leta € Q*(M). Then, 


1. @ is invariant with respect to X iff 


i (®,09)*a= / pra (4.2.10) 
N N 
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for all oriented compact k-dimensional manifolds N with boundary, for all 
smooth mappings g: N > M and for allt € R, 
2. a@ is relatively invariant with respect to X iff 


/ (D,ogp0l)*a= / (got)*a (4.2.11) 
aN aN 


for all oriented compact (k + 1)-dimensional manifolds N with boundary dN, 
for all smooth mappings p: N — M and for allt € R. Here,.t: 0N — N denotes 
the natural inclusion mapping, 

3. a is absolutely invariant with respect to X iff (4.2.10) holds for the flows of f X 
forall f €C™°(M). 


Accordingly, invariant forms are often called integral invariants. 


Proof 1. If w is invariant, then a = q@ and the statement follows trivially. Con- 
versely, assume that (4.2.10) holds. Then, by assumption, for every embedding 
t: DK > N of the closed unit ball D‘ c R* we have 


i (®oponta= | (gy 01)*a, 
Dk Dé 


because Dé is compact. But the Lebesgue measurable sets A C N are generated by 
balls, that is, for every such set we obtain fa (Do 9)*a= Va y*a and, therefore, 
(®; 0 y)*a = y*a. If now N runs through all k-dimensional manifolds, we can 
conclude Oa = a. 

2. Let @ be relatively invariant. Then, “yda = dYya = 0, that is, daw is invari- 
ant. Using Stokes’ Theorem and point 1, we obtain 


/ @cpon'a= | @oy)'da= [ orda= [ Uogta. 
aN N N aN 


The proof of the converse direction is analogous to point |. 
3. This follows from the remark after Definition 4.2.15. 


Remark 4.2.17 Let a, B € 2*(M) be invariant under X € X(M). Since the opera- 
tions ix and d commute with x and since y is a derivation of the tensor algebra, 
ixa, da anda A £ are invariant with respect to X, too. Thus, the set of X-invariant 
differential forms on M forms a subalgebra of 2*(M) which is closed under iy 
and d. 


Besides the forms which are invariant with respect to a given vector field, it is 
also interesting to study the vector fields leaving invariant a given form. The kernel 
of an antisymmetric multilinear form 7 on a vector space V is defined as 


kern := {v € V: vin =O}. (4.2.12) 
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Definition 4.2.18 Let a ¢ 2*(M). 


1. The subspace F := kera,, M ker(da), of TM is called the characteristic sub- 
space of a at m. 

2. A local vector field X on M with domain U is called characteristic for a if 
Xm € FO for allm €U. 

3. The distribution spanned by the characteristic local vector fields for a is called 
the characteristic distribution of @ and is denoted by D®. 


While the dimension of F;" is locally non-increasing, it may suddenly decrease. 
Accordingly, the subset F* := U,,<y Fe of TM need not establish a distribution 
in the sense of Definition 3.5.1.° Rather, the characteristic distribution D® is the 
maximal distribution on M contained in F®. If dim F® is constant on M, then D® 
and F® coincide, and both are regular distributions. We also note that, by the above 


definition, a vector field X is characteristic for a iff 
Xia =0, X ida = 0, (4.2.13) 
that is, iff @ is absolutely invariant with respect to X. 


Example 4.2.19 Let M = R? with standard coordinates x, y. For a = dx, all char- 
acteristic subspaces F” . coincide with the y-axis. Therefore, F* = D®% and both 


(x,y) 
are regular distributions of rank 1. For a = xdx, Ee is given by the y-axis if 


x,y) 
x #0 and by R? otherwise. In contrast, since the characteristic vector fields are 


continuous, Di y) coincides with the y-axis everywhere. 


Proposition 4.2.20 For every a € Q*(M), the characteristic distribution D® is in- 
tegrable. 


The corresponding foliation of M by the maximal integral manifolds of D® is 
called the characteristic foliation of a. 


Proof We show that D® is homogeneous and apply Theorem 3.5.10. Let 
XE L eeel (M). Since X is characteristic for w, a is invariant with respect to X. 
Hence, 


OX*y =a. (4.2.14) 


Now, let (t,m) € Y*. For Y,, € D%, choose a characteristic vector field Y taking 


m? 
this value at m. Due to (4.2.14), OXY is characteristic for wa. Hence, 


e D® 


(O*)) Ym = (®XY) bX (m)’ 


bX (m) 


Nevertheless, in [181], F% is referred to as the characteristic distribution of a, which is consistent 
with the notion of distribution used there. 
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It follows that (px yj}, DPC DS X(m)" The same argument yields 
t 


a 


xX)! a 
(9x1) 6X my POX on) CDi 


and hence equality. Thus, D® is homogeneous and hence integrable. 


Remark 4.2.21 If dim F* is constant on M and hence D® = F is regular, it is 


enough to prove involutivity: 


[X, Yjaa = (LxYY)sa = Ly (Y sa) — YiLya =0, 
[X, Yada = Lyx yjya=LyLya —- Ly Lya=0 


for arbitrary characteristic vector fields X, Y of a. 


Exercises 

4.2.1 Show the following. 
(a) The M6bius strip is not orientable. 
(b) The real projective space R P” is orientable iff n is odd. 
(c) Level sets of smooth mappings f: R” — R” at regular values are ori- 

entable. 

4.2.2 Show that the right hand side of Eq. (4.2.3) neither depends on the choice of 
the atlas nor on the choice of the partition of unity. 

4.2.3 Prove the statements of Remark 4.2.9/1. 

4.2.4 Prove the statements in Example 4.2.11. 

4.2.5 Show that the closed unit disk D” = {x € R”: ||x|| < 1} is a smooth manifold 
with boundary and find the interior and the boundary. 

4.2.6 Prove Proposition 4.2.13/1. 


4.3 De Rham Cohomology 


In this section we show that the Cartan algebra of differential forms on a manifold 
contains important information about the topology of this manifold. 


Definition 4.3.1 Let M be a manifold. 


1. A k-form a is called closed if daw = 0. The set of closed k-forms is denoted by 
Z*(M). 

2. A k-form q@ is called exact if there exists a (k — 1)-form f such that a = df. In 
this case, 6 is called a potential for aw. The set of exact k-forms is denoted by 
BE(M). 


Both Z‘(M) and B*‘(M) are vector spaces. Since d* = 0, we have B‘(M) c 
Zz (M), that is, every exact form is closed. 
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Definition 4.3.2 Let M be a manifold. 


ile 


2: 


The additive Abelian group underlying the vector space H*(M) := 
Z*(M)/B*(M) is called the k-th de Rham cohomology group of M. 


The numbers dim H'(M) are called the Betti numbers of M and the sum 
n 
x(M) := Yop! dim H'(M) (4.3.1) 
i=0 


is called the Euler characteristic of M. 


Remark 4.3.3 


1. 
2. 


For k > dim M we have Z‘(M) = 0 and thus H*(M) = 0. 
Consider the direct sums 


Z*(M):= QD Z*(M), B*(M) := QD B‘(M). 
k=0 k=0 
For a € Z*(M) and B € Z’(M) we have 
d(a A B) = (da) A B + (—1)*a a (dB) = 0, 


that is, a A B € Z**"(M). Thus, Z*(M) is a subalgebra of Q*(M). For a € 
B*(M) with potential & and for B € Z”(M) we get 


d(& A B) = (da) A B=a A B, 


that is, ~@\ B € B‘+"(M). The same is true for w € Z*(M) and B € B’(M). Thus, 
B*(M) is a two-sided ideal in Z*(M). We conclude that the exterior product 
induces a multiplication in H*(M) := Z*(M)/B*(M), 


[w] UB] :=[a  B], 


called the cup product. This way, H*(M) becomes an associative, graded com- 
mutative algebra with grading 


H*(M) = @ H*(M). 
k=0 


H*(M) is called the cohomology algebra of M and the underlying ring is called 
the cohomology ring of M. 


. Let g: M; — Mp2 be asmooth mapping. Since the pull-back by g commutes with 


the operation of taking the exterior differential, it maps closed forms to closed 
forms and exact forms to exact forms. Since it is linear, it induces a homomor- 
phism of the cohomology groups, denoted by the same symbol: 


y*: H*(M) > H*(M)), y* ([a]) = [o*(@)]. (4.3.2) 
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Equation (4.1.15) implies 


that is, y* is a homomorphism of algebras. Moreover, 


(x 0g)*=g*% 0x", idyy = idy*(m)- (4.3.3) 


Example 4.3.4 


1. 


We compute H°(M). From df = 0 we conclude that the function f must be 
constant on each connected component of M. Thus, f is given by g real numbers, 
where q denotes the number of connected components, and we obtain Z9(M) = 
IR@. Since exact 0-forms do not exist, we have B°(M) = 0 and thus 


H°(M)=R‘. 


. Let M =R. By point 1, we have H°(R) =R. Let us calculate H!(R). Let a € 


@'(R). Then, for dimensional reasons a is closed. Moreover, it is also exact. 
Indeed, writing w(x) = a(x)dx and defining the function f(x) := i a(y)dy, we 
get 


df 
df (x) = ag =a(x)dx =a(x). 


Thus, B!(R) = Z!(R) = 2!(R) and we conclude H!(R) = 0. 


. Let M =S!. By point 1, we have H°(S!) = R. We show that H'(S!) = R. In 


the standard parameterization of S! by x € R, a 1-form @ on S! is given by 
a = a(x)dx, with a(x) = a(x + 277). Since dimS! = 1, a is closed. It is exact 
iff there exists a function f, fulfilling f(x + 27n) = f(x), such thata =df, 
that is, a(x) = se (x). Then, using x € (0, 277) as a coordinate, we obtain 


20 20 df 
[ a= | acoder = | —(x)dx = f(z) — f(0O) =0. 
s! 0 0 dx 
This means that two forms a; and a2 belong to the same cohomology class iff 


fa={ a2. 
s! s! 


Thus, H'(S!) is in one-to-one correspondence with the values of such integrals. 
If we put w = (277) ~'rdx, r ER, then we get Jgi.@ =r. Thus, H\(S)=R. 


. For the Euler characteristics we obtain x (IR) = 1 and x (S!) =0. This shows, in 


particular, that R and S! are topologically distinct, cf. Corollary 4.3.10. 


. For the one-point space {*«} we get 


R k=0 
FAG) E ae 
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Definition 4.3.5 (Smooth homotopy) Let M and N be manifolds. 


1. A smooth homotopy of a smooth mapping g: M — N is a smooth mapping 
F: M x [0,1] > N such that F(-,0) = g. A smooth homotopy of two smooth 
mapping go, ¢1: M — N is a smooth mapping F: M x [0,1] > N such that 
F(¢,i) = @, i = 1,2. If a smooth homotopy exists, gp and g are said to be 
smoothly homotopic. 

2. M and N are called smoothly homotopy-equivalent if there exist smooth map- 
pings gy: M — N and x: N > M such that the compositions x og: M—> M 
and go x: N —> N are smoothly homotopic to the respective identical mapping. 

3. M is called contractible if for some mo € M the constant mapping M 35 mt 
mo € M and the identical mapping idjy are smoothly homotopic. 


Remark 4.3.6 


1. There is an analogous notion of homotopy in the category of topological spaces 
and continuous mappings. In the case of smooth manifolds and smooth map- 
pings, the two notions of homotopy are related as follows. If two smooth map- 
pings are continuously homotopic, they are smoothly homotopic. Every contin- 
uous mapping between manifolds is continuously homotopic to a smooth map- 
ping, see [130]. After having clarified this, in the sequel we may drop the adjec- 
tive smooth and just speak of homotopies. 

2. To be homotopic is an equivalence relation in the set of smooth mappings from NV 
to M. The set of homotopy classes of mappings S' + M, such that some chosen 
point so € S! is mapped to some chosen point x9 € M, is called the i-th homo- 
topy group’ of M and is denoted by 1; (M, xo). If M is connected, 1;(M, xo) is 
naturally isomorphic to 2;(M, x;) for all x9,x; € M. Hence, in this case, it is 
common to suppress the base point xo in the notation and to simply write 7;(M). 

3. To be homotopy-equivalent is an equivalence relation in the totality of smooth 
manifolds. The equivalence class of a manifold M is called the homotopy type 
of M. This terminology extends in an obvious way to topological spaces. 

4. Let M and N be compact oriented manifolds with volume forms vy and vy, 
respectively, and let g: M — N be a smooth mapping. The mapping degree 
deg(¢) of g is defined by 


deg) | vu =f g*VN. 
M M 


One can show that deg(¢) is an integer, see for example [76]. There, the reader 
can find an exhaustive discussion of this important topological concept. In the 
special case M = N = S!, the mapping degree can be interpreted as the winding 
number. It defines an isomorphism from 771 (S!) to Z (Exercise 4.3.4). 


7For i = 1, the group multiplication is induced from an appropriate composition of closed paths. 
For i > 1 there is an analogous construction, see [76] for a detailed presentation. In contrast, 
mtq(M, xo), which is in bijective correspondence to the set of connected components of M, in 
general does not carry a group structure. 
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5. A manifold is contractible iff it is homotopy-equivalent to the one-point space 
{*}: for mo € M, let Qm) : M — M denote the constant mapping m +> mo. Define 


g: M = {x}, g(m) :=*, x: {x} > M, x(*) = mo. 


By construction, we have y o x = id;,; and x 0 @ = Qm,. Hence, x o g is homo- 
topic to idy iff so is Po. 


Now, let M be a manifold and let 7 = [0,1]. Every 2 € 2*(M x 1) can be 


uniquely decomposed into $2 = a, +a@2 Adt, where aj(-, t) € Q*(M) and a2(-, t) € 
2*-1!(M) for every t € J. Consider the family of mappings 


1 
Dy: 2*(M x I) > 2M, (Dp2)(m) := ye | a2(m, t)dt, 
0 


(4.3.4) 
where the integral is to be understood as an integral of a function on the interval J 
with values in the vector space is T.M. 


Lemma 4.3.7 We have 
(D2) + Dey (d2) = Qy=1 — Ly1=0. (4.3.5) 
Proof Letm € M and let (U, x) be a local chart at m. Let 
ao(m,t)=ay(m,t)de’, ay (m, t) = by (m, t)d«! 


(summation convention) be the corresponding local formulae for the components of 
92. Then, 


1 
(Dy 22)(m) = cof ay(m, na Jae! 
0 
and thus 
1 
(d(Dx2)) (m) = cof sau(m, tara? A dk! 
On the other hand, 


(Dx+1(d@)) (m) 
= (Dg+1(dary))(m) + (Dizi (deez A dt))(m) 
= (Deyi (dk byde4 A de! + dbs dt A de’)) (mn) 


+ (Dypi(dardk? A de! A dt))(m) 


1 
= nk f babs, at Jae! 
0 
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1 
a cot f aay (m, Dar) de? A dk! 
0 


= (by(m, 1) — by (m, 0))dk! — (d(Dg.2)) (m) 
= Qy=1(m) — Qy=0(m) — (d(Dg2))(m). 


This completes the proof. 


Proposition 4.3.8 Homotopic mappings induce the same homomorphism of the de 
Rham cohomology groups. 


Proof Let ¢o,¢1: M — N be smooth mappings and let F: M x I > N bea ho- 
motopy fulfilling F(m,0) = go(m) and F(m, 1) = gi (m). Let a € Q*(N). If we 
set 2 = F*a in (4.3.5), we get 


dDy F*a + DyyidF*a = gfa — goa. 


If w is a representative of a cohomology class in H*(N), then dD; F*a = gra — 
gpa. Thus, g}a and ype differ by an exact form, that is, they define the same coho- 
mology class. 


Remark 4.3.9 Given smooth mappings go, 1 : M > N,a family of mappings Hy : 
Q*(N) > Q*-!(M) fulfilling 


Ay, 0d+do Kh =gf — 


is called a homotopy operator for gp and ¢). If F is a homotopy of g and ¢ , then 
Ay = Dx o F* is a homotopy operator for go and 9}. 


Corollary 4.3.10 Homotopy-equivalent manifolds possess isomorphic de Rham co- 
homology groups. 


Proof Let M and N be homotopy-equivalent and let g¢: M — N and x: N> M 
be smooth mappings yielding this equivalence. We consider the induced homomor- 
phisms y*: H*(N) > H*(M) and x*: H*(M) > H¥(N). Since x og ~ idy and 
gox ~ idy, respectively, Proposition 4.3.8 and (4.3.3) imply 


gp ox*=(xXog)*= idyy, = idyk (yy): 


and, in the same way, x* op* = idjx(y,)- Thus, g* and x* are isomorphisms, which 
are inverse to each other. 


Corollary 4.3.11 (Lemma of Poincaré) If M is contractible, then 


R |k=0 


H*(M) = 
a 0 |k>0. 
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Proof By Remark 4.3.6/5, M is homotopy-equivalent to the one-point space. Thus, 
Corollary 4.3.10 implies H«(M) = H* ({x}). 


Remark 4.3.12 


1. The Lemma of Poincaré tells us, in particular, that on a contractible manifold 
every closed form is exact. 

2. The operator Dx, defined by (4.3.4), yields an explicit construction for potentials 
of a closed form on a contractible manifold: let F: M x I > M be a homo- 
topy of M to mo, that is, F(m,0) =m and F(m, 1) = mp. By (4.3.5), we have 
dD F*a + DgyidF*a = —a for all wa € 2*(M). If da = 0, we read off the 
following potential for a: 


B= —D,F*a. (4.3.6) 
Example 4.3.13 Let a € 27 (IR+) be given by 
a=xydx Ady + 2xdy A dz+ 2ydx A dz, 


in standard coordinates x, y and z on R3. Obviously, da = 0. To construct a potential 
we choose a homotopy to the origin: 


F(@,9,2),0) =(@,9,10), +e (0,11. 
Then, 
F*a =(xy77 + 2yzt*)dx A dt + (2x20? - x*yt?)dy A dt — Axyt*dz A dt 
+ xyttdx A dy + 2xtrdy A dz+ 2yt3dx A dz. 
Thus, we obtain the potential 


B= —DoF*a 


1 1 
=(-1)7"! It (xy7#? + 2yzt?)d da + (/ (2x21? = yr )ay 
0 0 


= (faa) 


Sede elds oe ed 
=—( aay + yz }dx gt ¥— 3Xe dy t sxyde. 
The following generalization of the Poincaré Lemma will be useful later on. The 
proof uses the Tubular Neighbourhood Theorem for embedded submanifolds, cf. 
Remark 6.4.7, and the fact that the normal bundle of a submanifold N is homotopy 


equivalent to N. We leave the latter as an advanced exercise to the reader (Exer- 
cise 4.3.5). 


192 4 Differential Forms 


Proposition 4.3.14 (Generalized Poincaré Lemma) Let (N,@) be an embedded 
submanifold of M and let a be a closed k-form fulfilling p*a = 0. Then, there exists 
a (k — 1)-form B on a neighbourhood of N in M fulfilling 

a=d6, Brn =0. 
If, in addition, a vanishes® on N, then B can be chosen such that, in any local chart, 


the first derivatives of the coefficients of B vanish on N. 


Algebraic topology provides a lot of tools for calculating the de Rham cohomol- 
ogy groups of a manifold, see [52], [96] and also [55]. Here, we only make some 
elementary remarks. First, the cohomology groups of a direct product M x N can 
be computed by means of the Ktinneth Formula: 


H"(M x N)= QB H?(M)® H4(N). (4.3.7) 
p+q=r 
Thus, 
H*(M x N) = H*(M)® H*(N), (4.3.8) 
which is to be understood as a tensor product of associative algebras. 


Example 4.3.15 Consider the 2-torus T* = S! x S!. We calculate its de Rham co- 
homology groups: 


H°(T’) = H°(S') @ H°(S')=R@®R=R 
H'(T’) =(H'(S') @ HY(S')) ® (H"(S') @ H'(S')) 
=(R@R)GROR=RK 
H?(T’) = H'(S') @ H'(S')=R®RE=R. 
As a second tool, one has the following integral criterion for exact forms. 


Proposition 4.3.16 A closed differential k-form « on a manifold M is exact iff 
te a = 0, for every compact k-dimensional submanifold N of M. 


We give a sketch of the proof. Assume that a € 2*(M) is exact, w = dB. Then, 
using dN = © and Stokes’ Theorem, we get Jue = fon? = 0. The converse 
follows from the de Rham Theorem, which states that there exists a natural iso- 
morphism between the de Rham cohomology algebra and the real-valued differen- 
tiable singular cohomology algebra.’ The latter is dual to the differentiable singu- 
lar homology. We provide the reader with a rough idea of these notions. Singular 


8If w vanishes on N, then gy*a vanishes, too. Note that the converse is, of course, not true in 
general. 


°For a proof see [302], Sects. 4.17, 5.36 and 5.45. 
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homology theory is built from formal real linear combinations (called k-chains) 
o = >); Ai0; of differentiable mappings (called differentiable singular simplices) 
Oj: Ak + M, where 


Af = {xe Rix) >0,x'+---4x% <1} 


denotes the standard k-dimensional simplex. There is a natural boundary operator 
defined by 


k 
80; = DD! oi pat 
j=0 


where At Oey AF denote the faces of Ak, defined by x* =0, and Ar denotes the 
remaining face. This operator is extended to arbitrary chains by linearity. One easily 
shows that 0 o d = 0. Now, the k-th differential singular homology group of M with 
real coefficients is defined by ker 0, /im 0x41. Elements of ker 0g and im 0x41 are 
called differentiable k-cycles and differentiable k-boundaries, respectively. The cor- 
responding differentiable singular cohomology is then built from linear functionals 
(called singular cochains), which assign to each singular simplex a number. Now, 
the link to de Rham cohomology is provided by defining how to evaluate a closed 
k-form @ ona k-cycle o = 90; Ajo;: 


(a,0) = a L O; a. (4.3.9) 


This induces a linear mapping from the k-th de Rham cohomology into the real 
differentiable singular cohomology, which turns out to be an isomorphism. It can 
be easily seen that the integrals over all k-dimensional compact submanifolds of 
a closed k-form @ vanish iff @ vanishes on every k-cycle. Then, injectivity of the 
isomorphism defined by (4.3.9) implies that a closed form vanishing on all k-cycles 
must be exact. 


Example 4.3.17 As an application of Proposition 4.3.16, let us calculate the de 
Rham cohomology groups of the spheres S$”, n > 1: 


R |k= 
H*(s") = ee (4.3.10) 
0 | otherwise. 


The case k = 0 was dealt with in Example 4.3.4/1. LetO <k <nandleta e€ OFS) 
be closed. Let N C S” be an arbitrary compact k-dimensional submanifold of S”. 
Then, there exists a point x € S” such that N is contained in the open submanifold 
St := 8S” \ {x} CS”. Since Sf is contractible, according to the Poincaré Lemma, 
asx is exact. Applying Proposition 4.3.16 to M = SX we get ya = 0. Since N 
was arbitrary, the same proposition, applied to M = S”, yields that a is exact on S”. 
We conclude H*(S”) = 0 for 0 < k <n. Consider the case k =n. By the theorem on 
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invariance of domain, see Footnote 38 on page 159, the only compact n-dimensional 
submanifold of S” is S” itself. Hence, Proposition 4.3.16 entails that H”(S”) is in 
one-to-one correspondence with values of the integrals Tes a. Since S” is orientable, 
it admits a volume form v. By multiplying v by a real number, one can obtain any 
real value for the above integral. Thus, H”(S”) = R. It also follows that x (S”) = 0 
if n is odd and x (S”) = 2 if n is even. 


Exercises 

4.3.1 Check that the 1-form 8 constructed in Example 4.3.13 is a potential for the 
2-form a, indeed. 

4.3.2 Calculate the de Rham cohomology groups for the annulus 


1 
A= {xeR*: l< (x? + y?)? <2}. 
4.3.3 Consider the following 1-form on R? \ {0}: 


xdy — ydx 
a= ——__. 
x? + y? 


(a) Prove that aw is closed. Calculate the integral of the restriction of a to the 
unit circle S! C R? \ {0} and explain why a is not exact. 

(b) Show that the curves y, : [0,1] ~ R2 \ {0}, m@) = (cos(27n?t), 
sin(2znt)) are not homotopic for different values of n € Z. 
Hint. Calculate the integrals of a over yy. 

(c) Let 6 be another closed 1-form on R? \ {0}. Show that there exists a num- 
ber c and a function f : R* \ {0} > R such that 6 = ca + df. Conclude 
that @ generates the first de Rnam cohomology of R? \ {0}. 

4.3.4 Prove that the mapping degree defines an isomorphism from 77; (S!) to Z, cf. 
Remark 4.3.6/4. 

4.3.5 Show that every vector bundle is homotopy equivalent to its base man- 
ifold. Use this and the Tubular Neighbourhood Theorem for embedded 
submanifolds, stated in Remark 6.4.7, to prove the Generalized Poincaré 
Lemma 4.3.14. 


4.4 Riemannian Manifolds 


In this section, we discuss some elements of Riemannian geometry needed in the 
sequel. A systematic treatment will be contained in Part II of this book. 


Definition 4.4.1 (Riemannian manifold) Let M be an n-dimensional manifold. 
A metric on M is a covariant tensor field g € I” (T3(M )) such that for every m € M 
the bilinear form 


Em: TmM x TnM > R 
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is symmetric and non-degenerate. If g,, is positive-definite for all m € M, then g is 
called a Riemannian metric and the pair (M, g) is called a Riemannian manifold. 
Otherwise, g is called a pseudo-Riemannian metric and the pair (M, g) is called a 
pseudo-Riemannian manifold. 


Let m € M. It is well-known from linear algebra that there exists an orthonormal 
basis {e;} in T,, M, that is, 


+1 0 
Em (Ei, ej) = ( 0. a . 


We denote 7; ; (7) = gm (ei, e;). According to the Theorem of Sylvester, the integers 
r and s do not depend on the choice of the basis. Using Proposition 2.3.15 and 
an inductive orthonormalization procedure one can show that {e;} can be extended 
to a local orthonormal frame! at mo (Exercise 4.4.1). Thus, g(é;,e;) =nij ina 
neighbourhood of mo, and the numbers r and s are constant on each connected 
component of M. The pair (r,s) is called the signature!! and s is called the index 
of g. 


Proposition 4.4.2 Every manifold admits a Riemannian metric. 


Proof Choose a countable atlas {(U;, «;) : i € J} on M and a subordinate partition 
of unity { f;}. Via «;, the standard scalar product on R” induces a Riemannian metric 
gi on Uj. For every 1, the tensor field f;g; extends to a smooth tensor field g; on 
M with supp(g;) = supp(f;). Since the covering {supp(f;)} is locally finite, we can 
define g := )-, gj and this is a smooth covariant tensor field of second order on M. 
Since on the interiors of their supports, the g; are Riemannian metrics, and since the 
sum of Riemannian metrics is a Riemannian metric, g is a Riemannian metric. 


Since g» is a non-degenerate bilinear form, it can be viewed as an isomorphism 
of vector spaces: 


Em: TmM > T;,M, Xmt> &m(Xm,-). (4.4.1) 
The isomorphisms g,, combine to a vector bundle isomorphism g: TM —> T*M, 


which induces isomorphisms 


k k 
g: /\tM > A\T*M, g(X1A---A Xp) =B(XI) A+ A g(Xy). (4.4.2) 


The metric g induces natural symmetric non-degenerate bilinear forms g,, and a 
on \‘T,,M and /\* T*,M, respectively: 


Bm (X1 A+++ A XK, Yi A+++ A Yq) = det(gm(Xi, Yj)), (4.4.3) 


‘ODenoted by the same letter. 


‘Some authors prefer to call t =r — s the signature. 
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é,, (2) =tale, 8, ©). (4.4.4) 


Remark 4.4.3 Let us write down the isomorphisms (4.4.1) and (4.4.2) in a local 
chart (U, x). We denote 


gij = 8(9F,9%), gi = go! (de', de). 


Then, 
g=eijde' @dei, gg t=gilat @ak, (4.4.5) 
with @ denoting the symmetric tensor product. Since, as mappings, go g~! = id, 
we have 
gigi = ole. (4.4.6) 


In these notations, the isomorphism (4.4.1) and its inverse take the form 
g(X'd") = X'gijdk/, g | (aidk') = aig"! d*. (4.4.7) 


In the physics literature, one usually writes X; = x gij and a/ = ag’! and one 
says that by the help of the metric one raises or lowers indices. Similarly, we denote 


g1y :=g(97, 95), gl! :=g ' (dk! , dx’). 
We have 
g1j(m) = det(gi, ;,(m)), gre * = 65h, 
and we can use g,, and g!/ to raise and lower indices: 
X;=g1)X’, ow! = gay. 
Then, 


ge '(@)=ala%, ¢ '@,p) =a Ber. (4.4.8) 


If we assume M to be oriented, there exists a distinguished volume form: choose 
an oriented orthonormal basis {e;} in T,, M@ and define 


(Ve) (X1, 20. Xn) = det(gm (Xi, ej), Xi; €TnM. (4.4.9) 


This definition is independent of the choice of the basis, because for another ori- 
ented orthonormal basis {e} one has e; = A;/ e; with det A = +1. Since {e;} can be 
extended to an orthonormal local frame in TM, (vg)m depends smoothly on m. 


Definition 4.4.4 The differential n-form vg defined by (4.4.9) is called the canoni- 
cal volume form of (M, g). 
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Let {e;} be an oriented orthonormal local frame in TM and let {0/} be the dual 
frame in T* M. Then, 


g=nijvi @ 0 (4.4.10) 
and 
gle=nijo!, g '(0')=ne;. (4.4.11) 
Analogously, 
g(en=niy0’, = g (8!) =n"e,. (4.4.12) 
Moreover, 


vg(e1,..-,€n) =det(nij) =(-1)*, g | (vg) =e1 A+++ A €n. (4.4.13) 
Thus, denoting J, = {1,...,}, we obtain 
vg = (-1)59". (4.4.14) 


Remark 4.4.5 We determine the representative of vg in a local chart (U, «). There 
exists a positive definite function f such that 


Ve = fda A---Adk" = fda, (4.4.15) 
Writing e; = e;/ 0% and 3! = o! jdk/ and using (4.4.14), we obtain 
Vg = (1) 1; «+; dicll A+» Adult = (—1)§ det dda. 


Hence, f(m) = (—1)° - det(#(m)) > 0. Moreover, from e;9/, = 6;/ and 
Bimei/e;” = ni; we conclude 


dete- det? = 1, detg - (det e)* =(-1)*, (4.4.16) 
and thus 


det ¥ = —— =+,/|detg]. (4.4.17) 


In order to have f > 0, we must choose det 7 = (—1)*./| det g]. Then, 


1 . , 
Ve = J |detglde’ A---Adk” = — /|det glei,..i,de'! A+ A dk’, 
nN: 


with ¢;,..;, denoting the completely antisymmetric tensor of rank n. In the physics 
. . —_— L 5 
literature, one often uses the Levi-Civita tensor a i, = /| det gle;,..i,. Then, the 


volume form reads 


el” de Aes Adee, (4.4.18) 
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Exercises 

4.4.1 Let (M, g) be an oriented pseudo-Riemannian manifold and let m « M. Show 
that every oriented orthonormal basis in T,,,M can be extended to an oriented 
orthonormal local frame in TM at m. 


4.5 Hodge Duality 


In this section, we discuss the Hodge star operator and the operator dual to the 
exterior derivative in the sense of Hodge. Finally, we build the bridge to classical 
vector analysis. Throughout this section, let (M, g) be an n-dimensional oriented 
pseudo-Riemannian manifold with signature (7, s). 

The isomorphism (4.4.2) and the canonical volume form vg induce the following 
natural linear mapping. 


Definition 4.5.1 (Hodge star operator) The linear mapping 
*: Q*(M) > 2"-*(M), #a = (—1)°g7!(a) vg, (4.5.1) 


is called the Hodge star operator of (M, g). The (n — k)-form *q@ is called the Hodge 
dual of a. 


Obviously, 
*1 = (—1)*vg. (4.5.2) 
Moreover, from (4.4.13) we read off 


*Vg = 1. (4.5.3) 


For J C In, let 1° denote the complement. Formula (4.1.5) implies 


c I c 
oma co! =sion( ” |e". 
eT PII g Gs 


Using this, together with (4.4.12) and (4.4.14), we obtain 
vf c 
#0! = (-1)'g7!(8/)a(- 10" =n! e710" = sign ) nto. (4.5.4) 


Let (U, x) be an oriented local chart. Then, 
xa = (—1)°g7! (a) vg 
= (-1)' (a! df) 1/| det gid” 
1, é 
= (—1)° /|detgla’ sign (i)! 


= (—1)'a! ebCdk”. (4.5.5) 
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Remark 4.5.2 One can show that the Hodge dual «a of a € 2*(M) is the unique 
(n — k)-form such that 


g | (xa, B)vg =a A B (4.5.6) 


for all BE 92"-*(M) (Exercise 4.5.1). Hence, this formula can be taken as an alter- 
native definition for the Hodge star operator. 


Proposition 4.5.3 For a, B € 2*(M), 


sags (—D Ye, (4.5.7) 
g | (xa, *B) =(—1)*g"'(a, B), (4.5.8) 
a A*B =(—1)'g '(a, B)vg. (4.5.9) 


Proof Since the *-operator is linear, it is enough to show (4.5.7) for a = 0. The 
latter follows from (4.5.4) (Exercise 4.5.2). Then, using (4.5.6) and (4.5.7), we cal- 
culate 
g(a, *B)Vg =a A xB 
= (16 (*B) Aa 
= (1) gl eB, avg 


= (-1)°g"' (a, B)vg. 


This identity entails both Formulae (4.5.8) and (4.5.9). 


A k-form a on M is said to be square-integrable if the n-form a A «a is inte- 
grable. On the subspace of square-integrable k-forms we define an inner product 
by 


(a, B) =) aA xB. (4.5.10) 
M 


This inner product is positive definite, and hence a scalar product, iff (M, g) is 
Riemannian, that is, iff s = 0. It satisfies (Exercise 4.5.3) 


(+o, *B) = (—1)*(@, B). (4.5.11) 
Remark 4.5.4 For an oriented local chart (U, «), Formulae (4.4.7) and (4.5.9) imply 


a A *B = (—1)°,/|detgla! BY g; de. (4.5.12) 


200 4 Differential Forms 


Definition 4.5.5 Let (M, g) be an oriented pseudo-Riemannian manifold. 


1. The operator d* : Q*(M) — 2*—1(M) defined by 
(d*w, 6) = (w, dp) 


for all BE 2*-!(M) is called the Hodge dual of the exterior derivative d. 
2. The operator U: Q*(M) — @*(M) defined by 


:= dd* + d*d 


is called the Hodge-Laplace operator of (M, g). 


Proposition 4.5.6 For a € 2*(M), 


d®a = (—1)"—D 4541 x dea, (4.5.13) 


Proof Since M has no boundary, Stokes’ Theorem implies 
(B, d*e) =i ap rsa= BA (—1)kdxa 
M M 


= / BAx{(—D DTT x dar}. 
M 


Remark 4.5.7 The Hodge dual d* satisfies (d*)* = 0, but in contrast to the exterior 
derivative, it is not an anti-derivation on Q*(M). 


Example 4.5.8 (Classical vector analysis) Let (M,g) be R” with the Euclidean 
metric. With respect to the standard orientation, the volume form is given by 
Ve = dx! A--- A dx”. The natural isomorphism 


X(M) > 2'(M), XH goXx 


relates classical vector analysis to the theory of differential forms. The classical 
differential operators of gradient and divergence may be defined in a coordinate- 
free manner as follows. 


1. For f €¢ C“(M), the gradient of f is the vector field 
grad f =g-!o(df). (4.5.14) 
2. For X € X(M), the divergence of X is the function 
div X = *d(*(g 0 X)) =—d*(go X). (4.5.15) 


In case n = 3, there is a third classical differential operator on vector fields, the 
curl: 
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3. For X € X(M), the curl of X is the vector field 


curl X = g! 0 (xd(go X)) =d*(Xuvg). (4.5.16) 
Formulae (4.5.14)-(4.5.16) make sense on any oriented (pseudo-)Riemannian 
manifold (M, g).!? Thus, they can be taken to define the operators of gradient, di- 


vergence and, in three dimensions, of curl on M. In the case of three dimensions, 
these operators fit into the following commutative diagram: 


0 —+ C™®(M) ig 2'(M) ae 2?(M) ee 23(M) ——~> 0 


Mae oe ee 


rad 
0 — > C™(M) coe %(M) ———> X(M) — + C™~(M) —— 0. 


This means that grad, curl and div establish a complex parallel to the de Rham 
complex. In particular, curl o grad = 0 and div ocurl = 0. 


Example 4.5.9 (Hodge-Laplace operator on Minkowski space) Let (M,n) be 
Minkowski space, that is, M = R¢ with inner product 


+1 0 
i 0 —13)° 


(dd* + d*d) f =d*df = (-D*0-D 734! x dadf, 


For f € C°(M), we find 


that is, 


f=xdxdf. (4.5.17) 


In the standard coordinates ct, x1, x2, x3 on M, 


1 a2 2 2 2 
pee oe 


eee Se hp 
c* or? ax? axs axe 


That is, up to the sign,'* CJ coincides with the wave operator. 


Exercises 

4.5.1 Prove Remark 4.5.2. 

4.5.2 Prove Formula (4.5.7) by showing that it holds for a = /. 
4.5.3, Prove Formula (4.5.11). 


2A sign (—1)* has to be inserted on the right hand sides of Formulae (4.5.14) and (4.5.15). 
'3The proper sign would be obtained for the signature (— + + +). 
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4.5.4 


4.5.6 


4.5.7 


4 Differential Forms 


Consider M = R” endowed with the Euclidean metric and let X = X‘0; € 

X(R"). Prove the following formulae. 

(a) goX=X!du! +..-4 X"dx", 

(b) grad f = 0) fd) +--+ + On f dn, 

(c) div X = 0;X', 

(d) curl X = (0X? — 83X7)d; + (03X! — 0X7). + (8X? — aX!)as, 
(n=3). 

Consider M = R”, endowed with the Euclidean metric. Show that application 

of U to a function f € C™(R") yields the Laplace operator, that is, 


i= aa a(x! Ges 


Moreover, show that A f = divo grad f. 

Let U be a contractible open subset of R*. Prove the following. 

(a) If curl X =0 on U, there exists f € C(U) such that X = grad f. 

(b) If div X =0 on U, there exists Y € ¥(R#) such that X = curl Y. 

Derive the integral theorems of classical vector analysis from Stokes’ Theo- 

rem for differential forms. 

(a) Theorem of Gauf: let X € X(R”). For any n-dimensional manifold M Cc 
R” with boundary, 


[ civ = | (X-n)vam. 
M aM 


Here, vy denotes the volume form on M, vay the induced volume form 
on 0M and n is the (normalized) normal vector field on the boundary, 
pointing outwards. 

(b) Classical Theorem of Stokes: let X € X(IR*). For any 2-dimensional ori- 
ented surface, 


i" ((curl X) -n)viy =| (X- vam. 
M aM 


Here, n denotes the (normalized) normal vector field, compatible with the 
orientation of the surface, and t denotes the (normalized) tangent vector 
field of the boundary, compatible with the induced orientation. 
Let (M, 7) be Minkowski space, cf. Example 4.5.9. The Levi-Civita tensor 
Ate written down in the standard basis, is defined as the signature of the 


permutation (1234) + (ijkl). Show that ein fulfils 


gill »LC ijk giJkl LC reil gijkt LC i 
ELC Emnpl — — —Srinp, Etc Emnkl = —2!8inns ELC Empkl = —3! Sins 
where ne . = = sign(j" a) if j,,..., jr is a permutation of i,,...,i, and 

ir 
——— 0 ieee 


JL-eJIr 
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4.6 Maxwell’s Equations 


We consider the classical Maxwell theory of electrodynamics. This is a relativistic 
field theory on Minkowski space (M, 7), with the metric 7 defined in Example 4.5.9. 
We use a reduced system of units by setting c = 1 and wp = €9 = 1. We limit our 
attention to the vacuum case, that is, we set the magnetic permeability jz and the 
relative permittivity ¢ equal to 1, but we allow for sources. 

To start with, let us recall the standard relativistic formulation of Maxwell theory 
in a chosen Lorentz!* system! {x“} in M. The electromagnetic field is described 
by the field strength tensor 


(4.6.1) 


where E = (E,, Ey, E,) and B= (B,, By, B,) are the electric field and the mag- 
netic induction, respectively. In the above system of units, in the vacuum we have 
D=E and B = H. Thus, the electrodynamic displacement tensor, built from D and 
H, coincides with f,,,. The charge density and the current density j constitute the 
4-covector of the relativistic current density 


ju = (0, -j.- (4.6.2) 
In these notations, Maxwell’s equations take the form 


Gu toe + Outou + Col we = 0: (4.6.3) 
ey ia ae (4.6.4) 


Equations (4.6.3) are equivalent to the homogeneous Maxwell equations 
0 : 
curl E + Tae divB = 0, 
and Eqs. (4.6.4) correspond to 
a ; . 
curl B — ape = divE= p. 


The transformation to another Lorentz frame {x’"} is given by x’4 = A",x”, with 
A denoting a proper orthochronous Lorentz transformation, that is, 


A nA=n, det A =1, A% > 1. (4.6.5) 


'4Named after the Dutch physicist Hendrik Lorentz (1853-1928). 


'5 As usual in the physics literature we use Greek indices here. 
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The corresponding transformation laws for the field strength tensor and the current 
density are 

fer aA a ei, tng 


The transformation law for f,,, and j,, is obtained by lowering the indices by nyv. 
Now, we pass to a coordinate-free description. For that purpose, we define the 
following differential forms on M: 


1 
= 5 fuvdx® A dx”, J = je". (4.6.6) 
In this notation, Maxwell’s equations (4.6.3) and (4.6.4) take the form 


df =0, (4.6.7) 
d* f =—-j, (4.6.8) 


with d* = «dx when acting on 2-forms (Exercise 4.6.1). 


Remark 4.6.1 


1. Applying the Hodge operator to (4.6.8) and using (4.5.7), we obtain d * f = 
— * j. Application of the exterior derivative then yields d* j = 0 or 


d*j =0. (4.6.9) 


This is the continuity equation for the current density. It reflects the charge 
conservation law. In a Lorentz system, this equation reads 0,,j = 0, that is, 
0,0 + divj = 0. 

2. Since Minkowski space is contractible, the Poincaré Lemma applies: due tod f = 
0, there exists a differential 1-form A, called the electromagnetic potential, such 
that 


f=dA. (4.6.10) 


In view of (4.6.6), in a Lorentz system this reads fy» = 0,,Ay — 0)A,. Since 
d* = 0, the potential A is determined by f up to the differential of a smooth 
function, that is, for every 4 € C®(M), the 1-form 


A'=A+da (4.6.11) 


yield the same field strength f as A. The mapping At A’ is called a gauge 
transformation. To fix A, one may impose gauge conditions like the so-called 
Lorenz!® gauge, 


d*A=0, (4.6.12) 


'6Named after the Danish mathematician and physicist Ludvig Valentin Lorenz (1829-1891). 
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which in a Lorentz system reads 0,, A“ = 0. This gauge fixing is not complete, 
that is, it does not fix A uniquely. If charges and currents are absent, in addition 
we may impose A° = 0, which then yields the so-called Coulomb gauge:!7 


divA =0, (4.6.13) 


see Exercise 4.6.3. 
3. Let us write down the Maxwell equations in terms of the potential in the Lorenz 
gauge. Equation (4.6.7) is trivially fulfilled and (4.6.8) implies 


—j=d* f =d*dA = (d*d4+dd*)A=DA. 


Thus, we obtain a homogeneous wave equation, 


Aaj. (4.6.14) 


4. One can check that f A« f and f A f are Lorentz-invariant 4-forms on M. Ina 
Lorentz frame they read 


1 
fAaxf= 5 Suv Shae? A+++ A dx? = —(B* — E?)dx° A--- Adx?, (4.6.15) 


1 v 4,0 3 0 3 
PA £5 fav fd A+ Ade? = —(E-B)dx® A Ade? (4.6.16) 


(Exercise 4.6.4). Since f A f =d(A A dA), the invariant f A f is a total diver- 
gence, hence its integral over M vanishes. Consequently, as a natural candidate 
for the Lagrange density L of the electromagnetic field there remains f A «f. In 
the case where charges and currents are absent, we put 


La-Sf Aas (4.6.17) 


Then, the physical action becomes 


1 


1 
s=-5f fasf=-ZhP (4.6.18) 


and Hamilton’s variational principle yields the field equation'® d* f = 0. Indeed, 
denoting the variation by 5, we compute 


0=-5S = (8f, f) = (dA, f) = (5A, f) = (5A, d* f). (4.6.19) 


Since this equation must hold for any 6A, we conclude d* f = 0. 


'TMore generally, the Coulomb gauge condition can also be imposed if charges and currents are 
present, but A° cannot be set equal to zero then. Instead, it is determined dynamically. 


'8The equation df = 0 is of geometric nature, it does not follow from a variational principle. 
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5. An electromagnetic field which fulfils 
fAf=9, fAxf=0 (4.6.20) 


is called isotropic. Using the complex-valued 2-form F := f —ix f, these two 
conditions can be summarized in F A F = 0. By (4.6.15) and (4.6.16), they are 
equivalent to 


E-B=0 and E*—B’=0. (4.6.21) 
In particular, plane waves are isotropic. Since the conditions (4.6.20) are Lorentz- 
invariant, a nonzero isotropic electromagnetic field has the following properties. 
— There does not exist a Lorentz system such that E and B are parallel. 
— There does not exist a Lorentz system such that E or B vanish separately. 


6. The invariant form (4.6.7) and (4.6.8) of the Maxwell equations immediately 
generalizes to an arbitrary oriented Lorentzian spacetime manifold, that is, with 
a pseudo-Riemannian metric g of signature (+ — — —). Since it is only the 
Hodge star operator which is modified, in local coordinates, Eq. (4.6.7) is still 
given by (4.6.3). To analyse Eq. (4.6.8), we denote |g| = | det(g)| and use the 
identities 

gh gPX pV eg. = (det(g)) 2%, (4.6.22) 
Epvpo EP? = 2(51,55 - 515). (4.6.23) 
together with det(g) = (—1)*| det(g)|, to calculate 
1 
viel 


(Exercise 4.6.6). Thus, in local coordinates, (4.6.8) reads 


1 
d* f = «dx (5 fina A ax") = Ou( fe’ Vlel)grade*, (4.6.24) 


1 
viel 


7. The gauge transformation given by (4.6.11) exhibits an additional symmetry of 
the Maxwell theory, which is obviously not related to a change of the Lorentz 
system. Thus, the question arises whether one can give a geometric interpreta- 
tion to gauge transformations. Indeed, it turns out that A may be interpreted as 
the local representative of a connection form!? w in a principal U(1)-bundle P 
over spacetime. In this picture, the field strength f is the local representative 
of the curvature form of w and Eq. (4.6.7) coincides with the Bianchi identity. 
A gauge transformation (4.6.11) may be interpreted either as a vertical automor- 
phism of P (active transformation) or as a change of local bundle coordinates 


du( fe’ Viel) =i". (4.6.25) 


'°The mathematical tools needed for understanding the following interpretation will be presented 
in detail in part II of this book. 
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(passive transformation). In the latter interpretation, Eq. (4.6.11) describes the 
change of a reference frame. 


Exercises 
4.6.1 Prove that the Maxwell equations (4.6.3) and (4.6.4) are equivalent to (4.6.7) 
and (4.6.8). 
4.6.2 Analyse Formula (4.6.9) in a Lorentz frame. 
4.6.3 Prove that in the case where charges and currents are absent, the Coulomb 
gauge is complete, cf. Remark 4.6.1/2. In detail, show that 
(a) for every 2-form f, there exists a unique 1-form A satisfying A° = 0, 
(4.6.13) and (4.6.10), 
(b) for every 1-form A, there exists a smooth function A such that A’ = A+ 
dd satisfies A” = 0 and (4.6.13). 
4.6.4 Verify Eqs. (4.6.15) and (4.6.16). 
4.6.5 Prove the statements of Remark 4.6.1/5. 
4.6.6 Prove the identity (4.6.22) (the identity (4.6.23) was already proved in Exer- 
cise 4.5.8) and confirm Formula (4.6.24). Write down the continuity equation 
for an arbitrary Lorentzian spacetime manifold in local coordinates. 


4.7 Pfaffian Systems and Differential Ideals 


The notion of Pfaffian system is dual to the notion of distribution as developed in 
Sect. 3.5. It plays an important role in the theory of first order partial differential 
equations and in many physical applications, notably in thermodynamics and in 
mechanical systems with constraints. For an exhaustive presentation of this topic 
we refer to [64], Sect. IV.C. 

For a subset A Cc T*M, let Q , (M) denote the set of 1-forms on M with values 
in A. 


Definition 4.7.1 (Pfaffian system) A Pfaffian system on M is a subset A of T*M 
such that for all m € M the following holds. 


1. Am := ANT* M is a linear subspace. 
2. For every B € A,, there exists a € ea (M) such that a, = B. 


The function which assigns to m € M the dimension of A, is called the rank of A. 
If the rank is constant, A is called regular. Otherwise, it is called singular. 


As we did with distributions in Sect. 3.5, we follow the terminology used e.g. in 
[181]. Moreover, Remark 3.5.2 about distributions carries over to Pfaffian systems 
in an obvious way. In particular, the rank is locally non-decreasing. 
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Remark 4.7.2 A Pfaffian system A is regular of rank r iff it is an r-dimensional 
vertical vector subbundle of T*M. In this case, it admits local frames built from 
local 1-forms on M. Let {o!, ...,0"} be such a local frame over U C M. If the 
local r-form 3% = 0! A--- A # is closed, the restriction of A to U coincides with 
the annihilator of the characteristic distribution of 7 (Exercise 4.7.1). 


Definition 4.7.3 (Integral manifold) Let A be a Pfaffian system on M. A connected 
submanifold (NV, y) of M is called an integral manifold of A through m € M if 
meéw(N) and 
0 
W'(TpN) = Ay) (4.7.1) 
for all p € N, where AY, we Ty(p)M denotes the annihilator of Ay»). The Pfaf- 


fian system A is said to be integrable if for every m € M there exists an integral 
manifold of A through m. 


Remark 4.7.4 Assume that A is regular, that is, it is a vertical subbundle of T*M. 
Then, its annihilator A° C TM is a regular distribution and the ranks of A and A° 
add up to the dimension of M. By (4.7.1), a regular Pfaffian system A is integrable 
iff so is the distribution A°. In this case, the integral manifolds of A and A° co- 
incide. Similarly, if D C TM is a regular distribution, then the annihilator D® is a 
regular Pfaffian system. 


Example 4.7.5 


1. Every subset A C 2!(M) generates a Pfaffian system A, with A,, being defined 
as the linear span of the set {a : a € A}, and every Pfaffian system can be 
generated this way. 

2. The Pfaffian system on R* spanned by the 1-forms dx and xdy is singular, be- 
cause it has rank | on the y-axis and rank 2 outside. Similarly, the Pfaffian system 
spanned by dx and ydy is singular, because it has rank | on the x-axis and rank 2 
outside. The first system is integrable, with integral manifolds being the y-axis 
and the single points outside. In contrast, the second system is not integrable. 


Pfaffian systems naturally arise in the theory of partial differential equations. 
Consider the first order system 


dy/ j(,1 p yi n—p - A 
pat FE (PW ey ), l<i<p,1l<j<n-p, (4.7.2) 


in the unknowns y/ € C®(R?), with fi € C™(R"). Since the graph of the com- 
bined mapping y = (y!,..., y"~?) : R? — R"~P is a p-dimensional embedded 
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submanifold in R” and since y solves (4.7.2) iff 
dy? — 0 fi(x!, 0.2? yoy Pde! =0, lsjsn-p, (47.3) 
i=1 


solutions of (4.7.2) can be interpreted geometrically as integral manifolds of the 
Pfaffian system A on R”, spanned by the 1-forms7° 


P . : 
Nal een de l1<j<n-p. 
i=l 


If the 9 are linearly independent, A is regular. Then, by Remark 4.7.4, A is in- 
tegrable iff so is the annihilator A° = kerd! 1--- ker"~?. In the special case 
p =n — 1, we obtain the following simple solvability criterion. 


Proposition 4.7.6 A regular Pfaffian system A of rank 1, spanned by 3 € 2'(M), 
is integrable iff dd (X 1, X2) =0 for all X1, X1€ x*4°(M). 


Proof Proposition 4.1.6 implies 


dd(X1, X2) =—0([X1, X2)). 


Since A° = ker 1, the Frobenius Theorem 3.5.12 yields the assertion. 


Now, we are going to derive an integrability criterion for regular Pfaffian systems 
in the general case. For a Pfaffian system A, let 2%(M) denote the subspace of 
§2*(M) spanned by the forms w € 2"(M), r = 1, which satisfy a@,,(X1,...,X;-) =0 
for all X1,...,X; € Ae andmeM. 2", (M) is a two-sided ideal in the associative 
algebra 2*(M). 


Definition 4.7.7 (Differential ideal) An ideal J C 92*(M) is called a differential 
ideal if dJ Cc J. 


Theorem 4.7.8 (Frobenius Theorem for Pfaffian systems) A regular Pfaffian system 
A on M is integrable iff 2%, (M) is a differential ideal. 


Proof By Remark 4.7.4 and the Frobenius Theorem 3.5.12 for regular distributions, 
we must show that d2Q% (M) Cc 24, (M) is a necessary and sufficient condition for 
the regular distribution A° to be involutive. That it is necessary follows from Propo- 
sition 4.1.6. To see that it is sufficient, let r denote the rank of A and let me M. 


20Tt is common to refer to (4.7.2), rather than to A, as a Pfaffian system and to the |-forms o/ as 
Pfaffian forms. 
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Choose 3!,..., 0" € fan (M) such that (ol. ..., 07} is a basis in A,,. According 
to Proposition 4.1.6, for X, Y € x4” (M), 


(do! (X, Y))(m) = —8),([X, ¥]m), 1s<is<r. 


m 


If d2%,(M) C 27,(M), the left hand side vanishes, hence [X, Y]m € a. Since m 
was arbitrary, it follows that A° is involutive. 


Next, we derive local criteria for a regular Pfaffian system to be integrable. We 
need 


Lemma 4.7.9 Let A be a regular Pfaffian system of rank r on M, letm € M and let 
{0!,..., 9} be a local frame in A at m. Then, m admits an open neighbourhood U 
with the property that for every k-form a in Q%(M) there exist local (k — 1)-forms 
B1,..-, By over U such that 


r 
au = yw /\ Bj. 
j=l 


Proof On some neighbourhood U of m, the local frame (oe, ..., 07} in A can be 
complemented by local 1-forms #’*!,...,%” to a local frame in T*M at m. Let 
{X1,..., Xn} be the dual local frame in TM. Then, {X;+1,..., Xn} is a local frame 
in A°. Expand 


atu = > j,i F A ADE, 
I] <+++<ig 


Since a € 24 (M), we have that aj, i, = py (Xi,,..., Xi,) = 0 whenever r < ij. 
r j 1 _—_— ae . j j 
Hence, ay = LS v/ A B; with Bj = ie ene O ji. D? Av ADE, 


Proposition 4.7.10 Let A be a regular Pfaffian system of rank r on M. The follow- 
ing statements are equivalent. 


1. A is integrable. 


2. For every m € M, there exist a local frame {P0045 07) in A atm and 
local 1-forms Yj> i,j =1,...,7r, such that for alli =1,...,r one has dd! = 
jai OY AY}. 

3. For every m € M, there exists a local frame {8!,...,"} in A at m such that for 


alli=1,...,r one has dd! AD! A---A 0" =0. 


Proof 1 = 2: Choose a local frame {i!,...,”} in A over a neighbourhood U of 
m. Since A is integrable, so is the Pfaffian system Ay := AN (T*U) on U. Hence, 
Theorem 4.7.8 implies di! € Qh, (U). Application of Lemma 4.7.9 to the manifold 
U and the Pfaffian system Ay yields the assertion. 

2 => 1: According to Theorem 4.7.8, it suffices to show that 2%, (M) is a differ- 
ential ideal. Thus, let a € 2%(M) be of degree k and let m € M. Let (3! 0... 07} 
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be the local frame in A over a neighbourhood U of m and let vi, on ea rere 


be the local 1-forms over U, existing by assumption. By Lemma 4.7.9, ary = 
ei 0 A B; for appropriate local (k — 1)-forms 6; on U. Then, 


r af 
d@pu)= D> oH Avi A Bi - >) 0! AaB; 


i,j=l i=l 
and hence (da) (X1,..., Xx) =0 for all X; € AY. Since m was arbitrary, we get 
da € 24(M). 
2 => 3: This is obvious. 
3 => 2: We may complement the local frame {!,..., #”} in A by local 1-forms 


prt! |. 9” to a local frame in T*M at m. Expanding do! = ee oi A ok, 
we obtain 
0= lai WH Ave rndi a. Aav, 
j<k 
It follows a a, =0 for all r < j. Hence, do! = 7_, 0 Ay; with yj = 
Vick Ui? . 


Example 4.7.11 Let M = R? and let A be the Pfaffian system spanned by a 
nowhere-vanishing 1-form % on M. According to Proposition 4.7.10/3, A is in- 
tegrable iff dd A % = 0. Writing 0 = 3;dx', we have 


did = (81 92 — 291) dx! A dx? + (8203 — 9302)dx? A dx? + (8301 — 4193)dx? A dx! 
and di? A 3 = 0 is equivalent to 

(0203 — 0302)01 + (0301 — 0103)02 + (0102 — 0201) 03 = 0. 
Thus, A is integrable iff the vector field }7, 30; is perpendicular to its curl. 


To conclude this section, we derive an integrability criterion for regular Pfaffian 
systems of rank one in terms of so-called integrating factors. Since integrability is a 
local property, it is no loss of generality to assume the Pfaffian system to be spanned 
by a nowhere-vanishing 1-form. 


Definition 4.7.12 Let % € Q'(M). A nowhere-vanishing function f € C°(M) is 
called an integrating factor for 0 if the 1-form fv is exact. 


Proposition 4.7.13 The Pfaffian system A spanned by a single nowhere-vanishing 
1-form § on M is integrable iff for every m € M there exists an open neighbourhood 
U and an integrating factor f € C~(U) for d;y. For every g € C°(U) satisfying 
fd ;u = dg, the level set components of g are integral manifolds of A. 
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Proof First, assume that for every m € M there exists an open neighbourhood U 
and an integrating factor f ¢ C°(U) for 3+y. Then, 


O=d(foju) =df A vi + fdrtu, 


hence d?;y = ru A af and Proposition 4.7.10/2 yields that A is integrable. Con- 


versely, if A is integrable, so is the regular distribution A° of rank n — 1 on M. 
Hence, according to Theorem 3.5.10/4, for every m € M there exists a local chart 
(U, «) atm such that {d/,..., 0%_,} is a local frame in A° at m. Then, 3 = hd” for 
some h € C®(U). Since is nowhere-vanishing, so is h. Thus, f := Me Cry 
is an integrating factor for Dyv. 

Next, let g € C°(U) be such that fd}y = dg and let » be a level set component 
of g. Since dg = fd;y is nowhere-vanishing, every point of U is regular for g. 
Thus, »' is an embedded submanifold of dimension n — | of U (and hence of M). 
Let 1: & — M denote the natural inclusion mapping. Due to 


1 1 1 
*97 = —U(fo = ——.*(dg) = —-d =0, 
Udy Fa (f?iu) fs (dg) Fs (gtx) 


one has T,, 27 C ae for all m € 2. By counting dimensions we obtain equality. 
Hence, » is an integral manifold of A. 


Example 4.7.14 (Ideal gas) The concept of integrating factors is important in ther- 
modynamics. Let us consider one mol of an ideal gas, described by the variables 
V (volume), T (temperature) and p (pressure) which fulfil the thermal equation of 
state, 


pV =RT. (4.7.4) 


Note that (4.7.4) defines a 2-dimensional surface A C R?, which for example can be 
parameterized by the variables (V, T) with V > O and T > 0. A change of state is 
represented by a curve in A and the heat exchange 6Q with the environment during 
this process is obtained by integrating the 1-form 


9 =cydT + p(V,T)dV 


along this curve. Thus, an adiabatic change of state, that is, a process for which there 
is no heat exchange with the environment, corresponds to an integral manifold of 
the Pfaffian system A on A defined by . By Proposition 4.7.10, A is integrable, 
because d? A # = 0 holds trivially on a 2-dimensional manifold. Then, by Proposi- 
tion 4.7.13, there exist local integrating factors f for 3%. To find them, it suffices to 
consider the condition d( fv) = 0. This yields 


af af ap 
0=cy —dV AdT + — pdT AdV —dT AdV 
a ay. + or? + Sor 
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and with (4.7.4) we conclude 


of RT af R 
Cc = 
YoV Var v 


f. 
Any nowhere-vanishing function f which fulfils this differential equation is an in- 
tegrating factor. Let us consider the following examples: 
1. f(T, V) = V¥/v: a potential g of fi can be obtained by applying the proce- 
dure described in Sect. 4.3: 
g(T, V) =cy(TVR/Y — Tov’). 
The integral manifolds of the Pfaffian system A are given by the level set com- 
ponents of g in A, 
TV*/ev — const. 
2. f(T, V) =T7!: the potential g coincides with the entropy 
S(T, V) 1 = + RI : 
VV) =cylh— n—, 
ae Vo 


and the integral manifolds of A provided by this integrating factor correspond to 
reversible processes. For the ideal gas, with dU = cydT, we obtain 


cy R 1 D 
dS = —dT dV = —dU dV, 
T 7 Vv T r T 


that is, 
dU =TdS — pdVv. 


The integrability of the 1-form 3? = cy dT + pdvV is equivalent to the second 
law of thermodynamics or, more precisely, it yields the mathematical foundation 
of this law for the ideal gas. 


Exercises 
4.7.1 Prove the statements of Remark 4.7.2. 


4.8 Constraints in Classical Mechanics 


The motion of a system of N particles in the configuration space R*" is often re- 
stricted by constraints, that is, in addition to the external forces acting upon the 
system there are constraining forces, which ensure that the system evolves in ac- 
cordance with the constraints. For simplicity, let us restrict our attention to time- 
independent constraints. Let rg denote the position of the a-th particle. There are 
two qualitatively different cases. 
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(a) The constraints are defined by a system of equations 
f'(@,.-.,tv)=0, a=l1,...,5, (4.8.1) 


where f“ : R°’ — R are smooth functions which are assumed to be function- 
ally independent. Constraints of this type are called holonomic. The Level Set 
Theorem 1.2.1 implies that 


Q={n1,....tw) eR : f*ir1,...,1w) =} (4.8.2) 


is a (3N — s)-dimensional embedded submanifold of IR? called the constraint 
manifold or the reduced configuration space of the system. We have 


(\ker(4f*) «, Ae ry) = Ket for, oe rw)? 
a 


where f = (f!,..., f°). Thus, the Pfaffian system generated by the exact 1- 
forms df“ is integrable and the integral manifolds are the level set components 
of f. 


(b) The constraints are defined by a system of equations 


mM 


f(r, ..-, tn T1,..., rn) =0, a=1,...,58, (4.8.3) 


with f% being smooth functions which depend both on positions and veloci- 
ties. Constraints of this type are called nonholonomic. While, in general, the 
dependence on the velocities may be nonlinear, in practice one rather meets the 
case where this dependence is linear. This is referred to as the case of linear 
nonholonomic constraints. Let us confine ourselves to this situation. In general, 
there will be both holonomic and genuinely nonholonomic constraints. Let Q 
denote the constraint surface defined by the first ones and let g!,..., g/ be local 
coordinates on Q. The remaining constraints read peg! = 0, or 


pi=ptdg'=0, a=1,...,s. (4.8.4) 


The 1-forms 4“ generate a Pfaffian system A which is not integrable, because 
otherwise the constraints would be holonomic. The corresponding distribution 
A? is called the constraint distribution. Allowed trajectories t > q'(t) have to 
fulfil q(t) € A° for all t or, equivalently, 


(4 (a), 4) =0 (4.8.5) 


for all t and a. We conclude that linear nonholonomic constraints are given by 
a non-integrable Pfaffian system. This type of constraints does not lead to a 
reduction of the configuration space. 


Example 4.8.1 We consider a disk of radius R rolling in upright position on a hor- 
izontal plane P without gliding. Denote the Cartesian coordinates on P by (x, y). 
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At each time f, the disk is located in a plane P(t) which intersects P along a line 
l(t) and touches the plane P ata point A(t). Let us denote the Cartesian coordi- 
nates of this point by (x4, ya) and let @ be the angle between the line /(t) and the 
x-coordinate axis. Then, the position of the disk is completely described by the co- 
ordinates (x4, ya, %, W), with w denoting the angle of rotation in the plane P(t). 
Consequently, the configuration space of the system is Q = IR x R x S! x S!. The 
non-gliding condition imposes the following constraints on the system: the infinites- 
imal displacement of the touching point is Rdy and its projections onto the x- and 
y-axes are given by dx, = Reos(¢)dy and dy, = Rsin(#)dy, respectively. Thus, 
we find the velocity-dependent constraints 


x4 = Reos(o)W, ya = Rsin(p) Wp. (4.8.6) 
They correspond to the Pfaffian forms 
w' =dxa—Reos(o)dy, — u =dya — Rsin(d)dy. 


According to point 3 of Proposition 4.7.10, this system is not integrable. Thus, we 
deal with a genuinely nonholonomic system. This fact is in accordance with our 
intuition, which tells us that at any touching point A(t) the intersection line /(t) 
may have an arbitrary direction. For a detailed discussion of further examples we 
refer to Benenti [39]. 


Now, let us come back to a system of N particles with masses my and position 
vectors fy which are acted upon by external forces Fy. Assume that this system 
is subject to linear nonholonomic constraints defined by 1-forms jz“ and let A be 
the Pfaffian system generated by these forms. Let us denote the corresponding con- 
straining forces by Z,. Newton’s second law reads 


Meky =Fy+Zeg, a=l,...,N. (4.8.7) 


There is a fundamental principle, called d’ Alembert’s principle, which in geometric 
terms”! states that the constraining forces Z,, viewed as 1-forms,”* belong to the 
annihilator (A°)° = A of the constraint distribution A°. Thus, we have 


Zo = 2 Xa be 
a 
and the system of Eqs. (4.8.7) takes the form of the so-called Lagrange equations of 


the first kind: 
Moy =Fa+ > dats, (4.8.8) 


a 


?!Tn mechanics, this principle is usually spelled out by saying that the constraining forces do no 
virtual work. 


2We identify vectors and covectors on R*% via the Euclidean metric. 
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with the Lagrange multipliers A, to be determined from Eqs. (4.8.5). It sometimes 
happens that the external forces are given by a single potential function V which 
depends on both positions and velocities: 


— av id aV 
* ary dt \aty J 


ss d /0T 
m = —| — 
ve dt \ ary 


with T(r) = 5 ay Maks denoting the kinetic energy of the system and defining 
the Lagrangian function 


In this case, rewriting 


L: TR24 SR, L(tq, ty) = T (te) — V(tas ke), (4.8.9) 


we arrive at the so-called Lagrange equations of the second kind: 
d/o0L OL 
—({ — ]—-—=) Agpi. 4.8.10 
alan) Ore dX es ween 


As already noted, in the case of holonomic constraints, the dynamics reduces to the 
constraint submanifold Q given by (4.8.2). Let us choose local coordinates g' with 
i=1,...,3N —s on Q. If we assume that the so-called generalized forces 


OXq 
F;=) Fy: iq! 
Qa 


possess a potential** V, the Lagrangian equations of the second kind take the form 
d /oL OL 
—(—})-—=0, (4.8.11) 
dt \dq' 0q' 


where again L = T — V is the Lagrangian function of the system. One can cast the 
Lagrangian equations (4.8.10) and (4.8.11) into a coordinate-free form by means of 
the symplectic structure on TQ induced from the natural symplectic structure on 
T*Q, see Sect. 9.1. 

Finally, we stress that, nowadays, nonholonomic constraints constitute a huge 
field of research in mathematical physics. There are at least two important streams 
to be mentioned: 


(a) nonholonomic systems with symmetries, with the aim of understanding robotics, 
trajectory tracking, dynamic stability, feedback stabilization and control. For 
this line of research we refer to Koon and Marsden [170], where the reader will 


23R.g. if the original external forces possess a potential. 
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(b 


wm 


find a lot of further references. We also recommend the papers of Bates, Cush- 
man, Kemppainen and Sniatycki, see [35] and [70], and a paper of Marle [193] 
for an introduction to the subject, 

efforts to develop a general theory (which for the time being is still lacking) 
with mathematical techniques like jets and algebroids. For this line of research 
we refer to the paper [147] by Iglesias, Marrero, De Diego and Sosa, where the 
reader can find a lot of further references. In this context, we also mention the 
classical paper [298] by Vershik and Gershkovich. 


Chapter 5 
Lie Groups 


In this chapter, we give an introduction to the theory of Lie groups. In Sect. 5.1, 
we discuss the basic notions and provide the reader with a number of examples. In 
particular, we take up the classical groups which have already been introduced in 
Sect. 1.2. Next, in Sect. 5.2, we come to the notion of Lie algebra of a Lie group. 
We consider a number of examples, again with some emphasis on the Lie algebras 
of classical Lie groups. Section 5.3 is devoted to an important tool, the exponential 
mapping. This mapping constitutes a link between the Lie group and its Lie algebra 
which turns out to be useful both for the study of the local structure of Lie groups 
and for the study of their representations. Next, in Sect. 5.4, we discuss a number 
of important representations—the adjoint and the coadjoint representations of the 
Lie group and the corresponding derived representations of its Lie algebra. Using 
the adjoint representation of the Lie algebra, one can construct a natural symmetric 
bilinear form on the Lie algebra, the Killing form, which is invariant under the 
adjoint representation of the group. Next, in Sect. 5.5, we discuss the concept of 
left-invariant forms which in particular yields a unique (up to multiplication by a 
number) left-invariant volume form on every Lie group. The latter gives rise to the 
so-called Haar measure. We discuss the relation with Ad-invariant scalar products 
on the Lie algebra and conclude, in particular, that every compact Lie group admits 
a bi-invariant Riemannian metric. The final two sections are devoted to the theory 
of Lie subgroups and to homogeneous spaces. Concerning the latter, we discuss 
three important examples in detail: Stiefel manifolds, GrafgSmann manifolds and flag 
manifolds. 


5.1 Basic Notions and Examples 


The notion of Lie group arises naturally by combining the algebraic structure of a 
group with the differentiable structure of a smooth manifold and requiring that the 
two structures are compatible. 

For a group G and elements a,b € G, we denote the product by ab, the unit 
element by 11 (or sometimes also be e) and the inverse of a by a~!. The assignment 
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at» a7! defines a bijective mapping inv : G > G, called the inversion mapping 
of G. 


Definition 5.1.1 (Lie group) A Lie group is a set which carries the algebraic struc- 
ture of a group and the differentiable structure of a smooth manifold such that the 


mapping 
GxG—>G, (a,b) ab! (5.1.1) 


is smooth. A homomorphism of Lie groups is a mapping which is both a group 
homomorphism and smooth. 


Remark 5.1.2 


1. The inversion mapping g — g~! is smooth, because it is the restriction of the 


mapping (5.1.1) to the submanifold {1} x G. Moreover, the multiplication map- 
ping is smooth, because it can be written as the composition of (5.1.1) with the 
inversion mapping. Using the Inverse Mapping Theorem one can show that in 
Definition 5.1.1, instead of (5.1.1), it suffices to require that the multiplication 
mapping be smooth (Exercise 5.1.1). 

2. Notions associated with manifolds, like dimension, compactness, connectedness, 
tangent and cotangent bundles etc., carry over to Lie groups in an obvious way. 
To some extent, this applies to notions associated with groups, too. For example, 
a Lie group is Abelian if so is the underlying group, or the order of an element is 
the order of this element of the underlying group. There are exceptions, however, 
see Example 5.2.20. 

3. Analogously to Definition 5.1.1 one defines the notion of real and complex an- 
alytic Lie group by requiring the multiplication mapping to be real or complex 
analytic, respectively. It turns out that every Lie group is in fact real analytic. 
That is, the differentiable structure of an arbitrary Lie group G contains a unique 
real analytic structure [243]. For this reason, it is common in the literature to 
define Lie groups to be real analytic manifolds. 


Example 5.1.3 (Lie groups) 


1. Let K=R, C or H. Every K-vector space V is a Lie group with respect to its 
additive structure and the manifold structure of the real vector space obtained 
from V by field restriction. The dimension is dimp(K) - dim(V). Lie groups of 
this form are sometimes called vector Lie groups. 

2. Every finite or countable group, equipped with the discrete topology and the triv- 
ial zero-dimensional manifold structure, is a Lie group. In particular, the cyclic 
groups of finite order and the group of integers are Lie groups. 

3. Let G1, G2 be Lie groups. The direct product G1 x G2 carries the group struc- 
ture of the direct product of the underlying groups, given by the multiplication 
mapping (a1, a2)(b1, b2) = (a, b1, ab), and the differentiable structure of the 
direct product of the underlying manifolds. These two structures combine to a 
Lie group structure (Exercise 5.1.2). 
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Example 5.1.4 (Classical Lie groups) Let A be a finite-dimensional associative al- 
gebra over K = R, C. View A as a real algebra by restricting scalar multiplication 
to R. Let G C A be an initial submanifold which forms a group under the multipli- 
cation induced from A. Then, the multiplication mapping of G is the restriction in 
domain and range of the multiplication in A. Since the latter is bilinear, it is smooth. 
Hence, G is a Lie group. This argument implies that the following groups are Lie 
groups. 


1. K \ {0} for K=R, C, H, with multiplication induced from K. 

2. The unit spheres in K = R, C,H with respect to the natural Hermitian inner 
product defined by conjugation and multiplication. They are groups under the 
multiplication induced from K and coincide with the classical groups O(1) for 
K=R, UC) for K=C and Sp(1) for K = H, cf. Example 1.2.6. Hence, as 
manifolds, these classical groups are diffeomorphic to, respectively, S°, S! and 
57! 

3. The general linear group GL(n, K), because according to Example 1.1.14 it is 
open in M,,(K). The dimension of GL(n, K) is given by the dimension of M,, (IK) 
over the reals, that is, dimp(K) - n?. For n = 1, this reproduces point | above. 
By the same argument, the automorphism group GL(V) of a K-vector space V 
with composition of mappings as multiplication is open in the associative algebra 
End(V) and hence a Lie group as well. 

4. The classical groups of Example 1.2.6, because each of them can be represented 
as a level set of a smooth function on some M,, (K) at a regular value and is hence 
an embedded submanifold of the latter. 


Example 5.1.5 (Lie group homomorphisms) 


1. Let K=R or C. The determinant det : M, (IK) — K restricts to a group homo- 
morphism 


det : GL(n, K) > K \ {0}. (5.1.2) 


The latter is obtained by restriction of an n-linear mapping 
M, (K)x -"- xMn(K) > K 


in domain to the submanifold (GL(n, K), A,), where A, (a) = (a,..., a), and in 
range to the open submanifold K \ {0}. Hence, it is smooth and thus a Lie group 
homomorphism. 

2. Let V, W be vector spaces over K and let g¢ : V — W be an isomorphism. The 
mapping GL(V) — GL(W), defined by at goaog |, is a Lie group isomor- 
phism. In particular, if dim V =n, W = K” and ¢ assigns to v € V its coeffi- 
cients with respect to a chosen basis, the above mapping yields an isomorphism 


‘For the unit spheres of R and C there are simpler arguments ensuring smoothness of the multi- 
plication mapping. In the first case, the group is finite and in the second case, in terms of the angle 
coordinate, multiplication is given by addition. 
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from GL(V) onto GL(n, K), and this isomorphism assigns to an automorphism 
of V its matrix with respect to this basis. 

3. A Lie group homomorphism g : R > G is called a one-parameter subgroup. 
The image of a one-parameter subgroup is an Abelian subgroup of G. It turns 
out that all one-parameter subgroups are integral curves of certain vector fields, 
see Remark 5.3.2. 

4. By definition of the group and manifold structures on the direct product of Lie 
groups G,, G2, the natural projections G; x G2 > G;, i = 1,2 are group ho- 
momorphisms and smooth. Hence, they are Lie group homomorphisms. 


We continue with deriving some basic facts about Lie groups. For that purpose, 
note that every a € G induces the following mappings of G: 


La(b):=ab, — Ra(b):=ba, Cab) = Lg oR, -1(b) Saba!. (5.1.3) 


They are called, respectively, left translation, right translation* and conjugation by a. 
For all a, b € G we have 


La o Ly = Lap, Rg o Rp = Roa, La oRp = Ry oLg. (5.1.4) 


Thus, L, and R, are diffeomorphisms with inverse L,-1 and R,-1, respectively. For 
all a,b,c € G, we have Cg(bc) = Cqg(b)Cag(c) and Cg o Cp = Cap, that is, Cg is 
an automorphism of G for every a, and the assignment a +> C, defines a group 
homomorphism from G to the group of automorphisms of G. Automorphisms of G 
of the form C, are called inner automorphisms. 


Proposition 5.1.6 (Parallelizability) Let G be a Lie group. The mappings 
XL, XR: Gx T1G > TG, ya, X) = LX, xr(a, X):=R/X, (5.1.5) 


are vertical vector bundle isomorphisms and hence they define global trivializations 
of TG. In particular, Lie groups are parallelizable. 


Proof Let 4: G x G — G denote the multiplication mapping of G, so: G > TG 
the zero section and j :T;G — TG the natural inclusion mapping. The mapping x. 
is obtained by composing sp x j : G x T1G > TG x TG with the natural isomor- 
phism TG x TG — T(G x G) and the tangent mapping of jz. Hence, x, is smooth. 
Since it preserves the fibres, and since it is fibrewise linear and projects to idg, it 
is a vector bundle morphism. Since xy, is obviously bijective, it is a vertical vector 
bundle isomorphism. The argument for xp is completely analogous. 


According to Example 2.4.1, the vertical vector bundle isomorphisms x, and 
xr yield dual isomorphisms xt ; Xn :T*G > G x T7G. A brief calculation 


?Or left and right multiplication, respectively. 
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(Exercise 5.1.3) shows that the inverse isomorphisms Cane : OG) :GxTiG> 
T*G are given by 


(xt) '@.n)=noWg),, (x6) (an) =no R,-1),,- (5.1.6) 


The isomorphisms x, . XR ' and ree ; pea will be referred to as, respectively, the left 
and the right trivialization of TG and T*G. 


Proposition 5.1.7 A connected Lie group is generated, as a group, by any neigh- 
bourhood of the unit element. 


Proof Let U be a neighbourhood of 1 in G. The group generated by U is G:= 
Ure, U" and we have to show that G =G. Without loss of generality, we may 
assume that U is open. Then, G is open, hence G \ G is closed. On the other hand, 
since Lg is ahomeomorphism, aG is open for all a € G, hence 


G\G=\|JaG 
a¢G 


is open. Since G \ G # G and since G is connected, this implies G \ G = @. This 
yields the assertion. 


Remark 5.1.8 (Identity component) The connected component of G containing 1 
will be denoted by Go and will be referred to as the identity component of G. Let 
a € Go. Since L,-1 is ahomeomorphism of G, L,-1(Go) is a connected component. 
Since it contains 1, it coincides with Go. It follows that a~!b € Go for all a, b € Go. 
Hence, Go is a subgroup. By a similar argument, one can show that Go is normal. 
Since it is an open submanifold of G, it is a normal Lie group. 


Remark 5.1.9 Let G, H be Lie groups and let g: G > H be a group homomor- 
phism. Then, for alla € G, 


g =Ly(a) 09 oLg-t. (5.1.7) 


Since L,-1 and Lyiq) are diffeomorphisms of G and H, respectively, we obtain the 
following. 


1. For a group homomorphism of Lie groups to be a Lie group homomorphism 
it suffices to be smooth in some neighbourhood of 1. Indeed, if U is such a 
neighbourhood, then (5.1.7) implies gay = Ly) o tu ° (Lg-1) tau - 

2. For a Lie group homomorphism to be an immersion or a submersion it suffices 
to be an immersion or a submersion at 1. Indeed, differentiation of (5.1.7) at 
aéG yields 9 = (Lya))4,, ° Pig ° (Lg-1) and the statement follows, because 
(L,-1)/, and (Lea), are bijective. 


To conclude this section, we discuss a couple of specific homomorphisms of 
some classical Lie groups which are of particular relevance for physics. 
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Example 5.1.10 (isomorphism of SU(2) and Sp(1)) From Example 1.2.6, we recall 
that 


SU(2) = {a € GL(2, C): a’ =a, det(a) = 1}, 
Sp(1) = {ae H: |lal]| = 1}. 


According to Remark 1.1.13, the mapping H — M2(C), given by 


. ° ao + ia, az +1a3 
a=aol+ai+a a3kh , : : 5.1.8 
oh ei alae pan ee ( ) 
where ao, ..., a3 are real numbers, is an injective homomorphism of real algebras 


from H to M2(C). One can check that the image of Sp(1) under this homomorphism 
coincides with SU(2) (Exercise 5.1.4). Hence, by restriction, (5.1.8) induces a group 
isomorphism from Sp(1) onto SU(2) which by Proposition 1.6.10 is smooth in both 
directions and hence a Lie group isomorphism. Since Sp(1) coincides with the unit 
sphere in H, this shows in particular that SU(2) is diffeomorphic to the sphere S°. 


Example 5.1.11 (Universal coverings of SO(3) and SO(4)) Consider the isometric 
isomorphism of real vector spaces 


1:R*-> HH, A(x) = x91 + xpi + xj +.23k. (5.1.9) 


Every pair of quaternions (a, b) defines a real linear mapping Hl > HI by qr aqb 
and hence a linear mapping ¢(a, b) of R* by 


o(a,b)x:=A7!(ad(x)b) forall x eR’. (5.1.10) 


By assigning (a, b) to (a, b) one obtains a mapping ¢ : H @ H — Mg(R), where 
HH @ H denotes the direct product of algebras with multiplication defined as usual 
by (a1, bj) (a2, b2) := (aja2, b;b2). The mapping ¢@ is a homomorphism of real 
algebras and, hence, it is in particular smooth. 

First, consider o(a, a) for a € Sp(1). Due to |lal|? = 1, ¢(a,a) is orthogonal. 
Since it leaves invariant the subspace R x {0} C IR*, it also leaves invariant the 
subspace {0} x R* C R*. Hence, restriction of @ in domain to the submanifold 
{(a, a) : a © Sp(1)} of H @ H and in range to the embedded submanifold of Ma(R) 


consisting of the matrices 
1 0 
lo "| , e€€O(3), 


yields a Lie group homomorphism (denoted by the same symbol) ¢ : Sp(1) — O(3). 
The defining equation (5.1.10) reduces to 


o(a)x:= A! (a(xyi+ x2j +.x3k)a) for allx e R*. (5.1.11) 
For a= ao1 + a1i+ aj + a3k, an explicit calculation yields 


[o(a)],, = (a9 — a} — a3 — 45) 8;j + 2(aaj — aneijxag): (5.1.12) 
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We determine the kernel and the image of ¢. According to (5.1.11), a € ker(@) 
iff a commutes with all quaternions. Then a= al, a € R, hence ker(@) = {+1}, 
the centre of Sp(1). To find im(@), one first shows that @ is an immersion (Exer- 
cise 5.1.5). Then, since Sp(1) and SO(3) have the same dimension, ¢ is a submer- 
sion and hence, by Remark 1.5.16, an open mapping. See Exercise 5.1.6 for an alter- 
native argument proving openness. It follows that im(@) contains a neighbourhood 
of 1 in O(3). Since im(@) is a subgroup, Proposition 5.1.7 implies that it contains 
the identity component SO(3) of O(3). Since Sp(1) is connected, we finally obtain 
im(@) = SO(). 

Note that since Sp(1) coincides with the unit sphere in H and since ker(@) = 
{1} implies that the preimage of an element of SO(3) under ¢ consists of antipodal 
points, ¢ induces a bijection from the projective space RP* onto SO(3). Since both 
¢ and the projection S* —> RP? are submersions, Remark 1.5.16 yields that this 
bijection is in fact a diffeomorphism.* 

Next, consider #(a, b) for a, b € Sp(1). Due to |lal|? = ||b||? = 1, d(a, b) is or- 
thogonal. Hence, by restriction in domain and range, @ induces a Lie group ho- 
momorphism (again denoted by the same symbol) @: Sp(1) x Sp(1) — O(4) with 
defining equation (5.1.10). Arguing as before one finds ker(@) = {(1, 1), (—1, —D} 
and im(@) = SO(4) (Exercise 5.1.7). 

Let us add that via the algebra homomorphism (5.1.8), all of the above has an 
equivalent formulation in terms of two-dimensional complex matrices. This formu- 
lation is obtained by replacing Sp(1) by SU(2), H by the real vector space spanned 
by 12 and the traceless skew-Hermitian matrices, equipped with the scalar product 
(A, B) = 5 tr(A‘B), and the subspace of H spanned by i, j and k by the subspace 
of traceless skew-Hermitian matrices. 


Remark 5.1.12 It remains to explain why this example runs under the name of 
universal covering. Let G and G denote either SO(3) and Sp(1) or SO(4) and 
Sp(1) x Sp(1). Since ¢: G = G is both an immersion and a submersion, by the 
Inverse Mapping Theorem, it is a local diffeomorphism. Since the preimages of 
points of G consist of two points in G, ¢ is a two-fold covering of G. Since G, as 
(a product of two copies of) a 3-sphere, is simply connected, it follows by covering 
theory that @ is universal in the sense that for any covering y : H — G there exists 
a covering Vv: G > H such that wo v= ¢. If y is in addition a Lie group ho- 
momorphism, then so is w. Thus, ¢ is universal among the coverings of G by Lie 
group homomorphisms. It is, therefore, referred to as the universal covering homo- 
morphism of G, and G as the universal covering group of G. The universal covering 
plays a crucial role in the representation theory of Lie groups, one of the reasons be- 
ing that the set of representations of G contains the set of representations of G as a 
subset, see Exercise 5.1.8. 


3The relation between SO(3) and RP? can be made still more explicit by characterizing rotations 
by an axis and by an angle of rotation about this axis between 0 and zr. We encourage the reader 
to verify that this yields an identification of SO(3) with the closed ball of radius 7 in R? with 
antipodal points on the boundary identified. 
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Example 5.1.13 (Universal covering of SO(3, 1)9) The construction is essentially 
analogous to that for SO(3) and SO(4). Details are left to the reader as Exer- 
cise 5.1.7. Let S2(C) denote the real vector space of two-dimensional complex Her- 
mitian matrices. The mapping 
. 14 — | xotx = 1x2 +3 
4: R* > S2(C), A(x) = ee bares 
is an isomorphism of real vector spaces. It is isometric with respect to the quadratic 
forms x b> if a te ae on R* and At det(A) on S2(C). For every a € 
M>(C), the assignment A bh aAa' defines a linear mapping of S2(C) and hence a 
linear mapping ¢(a) of R* by 


b(a)x:= A! (ad(x)a'). 


The assignment a +> ¢(a), in turn, defines a mapping ¢ : M2(C) — Ma(R). This 
mapping is real homogeneous of degree two, hence smooth, and satisfies (ab) = 
(a) (b). Since for every a € SL(2, C) we have det(a Aa‘) = det(A), ¢ restricts to 
a Lie group homomorphism (denoted by the same symbol) ¢ : SL(2, C) > O(3, 1). 
By similar arguments as in Example 5.1.11 one finds ker(¢) = {+1} (the centre 
of SL(2, C)), im(@) = SO(3, 1)o (the identity component of SO(3, 1))* and that ) 
is a covering homomorphism. To prove that it is universal one has to check that 
SL(2, C) is simply connected. To see this, one may use that polar decomposition 
yields a diffeomorphism U(2) x S2(C) > GL(2, C), given by (a, A) + ae“, see 
Exercise 5.1.9. Restriction of this diffeomorphism to the submanifold SU(2) of U(2) 
times the subspace of traceless elements of S2(C) induces a diffeomorphism S3 x 
IR? — SL(2, C). Hence, ¢ is the universal covering homomorphism and SL(2, C) is 
the universal covering group of SO(3, 1)o. 

Let us add that under the isomorphism between Hermitian matrices and skew- 
Hermitian matrices given by multiplication by i, the restriction of @ to the subgroup 
SU(2) Cc SL(, C) reproduces the Lie group homomorphism Sp(1) — SO(3) of Ex- 
ample 5.1.11, transported to M2(C) according to the remarks made there. 


Exercises 

5.1.1 Use the Inverse Mapping Theorem to show that in Definition 5.1.1 it suffices 
to require that the multiplication mapping be smooth. 

5.1.2 Prove that the direct product of Lie groups as defined in Example 5.1.3/3 is a 
Lie group. 

5.1.3 Verify that the mapping (5.1.6) is the inverse of the dual of the vector bundle 
isomorphism (5.1.5). 

5.1.4 Show that the image of Sp(1) under the mapping (5.1.8) coincides with 
SU(2). 


4Tn physics, the elements of SO(3, 1)o are referred to as proper orthochronous Lorentz transforma- 
tions. 
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5.1.5 
5.1.6 


5.1.7 


5.1.8 


Prove that the mapping ¢@ of Example 5.1.11 is an immersion. 
Show that {R € SO(3) : tr(R) 4 —1} is a dense subset whose elements fulfil 
R= ¢(a), with 


_ (+ tr(R))1 + (R23 — R32)i+ (R31 — Ris)j + (Riz — Rak 
: 2./T + tr(R) 


Use this to prove that the Lie group homomorphism ¢ : Sp(1) — O(3) of 

Example 5.1.11 is an open mapping. 

Hint. Write the defining equation for R = ¢(a) in the form yy Ruiti 

ata, where tT; =i, t2 = j, and t3 = k and use the relation ys TIQT] 

q — 4qo1 for all q € H. 

This exercise complements Examples 5.1.11 and 5.1.13. 

(a) Prove (5.1.12). Use this formula to compute ¢ (cos 51 + sin 5q) for q= 
i,j, k. 

(b) Fill in the details for the universal covering homomorphisms of Exam- 
ples 5.1.11 and 5.1.13. 

Let G, G and H be Lie groups and let ¢: G—>Gbea covering homomotr- 

phism, see Remark 5.1.12. Show that Lie group homomorphisms wy : G > H 

correspond bijectively to Lie group homomorphisms w : G > H satisfying 

ker(@) C ker(w). (This applies in particular to representations, i.e., Lie group 

homomorphisms G + GL(V), where V is some K-vector space.) 

Show that the mappings U(n) x S,(C) > GL(n, C) and O(n) x S;,(R) > 

GL(n, R), given by (a, A) Bb ae“, are diffeomorphisms. (The inverse map- 

pings are referred to as polar decomposition of GL(n, C) and GL(n, R), re- 

spectively.) 

Hint. Show that a and A can be reconstructed from b = ae“ by means of 

-1 

the formulae A = InVb*b and a= bVb*b . Moreover, prove that the as- 

signment of (a, A) to b is smooth. The square root Vbib is defined by the 

condition that it acts on the eigenspaces of b‘b as multiplication by the posi- 

tive square root of the corresponding eigenvalue of b‘b. 


Sp(1). 
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In this section, we will construct the Lie algebra associated with a Lie group. For 
the notion of Lie algebra and Lie algebra homomorphism, see Definition 3.1.2. Let 
a Lie group G be given. 


Definition 5.2.1 (Left-invariant vector fields) A vector field X on G is called left- 
invariant if L,,X = X forallaeG. 


Written pointwise, the defining condition reads L/,X» = Xqp for alla, be G. 
Due to (5.1.4), it is equivalent to 


X,=LX1 forallaeG. (5.2.1) 
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Proposition 5.2.2 The left-invariant vector fields on G form a Lie subalgebra 
of X(G). 


Proof This follows from the linearity of the transport operator and Proposi- 
tion 3.1.5/4. 


Definition 5.2.3 The Lie subalgebra of X(G) of left-invariant vector fields is called 
the Lie algebra of G. 


In the following, we will write g for the Lie algebra of G and X,Y,... for its 
elements.” 


Proposition 5.2.4 The mapping g > T1G, defined by X > X4, is an isomorphism 
of real vector spaces. 


Proof The mapping is linear. Due to (5.2.1), it is injective. To see that it is surjec- 
tive, let X; € T1G and let xz denote the left trivialization of TG, see (5.1.5). The 
mapping G > TG, defined by at x (a, Xz), is a left-invariant vector field which 
takes the value Xq at 1. 


Remark 5.2.5 


1. Combining the natural isomorphism g — T1G with the identical mapping of G 
and composing this with the left and right trivializations (5.1.5) of TG, one can 
express these trivializations in terms of left-invariant vector fields: 


XL: XR:GxXg—> TG, xL(a,X)=Xq, xr(a,X)= C)-1Xa. (5.2.2) 


2. The proofs of the following statements are left to the reader (Exercise 5.2.1). Let 
g be a Lie algebra over K and let {e1,..., e,} be a basis of g. Expansion of the 
commutators of the basis elements, 


k eS 
ler,ejlacyerk, iG=l,....n, 
(summation convention) yields n> elements rae of K, called the structure con- 


stants of g relative to the given basis. In terms of the structure constants, the 
defining properties of a Lie algebra, cf. Definition 3.1.2, read as follows. 


Anticommutativity: cH + ci, =O for alli, j,k =1,...,n. 
Jacobi identity: cf,cf) + chef + chjeg, = 0 for all i, j,,m=1,...,n. 


Conversely, every system e . of elements of KK with these properties defines a Lie 
algebra. If g and h are Lie algebras over K and if one can find a basis in g and 


5Starting from the next chapter, we will preferably write A, B,... for the elements of the Lie 
algebra of a Lie group, because we will often deal with situations where further vector fields occur. 
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a basis in § such that the structure constants of g and 6 relative to these bases 
coincide, the assignment of the appropriate basis vectors to one another defines 
a Lie algebra isomorphism between g and b. 


Example 5.2.6 (Classical Lie groups) Let V be a finite-dimensional K-vector space 
and let G Cc End(V) be a classical Lie group. Let g be the Lie algebra of G. We de- 
termine the smooth mappings X : G > End(V) which correspond to the elements 
of g under the natural representation of vector fields on G of Remark 2.7.15. Ac- 
cording to Proposition 3.2.13/3, left-invariance implies 


@* (a) = &* oLg(1) =Ly 0 O* (1) =a GX (1) 


and hence 
d 


d 
X@=— ao =— ae) —aX(1). 


Hence, the elements of g are represented by the mappings 
X4:G— End(V), X4(a)=aA (5.2.3) 


with A €T,G C End(V). Via this representation of g, the assignment At» X4 co- 
incides with the inverse of the natural isomorphism of Proposition 5.2.4. We com- 
pute the commutator in g in terms of the mappings X“. Since the equation for the 
integral curves of X4 is y(t) = y(t)A, the flow of X4 is given by 


@*"(a)=ae'4, aeG. (5.2.4) 
Using (3.1.4), for A, B € Tz G anda € G we obtain 
[x4, x](a) =alA, B]. 


This yields 
[x4,x?]=x!481 forall A, BeET1G. (5.2.5) 


Evaluation of (5.2.5) at a = 1 shows that, in particular, T;G is closed under the 
commutator of endomorphisms. Let gl(V) denote the real vector space underly- 
ing End(V), equipped with the commutator of endomorphisms as a multiplication. 
Then, gl(V) is a real Lie algebra, T1G is a Lie subalgebra and (5.2.5) means that 
the natural isomorphism T;G — g of Proposition 5.2.4 is an isomorphism of Lie 
algebras. Henceforth, we will identify g with the Lie subalgebra T1G of gl(V). To 
conclude, we take V = K” and list the subspaces T1; G = ker(f;) of gl(n, K) for 
the classical groups of Example 1.2.6.° Computations are left to the reader (Exer- 


Several of these Lie algebras are in addition, and more naturally, complex Lie algebras, notably 
gl(n, C) itself and s(n, C). This corresponds to the fact that the respective classical Lie groups are 
in addition, and again more naturally, complex analytic Lie groups. 
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cise 5.2.2). In the following, K = R or C. 
s(n, K) = {Ae M,(K): trA=0}, 
o(n,m) = so0(n,m) = {A € Mn4m(R): InmA +A? Inm =O}, 
o(n,C) =s0(n,C) = {A eM, (C): A+ A’ =O}, 
u(n,m) = {A € Mn4m(C): InmA + A‘ 1pm =O}, 


su(n,m) = {A © Mnsm(C): Injm A+ A‘tam =0,trA= o} ’ 


{ 
sp(n, K) = {A € Mon (RK): AT Jn + Jn A =O}, 
sp(n,m) ={AeM, m (HH): nmA+ At elle 


Remark 5.2.7 The vector field X4 of Example 5.2.6 is the restriction to G of 
the linear vector field on End(V) which is generated by the linear mapping Ry : 
End(V) > End(V), R4(B) := BA. According to Example 3.2.8, the flow of this 
vector field is given by the family of vector space automorphisms e’®4 of End(V), 
where ¢ € R. A brief computation shows e! Ra(B) = Be! for all B € End(V). For 
B =a, this reproduces (5.2.4). 


Example 5.2.8 From Example 5.2.6 we read off that the Lie algebras sp(1), su(2) 
and so0(3) are spanned, respectively, by the bases 


1 1 1 
u_l, wl. H 
50 I= 55: haa 
c_tfi 0 cif | e_tfo i 
re" 2|0 -i}’ 2 9}-1 OF’ oO O)? 
0 1 0 0 0 -!l 0 oO O 
R=|-1 0 of, B=|0 0 oO], B=/0 0 1 
0 oO 0 1 0 O 0 -l1 O 


Since these bases satisfy 


[Si], K=R,C.H, 


they define isomorphisms between these Lie algebras. In addition, since the vector 
product in R? is given by 


ej x ej = bij «ek, 


these isomorphisms relate the respective Lie bracket to the vector product in R?. 
Thus, the Lie algebras sp(1), su(2) and so(3) are isomorphic to one another and to 
the Lie algebra (IR?, x). In particular, for further use we note the explicit form of 
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this isomorphism for so0(3), 


0) x! — x? 
Q: Rs so(3), XP g(x) := —x! 0 x? |. (5.2.6) 
x? =z" 0 
Then, 
g(x x y)=[9(%), 9(y)], x-y=—tr(y@ey)). 6.2.7) 


Remark 5.2.9 We comment on the proof of the parallelizability of the spheres S! 
and S3, cf. Proposition 2.3.17. Identifying S! with U(1) and TU(1) with a submani- 
fold of U1) x C, the Lie algebra u(1) =T;U(1) corresponds to the imaginary axis 
and the vector field used in the proof of the parallelizability of S! is the left-invariant 
vector field generated by the complex imaginary unit i. Similarly, S* = Sp(1) and 
the vector fields used in the proof of the parallelizability of S* are the left-invariant 
vector fields generated by the quaternionic imaginary units i, j and k. 


Example 5.2.10 (Vector groups) Let V be a K-vector space, viewed as a Lie group. 
A computation analogous to that for the general linear group of Example 5.2.6 
yields that, via the natural representation of vector fields on V by smooth map- 
pings V — VJ, left-invariant vector fields correspond to constant mappings and that 
the commutator is trivial (Exercise 5.2.4). Hence, via this representation, the natural 
isomorphism of Proposition 5.2.4 identifies the Lie algebra of V with the real Lie 
algebra whose underlying vector space is obtained from V by field restriction and 
whose multiplication is trivial. 


Example 5.2.11 (Discrete groups) Let G be a discrete Lie group. Since the manifold 
structure of G has dimension zero, the tangent bundle has dimension zero as well. 
Hence, the only vector field is the zero section, which is obviously left-invariant. 
Thus, the Lie algebra of G is given by {0} with the obvious multiplication. 


Next, we show that every homomorphism of Lie groups induces a homomor- 
phism of the associated Lie algebras. Let G and H be Lie groups with Lie algebras 
g and 5, respectively, and let g: G — H be a Lie group homomorphism. Due to 
y(1) =1, gy’ maps T1G to TH. Hence, composition with the natural vector space 
isomorphisms g + T1G and h — T,H of Proposition 5.2.4 yields a linear mapping 
dg : g — 5, given by 


(dg(X)) , :=L,0og'(X1) forallbe H. (5.2.8) 


Remark 5.2.12 Since g'Xq = 9g! oL/,(X1) = Lia) 0 g'(X1) = (dg(X))gca), the im- 
age dg(X) is the unique left-invariant vector field on H which is g-related to X. 


Proposition 5.2.13. The mapping dg is a homomorphism of Lie algebras. 
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The mapping d¢ is called the homomorphism of Lie algebras induced by ¢. 
Proof Let X,Y € g. Since dg(X) and dg(Y) are g-related to X and Y, respec- 


tively, Proposition 3.1.5/2 yields that [dg(X), dg(Y)] is g-related to [X, Y]. Since 
[dg(X), dg(Y)] is left-invariant, Remark 5.2.12 implies 


[dy(X), dp(Y)] = dg(LX, Y]). 


Remark 5.2.14 


1. For Lie group homomorphisms g : G > H and yw: H — K we have 
d(y ogy) =dyw odg, didg = idg. (5.2.9) 


2. If g is anisomorphism, d¢ is bijective and hence an isomorphism of Lie algebras. 
Moreover, since dy(X) is g-related to X, then dg(X) = g,X for all X € g. 


Example 5.2.15 (Classical Lie groups) Let G and H be classical Lie groups. Un- 
der the natural identification of the Lie algebras of the classical Lie groups with 
Lie subalgebras of gl(V) for appropriate K-vector spaces V, see Example 5.2.6, 
the induced homomorphism dg : g > h coincides with the tangent mapping 94. In 
particular, this implies that the latter respects the commutator of vector space endo- 
morphisms. This way, for the Lie group homomorphisms of Sect. 5.1 one obtains 
the following induced homomorphisms (Exercise 5.2.5). 


1. For the natural inclusion mapping j : G > GL(V), the induced homomorphism 
dj is given by the natural inclusion mapping g > gI(V). 

2. For the matrix representation GL(V) — GL(n, K) associated with a basis in V, 
the induced homomorphism gl(V) — gl(n, KK) is given by the corresponding 
matrix representation of endomorphisms. 

3. For the determinant homomorphism det : GL(n, K) > K \ {0}, where K= R, C, 
the induced homomorphism d det : gl(n, K) > K is given by the trace, 


ddet=tr. (5.2.10) 


4. The Lie algebra isomorphism sp(1) — su(2) induced by the Lie group isomor- 
phism Sp(1) > SU(2) of Example 5.1.10 coincides with the isomorphism de- 
fined in Example 5.2.8 in terms of bases. 

5. For the covering homomorphisms Sp(1) — SO(3), Sp(1) x Sp(1) — SO(4) of 
Example 5.1.11 and SL(2, C) > SO(3, lo of Example 5.1.13, the induced ho- 
momorphisms are bijective and hence isomorphisms. For the first one, the in- 
duced isomorphism coincides with the isomorphism of Example 5.2.8. For the 
second one, under the identification R* = H of Example 5.1.11, we find that the 
induced isomorphism is given by 


dg(A, B)qg=Aq—qB, A,Besp(l), qeH. (5.2.11) 
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Example 5.2.16 (Direct product) Let G; and G2 be Lie groups with Lie alge- 
bras g; and gz, respectively. The Lie algebra of the direct product of Lie groups 
G = G, x Gz is isomorphic to the direct product of Lie algebras g; © g2 with mul- 
tiplication 


[(X1, X2), (V1, Y2)] = ([X1, Yi, [X2, Yo), Xi, Yi eg. 

Under this identification, for a pair of Lie group homomorphisms ¢g; : G; > Hj, 
i= 1,2, we have 

d(gi x 92) = dg @ dgz. (5.2.12) 
Proofs are left to the reader (Exercise 5.2.7). 
Example 5.2.17 (Vector groups) Let V and W be K-vector spaces, viewed as Lie 
groups, and let g: V — W be a Lie group homomorphism. Then, ¢ is continuous 
and additive. Since every continuous additive mapping of real vector spaces is in 
fact linear (Exercise 5.2.6), g induces a linear mapping of the real vector spaces 


underlying V and W. Under the identification of the Lie algebras of V and W with 
these real vector spaces, dg coincides with ¢. 


Remark 5.2.18 There exist several isomorphisms between the classical Lie algebras 
in low dimensions. The isomorphism between SU(2) and Sp(1) of Example 5.1.10 
and the covering homomorphisms of Examples 5.1.11 and 5.1.13 yield 


s0(3) = su(2) = sp(1), 50(4) = su(2) @ su(2), 50(3, 1) =s(2, ©), 


cf. Examples 5.2.15/4 and 5.2.15/5. Analogously to the first two isomorphisms one 
constructs isomorphisms 


s0(3, C) =sl(2, ©), 50(4,C) =sl(2, C) Gsl(2, ©), 


respectively. To prove the latter, for example, one uses the isomorphism 


+iz1 9 22 +123 
F:C4+End(C?), 9 F@) :=| “7” hs 
Pe) (@) bake Zo — 121 


and considers the mapping 
gy: SL(2,C) x SL(2,C) > Ma(C), g(a, b)z:= F7!(aF(@)b“'). 


Using that det F(z) is a non-degenerate quadratic form on C+, one shows that y 
is the universal covering homomorphism of SO(4, C). Then, the induced Lie alge- 
bra homomorphism yields the desired isomorphism. Alternatively, this isomorphism 
follows by complexification of the isomorphism so0(4) = su(2) @ su(2). By similar 
arguments [291], one obtains 
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so0(2, 1) =sudi, 1) =slQ2,R), 

$0(2, 2) = sl(4, R) = sl(2, R) @ sl(2, R), 
$0(5) = sp(2), 
50(6) = su(4), 

50(6, C) =sl(4, C), 

g0(2, 4) = su(2, 2). 


For occasional further use, we state the definitions of the following special types 
of Lie algebras and Lie groups [127, 145, 149]. Recall that an ideal in a Lie algebra 
g is a linear subspace i such that [1, g] C1. 


Definition 5.2.19 A Lie algebra g is called 


1. simple if it is not Abelian and if it does not contain a nontrivial ideal, 

2. semisimple if it does not contain a nonzero Abelian ideal, 

3. solvable if for some n, the subspace g™, defined recursively by g&+) = 
(go, g®] and g = g, satisfies g” = {0}, 

4. nilpotent if for some n, the subspace gi,), defined recursively by g(x41) = 
[9. 9] and gio) = g, satisfies gin) = {0}. 


A Lie group G is called, respectively, simple, semisimple, solvable or nilpotent if 
its Lie algebra is simple, semisimple, solvable or nilpotent. 


Since ideals of g are in particular Lie algebras, these notions apply to them as 
well. Using the Jacobi identity, by induction on n one can show that the subspaces 
g” and Gin) are ideals of g. It follows that every nonzero solvable or nilpotent ideal 
contains a nonzero Abelian ideal. Hence, a Lie algebra is semisimple iff it does not 
have nonzero solvable ideals, or iff it does not have nonzero nilpotent ideals. 


Example 5.2.20 The classical Lie groups SL(n, K) for n > 2, SU(m) for n > 2, 
SO(n) for n ~ 1,2,4 and Sp(m) for n => 1 are simple. Note that, except for SO(n) 
with n odd, none of these groups is simple in the sense of ordinary group theory, 
because each of them has a nontrivial centre and hence possesses a nontrivial normal 
subgroup. According to Remark 5.2.18, SO(4) is semisimple but not simple. The 
subgroup of GL(7, KK) of upper triangular matrices is solvable. The subgroup of 
GL(n, K) of strictly upper triangular matrices is nilpotent. Finally, for n > 2, as Lie 
algebras 


gl, K) =K@slm, RK), u(n) =iR @ su(n), 


where K and iR carry the trivial multiplication. Hence, GL(n, K) and U(n) are 
neither semisimple nor solvable or nilpotent. Proofs are left to the reader (Exer- 
cise 5.2.8). 


We conclude this section with a remark on right-invariant vector fields. 
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Remark 5.2.21 Similarly to left-invariant vector fields, by the condition R,,.X = X, 
one defines right-invariant vector fields on G and shows the following (Exer- 
cise 5.2.9). 


1. 


2. 


Equivalent conditions for a vector field X to be right-invariant are R!, Xp, = Xa 
for alla,b € G and Xg =R/, Xz forallaeG. 

The right-invariant vector fields on G form a Lie subalgebra of X(G). The as- 
signment X +» Xy defines an isomorphism of real vector spaces from this sub- 
algebra onto T1G. Via this isomorphism, the left and right trivializations of TG 
can be expressed in terms of right-invariant vector fields by 


x(a,X)=CXq, —-xr(a, X) = Xa. (5.2.13) 


. Let inv: G > G be the inversion mapping. Since Ly o inv = inv oR, -1 and 


inv, = —idy,G, the negative of the transport operator inv, maps the left and 
right-invariant vector fields which share the same value at 1 onto one another. 
Thus, —inv, defines an anti-isomorphism between the Lie subalgebras of left- 
invariant and right-invariant vector fields. 


. If Gis aclassical Lie group contained in End(V), right-invariant vector fields on 


G correspond to mappings a b> AX (a) = Aa with A € End(V). One can check 
that 4X = — inv, X4. This implies [4X, 2X] =[B, A]. 


Exercises 
5.2.1 Prove the statements of Remark 5.2.5/2. 
5.2.2 For the classical Lie groups H C M, (IK) of Example 1.2.6, determine the sub- 


spaces T; H C M,,(K). By a direct computation, check that these subspaces 
are Lie subalgebras. Compare your result with the list of classical Lie algebras 
in Example 5.2.6. 


5.2.3, Construct bases for the classical Lie algebras of Example 5.2.6 and determine 


the corresponding structure constants. 


5.2.4 Carry out the computations necessary for Example 5.2.10. 
5.2.5 Complete Example 5.2.15 by computing the induced homomorphisms of Lie 


algebras for the Lie group homomorphisms of Sect. 5.1 involving classical 
Lie groups. 


5.2.6 Show that every continuous additive mapping of real vector spaces is linear. 


Hint. Use that additivity implies linearity over the rationals. 


5.2.7 Prove the assertions of Example 5.2.16. 
5.2.8 Prove the assertions of Example 5.2.20. 
5.2.9 Verify the properties of right-invariant vector fields stated in Remark 5.2.21. 


5.3 The Exponential Mapping 


In this section, we will construct the exponential mapping associated with a Lie 
group. Via this mapping, the algebraic structure of the Lie algebra encodes the local 
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structure of the Lie group. This way, a considerable part of the theory of Lie groups 
can be reduced to the study of Lie algebras. Let G be a Lie group and let g be the 
Lie algebra of G. 


Proposition 5.3.1 Every left-invariant vector field on a Lie group is complete. 


Proof Let X € g and let 6: Y— G be the flow of X. According to Proposi- 
tion 3.2.13/3, left-invariance implies L,(Y,) = Y for all a € G and t ER. Hence, 
for every t, either YZ, = G or Y, = @. Since J is open in G x R, there exist t+ > 0 
and t_ <0 such that Y,, is nonempty and hence coincides with G. Then, ®;, (1 
is defined and, as an element of G, it belongs to Y,,, so that ®,, (1) is define 
Iteration of this argument yields that Y, A @ and hence Y, = G for all t € R. Thus, 
X is complete. 


ao 


Remark 5.3.2 Let X € g and let ® denote the flow of X. By left-invariance, for all 
t,s € R we have 


Pi45(1) = O; 0 Ps (1) = O; oLe, a) (1) = Le, a) o (1) = 5 (1) @; (1). 


Thus, the maximal integral curve through 1 defines a Lie group homomorphism 
R — G, that is, a one-parameter subgroup of G. It is not hard to show that, con- 
versely, every one-parameter subgroup of G is the maximal integral curve through 
1 of a unique left-invariant vector field on G (Exercise 5.3.1). Thus, one-parameter 
subgroups are in bijective correspondence with left-invariant vector fields. 


Definition 5.3.3 (Exponential mapping) The exponential mapping of G is defined 
by 


exp: g—> G, exp(X) = &* (1), 


where @%* is the flow of X. 


If necessary, we will write expg to distinguish between the exponential mappings 
of different Lie groups. 


Remark 5.3.4 
1. Since the zero vector field has the trivial flow p? = idg for allt € R, 
exp(0) = 1. (5.3.1) 


2. Let X € g and let us consider the mapping f + exp(tX). Due to the scaling 
property (3.2.15) of vector fields, for all t € R we have 


exp(tX) = 6* (1), (5.3.2) 
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that is, the mapping t + exp(tX) yields the maximal integral curve of X 
through 1. Since, according to Remark 5.3.2, the latter is a group homomor- 
phism, for all t, s € R we obtain 


exp((t + 5)X) = exp(tX) exp(sX), exp(—tX) = (exp(tX)) (5.3.3) 


Furthermore, by left-invariance, (5.3.2) yields ox (a) =lLg o ox (1) = 
aexp(tX), hence 


OE = Resptx). (5.3.4) 
Repeated application yields 
DY 0 OX (a) = Rexp(sy) 0 Rexp¢x) (a) = a exp(tX) exp(sY). (5.35) 


3. Let X be a right-invariant vector field on G and let inv : G > G denote the 
inversion mapping. According to Remark 5.2.21, inv, X is left-invariant. Hence, 
Proposition 3.2.13, together with (5.3.3) and (5.3.4), implies 


@X (a) = invoG}"** 0 inv(a) = exp(—t inv, X)a. (5.3.6) 


That is, the flow of X is given by left translation by exp(—t inv, X). In particular, 
X is complete. 


We derive the basic properties of the exponential mapping. 
Proposition 5.3.5 The exponential mapping of G is smooth. By restriction, it in- 
duces a diffeomorphism from an open neighbourhood of 0 in g onto an open neigh- 


bourhood of 1 in G. 


Proof Consider the vector field Y on G x g defined by ¥ (a, X) := (Xq, 0). Its flow 
is given by @;(a, X) = (ox (a), X), where ®* denotes the flow of X. Thus, 


exp(X) = 07 (1) = ptg o $1 (1, X), 
where prg : G x g > G is the natural projection. Hence, exp is smooth. To see that 


it restricts to a diffeomorphism between open neighbourhoods of 0 in g and 1 in G, 
we compute the tangent mapping at 0 € g. Using (5.3.2), we obtain 


EXP) X = = exp(tX) = - @*(1)=Xi1, Xeg, 
dt fo dt fo 


that is, exp) coincides with the natural isomorphism g — TG of Proposition 5.2.4. 
Now, the assertion follows from the Inverse Mapping Theorem 1.5.7. 


Proposition 5.3.6 For a Lie group homomorphism g : G — H, one has 


Y~ 0 €XPg = exp, odg. 
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Proof Let X € g. According to Remark 5.2.12, dy(X) is g-related to X. According 
to Proposition 3.2.13/1, for all t € R we get 


BY 9g = G0. 


Evaluation of both sides for t = 1 at 1g yields the assertion. 


Corollary 5.3.7 For a Lie group homomorphism g : G > H, the following is equiv- 
alent. 


1. g is an immersion (submersion). 
2. g has discrete kernel (is open). 
3. dg is injective (surjective). 


In particular, if a Lie group homomorphism is surjective and has discrete kernel, 
it is a covering homomorphism, see Examples 5.1.11 and 5.1.13. 


Proof | = 2: If g is a submersion, this is due to Remark 1.5.16. If g is an im- 
mersion, the Constant Rank Theorem 1.5.11 implies that every point of G has a 
neighbourhood on which @¢ is injective. In particular, every element of ker(g) has a 
neighbourhood containing no other element of ker(g). Thus, ker(¢) is discrete. 

2 => 3: First, assume that g has discrete kernel. If dy was not injective, there 
would exist a nonzero A € g such that dg(A) = 0. Then, Proposition 5.3.6 would 
imply that 


ly =expy (dg(tA)) = v(expg (tA)) 


and hence expg (tA) € ker(g) for all t € R. Since t +> expg(tA) is a non-constant 
curve through 1g and since expg is a local diffeomorphism, this would contradict 
the assumption that ker(g) be discrete. 

Now, assume that ¢ is open. Since im(expg) is open, then g(expg (g)) and hence 
exp, (dg(g)) is open in H. Since exp, is a local diffeomorphism, dg(g) must con- 
tain an open subset. Hence, dg is surjective. 

3 = 1: By left-invariance, for a € G and X € g, we have g/, Xa = (dg(X)) gia). 
Hence, if dg is injective (surjective), then g is an immersion (submersion). 


Corollary 5.3.8 For a homomorphism of Lie groups to be an isomorphism it suffices 
to be bijective. 


Proof If @ is bijective, then Corollary 5.3.7 implies that it is an immersion and a 
submersion. It follows that it is a diffeomorphism (by the Inverse Mapping Theo- 
rem 1.5.7) and that its inverse is a group homomorphism (by elementary algebra). 


Finally, Propositions 5.1.7 and 5.3.6 imply 


Corollary 5.3.9 Let g, / : G — H be Lie group homomorphisms. If dg = dy, then 
gy and w coincide on the identity component of G. 
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Proposition 5.3.10 For X, Y € g such that [X, Y] =0 one has 
exp(X) exp(Y) = exp(Y) exp(X) = exp(X + Y). 


In particular, if g is Abelian, exp is a Lie group homomorphism from the vector 
group underlying g to G. 


Proof It suffices to show that the smooth curve y : R > G, defined by y(f) := 
exp(tX) exp(tY), is an integral curve through 1 of the vector field X + Y. Indeed, 
y (0) = 1. By (5.3.5), 


d 
y= Ao o ®*(1)). 


According to Proposition 3.2.15, the flows of X and Y commute. Using this and the 
product rule (2.2.8), one obtains 


a ee pti ©. oh wate 
y(t)= P,., 0 P; (1)+ P7450, (1) = Vy + Xy~- 
ds fo ds fo 


If X and Y do not commute, one has the following approximate formula. 


Proposition 5.3.11 For all X,Y € g there exists ¢ > 0 such that for all |t| < , 
1 
exp(tX) exp(tY) = exp (rx +Y)+ 5 LX, Y|+ 0), (5.3.7) 


where O(t*) is the value of a smooth mapping (—s, €) > g such that O(t?)/t? is 
bounded. 


Proof Let X,Y € g. By Proposition 5.3.5 and by continuity of the multiplication 
mapping, there exists e > 0 and a unique smooth curve Z : (—€, €) — g such that 
for all |t| < ¢ one has exp(Z(t)) = exp(tX) exp(tY). Then, for all f € C°(G) and 
for all |t| < ¢, we have 


f (exp{Z(t)}) = f (exp(tX) exp(tY)). (5.3.8) 


We consider the Taylor expansion of both sides up to second order. On the one hand, 
Taylor expansion of Z(t) att = 0 yields Z(t) =tZ,+ EL + O(t?) with Z; = Z(0) 
and Zz = Z(0). Writing 


F(exp{Z@}) = # (exe e(Z: +524 o(*)) ! 


Zi +5 Z.+0(s" 


) 
= £(%; 1)) ar 
and applying the Taylor formula (3.2.13) for manifolds, we obtain 


s=t 


2 
Flexe{ZO}) = FA) + (Zi N)D + 5 (22+ ZN) + O(P). 
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On the other hand, by (5.3.5) and the iterated Taylor formula (3.2.14) for manifolds, 


f (exp(tX) exp(tY)) 
12 
=f@)+t{(X+Yf}+ a l(e+2x¥+¥? \A}@ + o(r). 


We read off Z) = X + Y and Z) + Z7 =X? + 2XY + Y?. Thus, Z) =[X, Y]. 


Remark 5.3.12 


1. The Taylor formula (3.2.13) for manifolds and Formula (5.3.4) imply that for 
every ae G, f € C~(G) and t € R there holds 


n tk 
f (aexp(tX)) = yoo qi (N)@ + o(r"*"). (5.3.9) 


k=0 


The case n = 2 and a = 1 was used in the proof of Proposition 5.3.11. In addi- 
tion, recall from Remark 5.1.2/3 that the smooth structure of G contains a real 
analytic structure. Hence, it makes sense to speak of real analytic functions on G. 
For such a function one can take the limit n — oo in (5.3.9), thus obtaining an 
absolutely convergent series for small t and hence, by absorbing ¢ in X, an abso- 
lutely convergent series for small X: 


(ee) 


1 
f (aexp(X)) = =D al (X*(f))(a). (5.3.10) 


k=0 


This is the Taylor series for real analytic functions on G. Like for the Taylor for- 
mula (3.2.13) for manifolds, by repeated application of (5.3.10) one may produce 
iterated versions analogous to (3.2.14). 

2. Similarly to the argument in the proof, Taylor expansion of (5.3.8) and com- 
parison of coefficients allows to successively calculate the Taylor expansion of 
Z(t) to arbitrary order. Absorbing the parameter ¢ in X and Y, from this ex- 
pansion one obtains a formal series Z(X, Y), known as the Baker-Campbell- 
Hausdorff series of g. Using the notation ad(X) for the mapping g — g given 
by’ ad(X)Y = [X, Y], this series can be written in the form 


ad(X)kad(Y)!! ...ad(X)* ad(Y)" ad(X)° Y 


Z(X,Y)=X i? 
CON Ae DA Ya DELI Haellt.toelst 


r,s>0 


kj +l, >0 
(53,11) 

1 1 
=X+Y+ 5LX, Y]J+ lx [X, Y]]+[¥,0Y, X]]) +--+. (6.3.12) 


7The notation ad will be explained in the next section. 
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(Exercise 5.3.3). It can be shown that Z(X, Y) converges on some neighbourhood 
of the origin in g and that it satisfies 


exp(X) exp(Y) = exp(Z(X, ¥)); 


see for example [297, §2.15] or [129, §1.4]. These references also contain an 
alternative derivation of Formula (5.3.11). 


Proposition 5.3.11 allows to express the sum and the commutator in g in terms 
of the multiplication in G. Indeed, combining this proposition with repeated appli- 


cation of (5.3.3), we get 


Corollary 5.3.13 For all X,Y €g, 


exp(X + Y) = lim (exo(=x) exp(“y)} ; (5.3.13) 
n—>oo n n 
1 1 n 
exp([X, Y)) = lim (e(-=x) exp(~27) exp( =x) exo(“Y)) 
(5.3.14) 


Example 5.3.14 (Classical Lie groups) Let V be a finite-dimensional K-vector 
space and let G C End(V) be a classical Lie group. Under the identification of 
the Lie algebra g of G with the Lie subalgebra T1G of gl(V) according to Exam- 
ple 5.2.6, the flow of the left-invariant vector field corresponding to A € g is given 
by (5.2.4). Hence, 


expg(A)=e4, Aég. (5.3.15) 


That is, for the classical groups, the exponential mapping is given by the restriction 
of the exponential series on End(V) to arguments from g. This explains the name. 
For G = K \ {0}, where K = R or C, and G = U(J), this yields, in particular, the 
ordinary exponential functions K — K \ {0} and iR — U(1) C C, respectively. 


Example 5.3.15 (Vector groups) Let V be a K-vector space, viewed as a Lie group. 
Under the identification of the Lie algebra of V with the vector space V, cf. Ex- 
ample 5.2.10, the equation for the integral curves of the left-invariant vector field 
corresponding to X € V is given by y(t) = X. The solution with initial condition 
y(0) =ve V is y(t) =v+tX. Hence, the exponential mapping of V coincides 
with the identical mapping idy. 


Example 5.3.16 (Direct product) Let G; and G2 be Lie groups with Lie algebras g1 
and go, respectively. Consider the direct product Lie group G := G, x Go and let 
pr; : G — G; denote the natural projections. Under the natural identification of the 
Lie algebra of G with the direct product Lie algebra g) © g2, cf. Example 5.2.16, 
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left-invariant vector fields on G are given by pairs (X;, X2) with X; € g; and there 
holds d pr;(X1, X2) = X;. Hence, Proposition 5.3.6 implies 


expg(X1, X2) = (pr; 0 expg (X1, X2), pry o expg(X1, X2)) 
— (expg, (X1), eXPg, (X2)). 


Example 5.3.17 (Determinant and trace) Let K = R, C and let us consider the de- 
terminant homomorphism det : GL(n, K) > K \ {0}. According to Example 5.3.14, 
for A € gl(n, KK) we have exp(A) = e4. Hence, (5.2.10) and Proposition 5.3.6 imply 


det(e4) =e", (5.3.16) 


Let us mention that this equality can also be proved by more elementary arguments 
which do not make use of the theory of Lie groups. One way consists in replacing 
A by tA and showing by direct computation that both sides of (5.3.16) satisfy the 
same ordinary differential equation and the same initial condition. Another way is to 
use the Jordan normal form of A. We leave the details to the reader (Exercise 5.3.4). 


Remark 5.3.18 Since exp(g) contains 1 and since it is connected,® it is contained in 
the identity connected component Go. For G = R \ {0}, the exponential mapping is a 
global diffeomorphism from R onto Go = R. For the vector groups, it is trivially a 
diffeomorphism. In general, however, exp is neither injective nor surjective onto Go. 
For example, in the case G = GL(n, C), it is surjective but not injective, whereas 
in the case G = GL(n, R), n > 2, it is neither injective nor surjective onto Go, see 
Exercise 5.3.5. As a general fact, exp is surjective onto Go if Go is compact or 
Abelian. In the latter case, this can be seen as follows. As a consequence of (3.2.12) 
and (5.3.5), the Lie algebra of G is Abelian. According to Propositions 5.3.5 and 
5.3.10, exp(g) is a subgroup of Go which contains a neighbourhood of 1. According 
to Proposition 5.1.7, this implies exp(g) = Go. 


Exercises 

5.3.1 Show that every one-parameter subgroup of a Lie group is the maximal inte- 
gral curve through the unit element of a unique left-invariant vector field, cf. 
Remark 5.3.2. 

5.3.2 Repeated application of (5.3.7) yields 


exp(—tX) exp(—tY) exp(tX) exp(tY) = exp(?7[X, Y]+0O (t*)) 
for all X,Y eg. 


Use this to give a geometric interpretation of the commutator of left-invariant 
vector fields. 


8 As the image of a connected set under a continuous mapping. 
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5.3.3 Calculate the first three terms of the Baker-Campbell-Hausdorff series given 
in (5.3.12) by Taylor expanding (5.3.8) to third order and absorbing the pa- 
rameter f in the vector fields X and Y. After this warm-up, prove the general 
formula (5.3.11) by induction on n. 

5.3.4 Carry out the proofs of the identity (5.3.16) indicated in Example 5.3.17. 

5.3.5 Show that the matrix 

_{-l 1 
[0 3] 


is contained in the identity component of GL(2, R) but not in the image of the 
exponential mapping. 
Hint. Assume that a = e“ for some A € gl(2, R). Use (5.3.16) to show that 
A has eigenvalues 1 and —A. Use the Jordan normal form of A to show that, 
then, a has eigenvalues e* and e~*. Deduce from this that A and hence a is 
diagonalizable (contradiction). 

5.3.6 Show that the tangent mapping of exp at X € g is given by 


(ee) 


(Lexp(—x))4. 9 EXPy = y. 
n=0 


a+)! (ad(—X))” 


and that exp, is bijective if and only if no eigenvalue of ad(X) is an integer 
multiple of 277i. 


5.4 Adjoint Representation and Killing Form 
Let K=R, C, H and let V be a finite-dimensional K-vector space. 


Definition 5.4.1 (Representation) 


1. A representation of a Lie group G on V is a Lie group homomorphism @ : G > 
GL(V). A homomorphism of representations gj : G > GL(V;), i = 1,2, is a 
linear mapping A: Vj — V>2 such that A 0 gy) (a) = ¢2(a) oA forallaeG. 

2. A representation of a Lie algebra g on V is a Lie algebra homomorphism ¢ : 
g— gl(V). A homomorphism of representations g; : g > gI(V;), i= 1,2, is a 
linear mapping A: Vj — V2 such that A og (A) = @2(A) oA forall A € g. 


Homomorphisms of representations are also referred to as intertwining operators. 

Every Lie group G possesses a natural representation on its Lie algebra g, con- 
structed as follows. For every a € G, the conjugation mapping C, is a Lie group 
automorphism of G. According to Remark 5.2.14/2, the induced homomorphism 
dC, of g is an automorphism and hence, in particular, a vector space automorphism 
of g. Thus, we obtain a mapping 


Ad:G—>GL(g), — Ad(a):=dCy. 
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According to (5.2.9), Ad is a group homomorphism. To see that it is smooth, con- 
sider the chart on GL(g) given by the matrix representation induced by a basis in g. 
For Ad to be smooth with respect to this chart it suffices that the mapping G —> g 
given by at Ad(a)X be smooth for all X € g. This mapping can be written as the 
composition 


G>T(GxG)-TG>Gxg-—4g. 


Here, the first mapping assigns to a € G the tangent vector at (a, 1) corresponding 
to (O, Xz) under the natural identification of T(G x G) with TG x TG, the second 
mapping is the tangent mapping of (a, b) + C,q(b) and the last two mappings are 
the left trivialization and the natural projection onto the second factor, respectively. 
Thus, Ad is a Lie group homomorphism and hence a representation of G on g. 

Next, we note that every Lie algebra possesses a natural representation on itself, 
given by? 


ad: g — gl(g), ad(X)Y :=[X,Y] forall Y eg. 
Indeed, the Jacobi identity implies 
ad([X, Y]) =[ad(X), ad(Y)], 


hence ad is a Lie algebra homomorphism. Since Ad(a) is a Lie algebra automor- 
phism for all a € G, we obtain 


ad(Ad(a)X) = Ad(a) oad(X)o Ad(a~!) forallaeGandXeg. (5.4.1) 


Definition 5.4.2 (Adjoint representations) The representations Ad and ad are called 
the adjoint representations of G and g, respectively. 


Every representation g of G on V induces a representation of g on V, given by 
the induced homomorphism dq. Under the natural identification of the Lie algebra 
of GL(V) with gI(V), 


d 
dy(X) = 94 (X1) = i g(exp(tX)), Xeg. (5.4.2) 
0 
Proposition 5.4.3 The adjoint representations fulfil the relation 
dAd = ad. 


Proof Let X, Y € g. Due to Remark 5.2.14/2, left-invariance and (5.3.4), we get 


Ad(exp(tX))¥ = Cexpirx)x¥ = Rexprx)« 0 Lexpirx)x¥ = o* ,Y. 


Hence, (5.4.2) yields dAd(X)Y = YyY =[X, Y] =ad(X)Y. 


°The notation ad has already been used in Formula (5.3.11). 
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Let ¢ be a representation of G on the finite-dimensional K-vector space V. Since 
the mapping End(V) —> End(V*) which assigns to A the dual endomorphism A' is 
linear and hence smooth, ¢ induces a representation 


g*:G>GL(V*), — g*(a):=¢(a7!)", (5.4.3) 


called the dual representation of gy. The representation d(g*) : g > gl(V*), induced 
on g, is called the dual representation of dg. A brief calculation shows 


d(y*)(X) = —(dg(x))" for all X eg. (5.4.4) 


Definition 5.4.4 (Coadjoint representation) The duals of the adjoint representations 
of G and g are called the coadjoint representations of G and g and are denoted by 
Ad* and ad*, respectively. 


By definition, 
(Ad*(a)é, Y) = (é, Ad(a~')¥) forallae G, Y egandé eg*. (5.4.5) 
According to Proposition 5.4.3 and (5.4.4), 
(ad*(X)é, Y) = —(é, LX, Y)) for all X,Y eg andé €g*. (5.4.6) 


Next, recall that the kernel of a group homomorphism is the preimage of the unit 
element of the range and that the kernel of a linear mapping between vector spaces 
is the preimage of the origin of the range. Recall further that 


(a) the centralizer Cg(H) of a subset H of a group G consists of those elements 
of a € G satisfying ah = ha for all h € H, and that the centre of G is the 
centralizer of H =G, 

(b) the centralizer of a subset h of a Lie algebra g consists of those elements X € g 
satisfying [X, Y] = 0 for all Y € b, and that the centre of g is the centralizer of 


h=g. 


Proposition 5.4.5 The kernels of Ad and Ad* coincide with the centralizer Cg(Go) 
of the identity component Gy in G. The kernels of ad and ad* coincide with the 
centre 3 of g. 


Proof It is easy to see that ker(Ad*) = ker(Ad) and ker(ad*) = ker(ad). Moreover, 
ker(ad) = 3 is obvious. Hence, it suffices to show ker(Ad) = Cg(Go). Since for 
a € Cg(Go) there holds Ca+G y = idG, and since didg,) = idg, we obtain ker(Ad) 5 
CgG(Go). Conversely, if a € G satisfies Ad(a) = idg, Proposition 5.3.6 implies 


exp(X) = exp(Ad(a)X) =C, (exp(X)) for all X Eg. 


That is, a commutes with all elements of exp(g). Since exp(g) contains a neigh- 
bourhood of 1 in Go, Proposition 5.1.7 yields ker(Ad) C CG(Go). 
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Proposition 5.4.5 implies that if G is Abelian, then Ad(a) = idg for alla eG 
and ad(X) = 0 for all X € g. This applies in particular to vector groups and to tori. 


Example 5.4.6 (Classical Lie groups) Let V be a finite-dimensional K-vector space 
and let G C End(V) be a classical Lie group. Let a € G. Under the identification 
of the Lie algebra g with the Lie subalgebra Ti G of gl(V), Ad(a) = dC, coincides 
with the tangent mapping (C,)',. For A € g, 


(CAH EZ Calel4)= 5 (aelart) aa 
0 


Hence, 
Ad(a)A = aAa', aéG, Ae g. (5.4.7) 


Let us mention that in this example the proof of Proposition 5.4.3 reduces to the 
following calculation: 


d d 
dAd(A)B = Adi, (A)B = 7 Ad(e'“) B = 7 (e'4 Be ‘4) = [A, B]. 
lo 


to 


Example 5.4.7 Recall the Lie algebra isomorphism g : R* — so(3) from Exam- 
ple 5.2.8. It satisfies 


g(ax) =ag(x)a', aeSO(3), xeER’, (5.4.8) 


(Exercise 5.4.1). In view of (5.4.7) and a? = a7!, this implies that g identifies the 
adjoint representation of SO(3) with the identical representation on R?. Similarly, 
the Lie algebra isomorphism R? —> sp(1) of this example, also denoted by 9, satis- 
fies 


y(o(a)x) =ag(x)a, ac Sp(1), xeR’, 


where @ : Sp(1) — SO(3) is the covering homomorphism of Example 5.1.11. This 
follows from (5.4.8), because @(q) = #(q)!. As a consequence, the adjoint repre- 
sentation of Sp(1) can be identified with the representation of Sp(1) on IR? induced 
by ¢. The same statement holds for SU(2). Finally, (5.2.11) implies that the Lie 
algebra isomorphism sp(1) ® sp(1) — so(4) induced by the covering homomor- 
phism Sp(1) x Sp(1) ~ SO(4) of Example 5.1.11 identifies the representation of 
Sp(1) x Sp(1) on s0(4), induced via this covering homomorphism by the adjoint 
representation of SO(4), with the adjoint representation of Sp(1) x Sp(1). 


Example 5.4.8 (Direct product) Let G; and G2 be Lie groups with Lie algebras g; 
and go, respectively, and let G = G, x G2. Under the natural identification of g with 
91 ® g2 of Example 5.2.16, Eq. (5.2.12) implies 


Adg ((a1, a2)) (M1, Y2) = (Ada, (a1) ¥1, Adg,(a2)¥2), ai € Gi, Yi € gi. 


An analogous formula holds for ad. 
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Using the adjoint representation of g, one can construct a natural symmetric bi- 
linear form on g which is invariant under the adjoint representation of G. Recall 
from linear algebra that for an abstract K-vector space V, the trace of an endomor- 
phism of V is defined to be the trace of the corresponding matrix with respect to 
an arbitrary basis in V or, equivalently, to be the sum of eigenvalues, counted with 
multiplicities. The trace is a linear functional on End(V) satisfying 


tr(AB) =tr(BA) forall A, B ¢ End(V). (5.4.9) 


Definition 5.4.9 (Killing form) The Killing form of a finite-dimensional Lie algebra 
g is the bilinear form 


k:gxgoR, k(X,Y):= tr(ad(X)ad(Y)). 


Proposition 5.4.10 (Properties of the Killing form) Let G be a Lie group and let g 
be the Lie algebra of G. The Killing form k of g is symmetric, Ad-invariant, 


k(Ad(a)X, Ad(a)Y) = k(X, ¥), (5.4.10) 


and satisfies 
k(ad(Z)X, Y) + k(X, ad(Z)Y) =0. (5.4.11) 


It is non-degenerate iff g (and hence G) is semisimple. 


Proof The symmetry property and Eq. (5.4.11) are due to (5.4.9). By the help of 
(5.4.9), Eq. (5.4.10) follows from (5.4.1). To prove the last assertion, define e i= 
{X © g:k(X, g) = {0}}. Since every Abelian ideal of g is contained in gt, if k is 
non-degenerate, g is semisimple. Conversely, assume that g is semisimple. As a 
consequence of (5.4.11), g+ is an ideal in g. This implies that the Killing form of 
the Lie algebra g+ is the restriction to g+ of the Killing form of g (Exercise 5.4.2) 
and is hence trivial. Now, Cartan’s criterion!® yields that g+ is a solvable ideal in 
g. Hence, if it was nonzero, it would contain a nonzero Abelian ideal. Therefore, 
gy = {0}. 


Remark 5.4.11 


1. Computation of the trace by means of a pair of dual bases {e;} in g and {e*"} in 
g* yields 
k(X, Y) =(e", [X, [Y, ei]]) = X*¥ci:4 cxj', (5.4.12) 


where X* and Y! are the expansion coefficients of X and Y, respectively, with 
respect to the basis {e;} and cx;' are the corresponding structure constants. The 


10A Lie algebra g whose Killing form k satisfies k(X, Y) = 0 for all X eg, Y € [g, g] is solvable 
[145, §4.3]. 
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symmetric covariant tensor of order 2 
a= ey" cy 


is referred to as the Cartan-Killing tensor of G with respect to the basis {e;}. 

2. According to Proposition 5.4.10, if G is semisimple, the Killing form is non- 
degenerate and hence induces a linear isomorphism F : g > g* defined by 
(F(X), Y) =k(X, Y) for all X, Y € g. Invariance under Ad implies 


F o Ad(a) = Ad*(a) 0 F (5.4.13) 


for all a € G. Thus, F is an isomorphism of the representations Ad and Ad* 
of G. As a consequence, for semisimple Lie groups, the adjoint and coadjoint 
representations can be identified. 


Example 5.4.12 For each of the semisimple classical Lie groups of Example 1.2.6, 
there exists c > O such that 


k(X,Y)=ctr(XY) forall X,Y eg. 


For example, for sl(n, K) and su(n), n > 2, the factor is c = 2n, for so(n), n > 3, it 
is c=n—2 and for sp(n), n > 1, itisc = 2(n+ 1) (Exercise 5.4.3). 


Exercises 

5.4.1 Prove the two formulae stated in Example 5.4.7. 

5.4.2 Let g be a Lie algebra, let i be an ideal in g and let kg and kj denote the 
respective Killing forms. Show that for all X, Y €i there holds kj(X, Y) = 
kg(X, Y). 

5.4.3 Determine the factor of proportionality between the Killing form and the 
trace form of the identical representation for the classical groups listed in 
Remark 5.4.12. 


5.5 Left-Invariant Differential Forms 


Definition 5.5.1 A differential form € on G is called left-invariant if L*é = & for 
allaeG. 


The set of left-invariant differential forms on G will be denoted by 2*(G)E. 
Written pointwise, for 1-forms, the defining condition reads &gp o (La)} = &} for all 
a,b €G or, equivalently, 


&,=& 0(L,-1)), forallaeG. (5.5.1) 
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Proposition 5.5.2 Let G be a Lie group with Lie algebra g. 


1. 2*(G)® is a differential subalgebra of 2*(G). 
2. There exist natural algebra isomorphisms 


Q*(G)°s /)\TiG 7 Ag: 


The first one is given by € +> &. The second one is induced by the isomorphism 
g— T1G, given by X + X34. 

3. For all & € RQ" (G)F and X\,...,X; € g, the function €(X1,..., X;) on G is 
constant. 


Proof 1. By (2.4.24), 92*(G)® is closed under exterior multiplication. By (4.1.23), 
it is closed under taking exterior derivatives. 

2. It suffices to consider the mapping 2*(G)° > /\T3G, defined by & + &4. 
This mapping is a homomorphism of algebras, which is injective by (5.5.1). To 
show that it is surjective, it is enough to prove that its image contains T] G. Thus, 
let yn € T; G. By fixing the second argument of the inverse left trivialization of T*G 
to be n, cf. (5.1.6), one obtains a left-invariant 1-form € with &; = 7. 

3. This is a consequence of (4.1.16). 


According to assertion 2 of Proposition 5.5.2, we will identify left-invariant dif- 
ferential r-forms with elements of /\’ g* without explicitly stating that. Depending 
on the context, for € € rae g* and X,,..., X,; €g, the expression €(X1,..., &) will 
be interpreted as a function on G or as a number. 


Remark 5.5.3 


1. Propositions 4.1.6 and 5.5.2 imply that the exterior derivative of & € /\’ g* is 
given by!! 


—_ XiXj 
dé(Xo,..., Xr) = Do(- 1)! E (Xi, Xj], Xo, *.1, Xr), (5.5.2) 
i<j 
where Xo,..., X- € g. The right hand side may be taken as an intrinsic def- 


inition of a differential on the exterior algebra /\ g*, thus turning the natural 
isomorphism of algebras 2*(G)° = /\ g* of Proposition 5.5.2 into an isomor- 
phism of differential algebras. In particular, this yields a characterization of the 
subcomplex 2*(G)© of the de Rham complex of G in terms of the Lie algebra 
alone. 

2. For € € g*, (5.5.2) yields 


dé(X,Y)=—&([X,Y]), X,Yeg. (5.5.3) 


IC. Proposition 4.1.6 for the notation. 


250 5 Lie Groups 


Let {e],..., e,} be a basis in g, let ci , be the corresponding structure constants 
and let {e*!,..., e*”} be the dual basis in g*. Then, (5.5.3) is equivalent to 
ar es rne* j=] n (5.5.4) 
=~ 5Cjk jf =1,...,n. J. 


This can be seen by evaluating both sides on the basis elements e; (Exercise 
5.5.1). Equation (5.5.4) is known as the Maurer-Cartan equation associated with 
the basis {e1,...,e,} of g. 

3. In terms of left-invariant 1-forms, the inverse left and right trivializations ( x jt 
and (x1)! of T*G, given by (5.1.6), read 


(xt) "Ge =f, (x8) @, 8) = (C88), = bn 0 (Ca). 


Note that the 1-form C7é need not be left-invariant. 


Proposition 5.5.2 yields in particular that the space of left-invariant n-forms cor- 
responds to /\" g*. Hence, it has dimension one and its elements are of the form 
e*! A... A e*” for some basis {e*'} in g*. By left-invariance, every nonzero element 
of this space is a volume form. Hence, we obtain 


Corollary 5.5.4 On every Lie group there exists a left-invariant volume form vg. 
This form is unique up to multiplication by a nonzero real number. 


It is common to write vg (a) = da. The Lebesgue measure associated with a left- 
invariant volume form on G is called a Haar measure on G. Thus, Haar measures 
on G are left-invariant and unique up to a constant. 


Remark 5.5.5 


1. Every scalar product (-,-) on g defines a Riemannian metric g on G by 
GalXa, Ya) = (X,Y), aeG, X,Yeg. (5.5.5) 


By construction, g is left-invariant, that is, Ly.g = g for all a € G, and so is its 
volume form vg. This way, the choice of a scalar product on g and an orientation 
on G singles out a unique left-invariant volume form. Furthermore, every basis of 
g which is orthonormal with respect to (-,-) provides a global frame of TG which 
is orthonormal with respect to g. By (4.4.14), in terms of the corresponding dual 
basis {e*’} of g*, the volume form vg is given by 
vg =e he hehe, 
2. If G is compact, another way to single out a unique left-invariant volume form 
on G is to require that the volume of G be equal to 1. 
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3. We determine the action of right translations on left-invariant volume forms. Re- 
call from linear algebra that for an abstract K-vector space V, the determinant 
of an endomorphism of V is defined to be the determinant of the correspond- 
ing matrix with respect to an arbitrary basis in V or, equivalently, the product 
of eigenvalues, counted with multiplicities. A brief computation (Exercise 5.5.2) 
shows that for every left-invariant volume form vg there holds 


Révg =det(Ad(a~'))vg,  aeG. (5.5.6) 
The smooth function 
A:G>R, = A(a):=det(Ad(a™')), 


is called the modular function of G. For every f € C“(G), integrable with re- 
spect to vg, and for every a € G, the function R* f is integrable as well and 


[®Ave=aer [ f VG: (5.5.7) 
G G 


For compact Lie groups, where every smooth function is integrable with respect 
to any volume form, the existence of left-invariant volume forms provides the pow- 
erful tool of averaging over the group. This concept can be used, for example, to 
construct invariants of representations, e.g. an invariant scalar product. 


Proposition 5.5.6 (Invariant scalar product) Let G be a compact Lie group, let V 
be a finite-dimensional K-vector space and let y be a representation of G on V. The 
vector space V admits a scalar product (-,-) such that 


(p(a)v, y(a)w) =(v,w) forallacGandv,weV. 
For the induced representation dg of g, we have 
(dg(X)v, w) + (v, dg(X)w) =0 forall X Egandv,weV. 


Thus, every finite-dimensional representation of a compact Lie group may be 
assumed to be orthogonal (in case K = R) or unitary (in case K = C, H). 


Proof Choose an arbitrary scalar product (-,-) on V. For v, w € V, define a function 
fo.w € C~(G) by fy.w(a) := (y(ag!)v, o(a~!)w). Choose a left-invariant volume 
form vg on G and define 


(v, w) = | fuwve. 


The integral exists, because G is compact. The defining properties of a scalar 
product carry over from (.,-) to (-,-). Let a € G. A brief computation shows 
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Fo(a)v,o(aw = L* Jv.w- Hence, by left-invariance of vg, 


(o(a)v, p(a)w) = [ (fives A fowve = (0, Ww), 


as asserted. In view of (5.4.2), the formula for dg follows by differentiation. 


For a vector space endomorphism A, let spec(A) denote the spectrum. 


Corollary 5.5.7 For a finite-dimensional K-representation y of a compact Lie 
group G, 


1. spec(g(a)) C UC) for all a € G and spec(dg(X)) C iR for all X € g, 
2. incase K=C, g(a) and dg(X) are diagonalizable for alla € G and X € g. 


Proof 1. Without loss of generality, view the vector space V carrying the represen- 
tation as a real vector space. Choose a basis in V which is orthonormal with respect 
to some @-invariant scalar product. With respect to this basis, p(a) and dg(X) are 
represented by an orthogonal and a skew-symmetric matrix, respectively. Hence, the 
assertion follows from elementary linear algebra. 

2. In case K = C, g(a) is unitary and dg(X) is skew-Hermitian. Hence, this 
assertion follows from elementary linear algebra, too. 


Application of Corollary 5.5.7 to the adjoint representations of G and g yields 


Corollary 5.5.8 Let G be a compact Lie group. 


1. The modular function of G is given by A(a) = +1 for all a € G. If G is con- 
nected, A(a) = 1. 

2. The Killing form k of g is negative semidefinite. 

3. If G is in addition semisimple, —k is an Ad-invariant scalar product on g. 


Proof \. Let a € G. On the one hand, Corollary 5.5.7/1 implies that A(a) = 
det(Ad(a)) € U(1). On the other hand, Ad(a) is an automorphism of the real vector 
space g, hence det(Ad(a)) is real. It follows that A(a) = +1. In particular, A is 
locally constant. Hence, if G is connected, A(a) = A(1) = 1. 

2. Let X € g. By Corollary 5.5.7/1, the spectrum of ad(X)? consists of the squares 
of certain purely imaginary numbers. Hence, k(X, X) = tr(ad(X )?) <0. 

3. This follows from point 2 and non-degeneracy, cf. Proposition 5.4.10. 


Recall from Remark 5.5.5/1 that every scalar product on g defines a left-invariant 
Riemannian metric g on G via (5.5.5). If the scalar product is invariant under Ad, g 
is in addition right-invariant and hence bi-invariant, that is 


LaxG =Raxg=g forallaeG 


(Exercise 5.5.3). Thus, Corollary 5.5.8 implies 
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Corollary 5.5.9 Every compact Lie group admits a bi-invariant Riemannian metric. 
If the group is in addition semisimple, then the negative of the Killing form yields 
such a Riemannian metric. 


To conclude this section, we discuss left-invariant differential forms on G with 
values in g, that is, differential forms € € 2*(G, g) satisfying L3é = &. According 
to Remark 4.1.10/2, given a basis {e;} in g, every € € 2"(G, g) can be written as 


=! Be; (5.5.8) 


with ordinary differential r-forms &'. The form € is left-invariant iff so are 
the &'. Since g is an algebra, 2*(G, g) carries an exterior product 2"(G, g) x 
2" (G, g) > 2+" (G, g), given by 


[&1, €21(%1,..-, Yrytry) 
1 


“ry lro! 


2 (ei Pedierins Yeti eS atts ig Tenia) |: 


TES; itr 
There holds the following analogue of Proposition 5.5.2. 


Proposition 5.5.10 Let G be a Lie group with Lie algebra g and let 2*(G, g)° 
denote the set of left-invariant differential forms on G with values in g. 


1. 2*(G, g)° is a differential subalgebra of 2*(G, g). 
2. There exist natural algebra isomorphisms 


2*G,9)° = (ATG) @9=(As*) oo, 


given by & +> &y and induced by the mapping X +> X41, respectively. 
3. Forallé € Q*(G, we and X,,...,X;, € g, the g-valued function &(X1,..., X;) 
on G is constant. 


Proof The arguments are completely analogous to those for ordinary left-invariant 
differential forms in the proof of Proposition 5.5.2, except for the surjectivity of 
the mapping & +> £1, because §2*(G, g) need not be generated as an algebra by 1- 
forms. Thus, let 7 € ra T].G @g. Choose a basis {e;} in g and write n = n! @e; with 
nie NN’ T,.G. According to Proposition 5.5.2/1, y= él for certain left-invariant 
&' € 2"(G). Then, & := &' @ e; is a left-invariant differential r-form with values in 
g satisfying &; = 7. 


According to Proposition 5.5.10, left-invariant differential 1-forms with values 
in g can be identified with linear endomorphisms of g. 


Definition 5.5.11 (Maurer-Cartan form) The left-invariant differential 1-form on G 
with values in g corresponding to idg is called the Maurer-Cartan form of G. 
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We denote the Maurer-Cartan form by ©. By definition, 
(O,X)=xX forall X eg. (5.5.9) 


As aconsequence, © assigns to a tangent vector Y at a the left-invariant vector field 
X with Xz = Y. Moreover, the expansion (5.5.8) of © with respect to a basis {e;} 
in g is given by 


+1 
O=e" BE. 


Remark 5.5.12 


1. Like for ordinary left-invariant 1-forms, and by the same argument, the exterior 
derivative of left-invariant 1-forms with values in g is given by (5.5.2). For the 
Maurer-Cartan form this yields 


d@(X, Y) = -(0, [X, Y]}=-[X, Y]=-[0(X), OY) ] 
and hence 
dO + 5ONO=0, (5.5.10) 


This is the Maurer-Cartan equation (5.5.4) in a basis-independent version. 

2. Fora classical Lie group G C End(V), under the identification of the Lie algebra 
of G with a subalgebra of gl(V) and of the tangent spaces T,G with subspaces 
of End(V), cf. Example 5.2.6, there holds 


@(Y)=a 'Y¥, aeG, YeET,G. 


In the physics literature, it is common to write da for the tangent mapping!* of 
idg at a. Using (idg)/, = idr,g, one obtains Og = a~'da. 


We conclude this section with a remark on right-invariant differential forms. 


Remark 5.5.13 (Right-invariant differential forms) By analogy with left-invariant 
differential forms one defines right-invariant differential forms on G by the condi- 
tion R*é = & and shows the following (Exercise 5.5.7). 


1. Equivalent conditions for a differential 1-form & to be right-invariant are &,_ 0 
(Ra)}, = & for all a, b € G and & = & 0 (R,-1)/, for alla eG. 

2. Proposition 5.5.2 remains true for right-invariant differential forms if g is re- 
placed by the subalgebra of right-invariant vector fields. In terms of right- 
invariant 1-forms, the inverse left and right trivializations ( re )! and ( Xn z 
of T*G, given by (5.1.6), read 


(xf), 8) = (C818), = 600g, (XB) G8) = bea. 


127 ike writing dx for the first derivative at x of the function f (x) = x on R; it must not be confused 
with the notation vg (a) = da for an invariant volume form. 
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3. 


Via — inv“, the inversion mapping inv : G > G provides an isomorphism of dif- 
ferential algebras between the subalgebras of left and right-invariant differential 
forms. Under the natural isomorphisms with /\ Tj. G, —inv* corresponds to the 
identical mapping. 


Exercises 
5.5.1 Show that Eqs. (5.5.3) and (5.5.4) are equivalent. 
5.5.2 Show that the pull-back of a left-invariant volume form on G by Rg, a€ G, 


is given by (5.5.6) and that integrable smooth functions satisfy (5.5.7). 


5.5.3, Consider the Riemannian metric g on G defined via (5.5.5) by a scalar product 


on g. Prove that if the scalar product is Ad-invariant, then g is bi-invariant. 


5.5.4 (a) Show that the mapping 


R:(—2,7) x (0,7) x (-2,7) — SOG), 
RO, VW) = RPRy OR), 


where 
cos@ —sing O cos@é 0 —sing 
R.(@)=]|sin@ cosd@ Of], Ry(o) = 0 1 0 
0 0 1 sing O  cosd 


(rotation by an angle @ about the z-axis and the y-axis, respectively), 
induces a local chart on SO(3). What is the complement of the domain of 
this chart?! 

Show that the left-invariant volume form with respect to which SO(3) has 
unit volume is given in these coordinates by 


(b 


wm 


1 
VSO(3) = oa sinvddd A dd A dw. 


5.5.5 Use the result of Exercise 5.1.7(a) to show that the mapping 


U : (—,2) x (0,7) x (—2z, 277) — SU(2), 
given by 


ei V4%) cog2 93 V-®) sin ‘ 


e728) sin 2B 92049) cos 2 


vo a=| 


induces a local chart on SU(2). Compute the left-invariant volume form with 
respect to which SU(2) has unit volume in these coordinates. 


5.5.6 Find a left-invariant volume form on SL(2, C). 
5.5.7 Verify the properties of right-invariant differential forms stated in Re- 


mark 5.5.13. 


'3The angles @, # and y are called Euler angles. 
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5.6 Lie Subgroups 
Let G be a Lie group of dimension n and let g be the Lie algebra of G. 


Definition 5.6.1 (Lie subgroup) A Lie subgroup of G is a pair (H, y), where H isa 
Lie group and g : H — G is an injective and immersive Lie group homomorphism. 
If g is an embedding, (H, ¢) is said to be embedded. Lie subgroups (A, g1) and 
(Hz, ~2) of G are said to be equivalent if there exists a Lie group isomorphism 
w: H, — Ho such that g} =gow. 


Example 5.6.2 


1. The classical groups of Example 1.2.6 are embedded Lie subgroups of the ap- 
propriate general linear group GL(n, K). Here, the Lie group homomorphism ¢ 
is given by the natural inclusion mapping. 

2. (GL(n, H), g), with g being induced from (1.1.2), is an embedded Lie subgroup 
of GL(2n, C). 

3. Leta, b € R \ {0} such that ¢ is irrational and let g : R > T?, p(t) := (el, el”), 
Then, (R, ¢) is a Lie subgroup of T’. It is not embedded. 

4. Let a,b € Z \ {0} be relatively prime and let g : U(1) > T?, g(z) := (<4, 2°). 
Then, (U(1), g@) is an embedded Lie subgroup of T?. 

5. The identity component Go of G is an embedded Lie subgroup of G. 


Right from the start we observe that Propositions 1.6.10 and 1.6.14 remain true if 
the terms submanifold and C*-mapping are replaced by Lie subgroup and Lie group 
homomorphism, respectively. The situation with Lie subgroups is yet simpler than 
that with general submanifolds, see Proposition 5.6.4 below. We start with deriving 
the basic properties of Lie subgroups. A distribution D on G is said to be left- 
invariant if L/D, = Dap for alla, be G. 


Lemma 5.6.3 The distribution on G spanned by a Lie subalgebra of g is left- 
invariant, regular and integrable. 


Proof Left-invariance holds by construction and implies regularity. The distribution 
is involutive by Remark 3.5.6 and hence integrable by Corollary 3.5.12. 


Proposition 5.6.4 (Basic properties of Lie subgroups) 


1. The connected components of a Lie subgroup (H, p) of G are maximal integral 
manifolds of the distribution on G spanned by the Lie subalgebra im(d@) of g. 

2. Lie subgroups are initial submanifolds. 

3. Lie subgroups are equivalent iff their images coincide as sets. 


Assertion 2 implies that the figure eight submanifold cannot occur as the group 
manifold of a Lie subgroup. Assertion 3 yields, in particular, that if a subgroup 
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H CG admits a smooth manifold structure which makes it into a Lie subgroup of 
G, then this structure is unique. 


Proof Let § denote the Lie algebra of H and let D denote the distribution on G 
spanned by im(dg) = dg(h). For a € H and X € h, a brief computation shows 
g'Xq = (dg(X))a. This implies g’T, H = Dgca) for all a € H. Hence, for every 
connected component Hj of H, (Hj, ¢;H;) is an integral manifold of D. It is max- 
imal, because the vector fields in h are complete. This proves assertion 1. Then, 
assertion 2 follows by means of Proposition 3.5.15. In view of the fact that the 
restriction of a group homomorphism in range to a subgroup remains a group ho- 
momorphism, assertion 3 follows from assertion 2 by the same argument as for the 
analogous assertion about initial submanifolds, see Remark 1.6.13/5. 


Next, we link Lie subgroups to Lie subalgebras. 


Proposition 5.6.5 (Connected Lie subgroups and Lie subalgebras) The assignment 
of im(dg) to (H, @) defines a bijection between equivalence classes of connected 
Lie subgroups of G and Lie subalgebras of g. 


Proof We must show that for every Lie subalgebra h of g there exists a connected 
Lie subgroup (H,y) of G such that im(dg) = § and that (H, ~) is unique up to 
equivalence. 

Existence: let D be the distribution on G spanned by 5. By Lemma 5.6.3, 
D is left-invariant and integrable. Let H be the maximal integral manifold of D 
through 1. By left-invariance of D, for every a € H, L,-1(#) is an integral man- 
ifold of D. Since it contains 1, we get L,-1(H) C H. Hence, a-'b EH for all 
a,b € H, that is, H is a subgroup of G. Since the multiplication mapping of H is 
the restriction of that of G in domain to H x H and in range to H, it is smooth by 
Proposition 3.5.15. Hence, H is a Lie group. Since, by construction, the natural in- 
clusion mapping 1: H — G is a Lie group homomorphism, (H, ) is a Lie subgroup 
of G. Since T; H = {X1: X € 5}, the image of dg coincides with h. This proves 
existence. 

Uniqueness: let (M1, 91) and (H2, ¢2) be connected Lie subgroups of G sat- 
isfying im(dg;) = im(dg2). By assertion 1 of Proposition 5.6.4, they are maxi- 
mal integral manifolds through 1 of one and the same distribution on G. Hence, 
¢1(A)) = ¢2(A2). Then, assertion 3 of this proposition implies that (H), g) and 
(Ao, ¢2) are equivalent. 


Finally, we derive two sufficient conditions for a subgroup of G to be a Lie 
subgroup. We suppress the natural inclusion mapping in the notation. 


Proposition 5.6.6 (Subgroups which are submanifolds) Let H C G be a subgroup. 
If H admits a smooth structure which makes it into a submanifold of G, then H is a 
Lie group and a Lie subgroup of G with respect to this structure. The corresponding 
Lie subalgebra is given byh={X €g: X1 €T1H}. 
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The smooth structure is then unique by Proposition 5.6.4/3. 


Proof Define h := {X € g: X, € Tz H} and let D denote the distribution spanned 
by §. It suffices to show that H is an integral manifold of D, because then Lg (#7) 
is an integral manifold of D through a for all a € G, hence D is integrable and 
Proposition 3.5.15 yields that the restriction of the multiplication mapping of G to 
His smooth. This yields the assertion. 

Thus, let a € H. Let k := dim(#7). Since both D and H have dimension k, it 
suffices to show that Dg C Tg H. Assume, on the contrary, that this is not the case. 
For a smooth curve y in H and b € H, define y? :=Lpo y. Note that y? is a curve 
in H which is smooth in G, but we cannot assume that it is smooth in H, because 
a priori we do not know whether Lp is a smooth mapping of H. Choose smooth 
curves yj,..., ¥x in H through a such that (0), ..., ~%(0) form a basis in T,H. 
Since Dg ¢ TaH and since Dg = L,T1H, there exists a smooth curve yx41 in H 


through 1 such that yi, (0) ¢ Ta HH. Then, ye gnats ve are curves in H through 


1, smooth in G, such that yt! (O),..., ye (0), ¥%-+41(0)} is a linearly independent 
system in TyG. Choose smooth curves yz+2,.-., Yn in G through 1 whose tangent 
vectors at tf = 0 complement this system to a basis in Ty G. Then, according to the 
Inverse Mapping Theorem, there exist ¢ > 0 and an open neighbourhood U of 1 in 
G such that the mapping 


-1 -1 
® : (-e,8)" >U CG, Pthi=yl (th) vg eve (the) + Yat), 


is a diffeomorphism. Since HM U is open in H, it is a submanifold of U, hence 
(HNU,@ ae y) is a submanifold of IR” of dimension k. On the other hand, by con- 
struction, the image @~!(H NU) contains (—e, ¢)‘+! x {0}. This is a contradiction, 
because the image of a submanifold cannot contain the image of another submani- 
fold of higher dimension. Hence, Dg C TqH and the proposition is proved. 


To derive the other sufficient condition for a subgroup to be a Lie subgroup we 
need 


Lemma 5.6.7 [fm and m2 are complementary vector subspaces of g, there exist 
open neighbourhoods U; of the origin in mj, i = 1,2, and V of 1 in G such that the 


mapping 
g:m xm —>G, p(X1, X2) = exp(X1) exp(X2), 
restrics to a diffeomorphism from U, x U2 onto V. 


Proof The tangent mapping gj : mj x m2 — g is given by (X1, X2) > X1 + X2. 
Since m; and m2 are complementary, it is bijective. Hence, the assertion follows 
from the Inverse Mapping Theorem 1.5.7. 
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Theorem 5.6.8 (Closed subgroups) Let H C G be a subgroup. If H is closed in G, 
then it admits a smooth structure which makes it into a Lie group and an embedded 
Lie subgroup of G. The corresponding Lie subalgebra of g ish ={X € g: exp(tX) € 
H for all t € R}. 


Proof First, we show that § is a linear subspace of g. By construction, tX € § for 
all X € h andt ER. For X, Y €g, (5.3.13) implies 


t t n 
exp(t(X + Y)) = sim, (exp( <x) exp(<r)) . 


Since exp(£X) exp(£Y) € H forall n and ¢ and since H is closed, exp(t(X + Y)) € 
H for all t € R, hence X + Y € h. Next, equip H with the relative topology induced 
from G. Choose a neighbourhood V of 1 in G such that p := exp! : V > g is 
defined and smooth and hence a local chart on G. We claim that V can be adjusted 
so that 


p(VNH)=p(V)Nb. (5.6.1) 


If so, then the family consisting of the local charts (La(V), p o L,-1 tLa(V))> aédH, 
satisfies conditions (E1) and (E2) of Proposition 1.7.3 and hence defines a smooth 
structure on H which makes H into an embedded submanifold of G. Then, Proposi- 
tion 1.6.10 implies that in this structure, H is a Lie group and hence a Lie subgroup 
of G. The corresponding Lie subalgebra of g contains and, by (5.6.1), it has the 
same dimension as , hence it coincides with . In particular, h is a Lie subalgebra 
of g. 

To show (5.6.1), assume, on the contrary, that V cannot be chosen so that (5.6.1) 
holds. Since exp(h) C H, this means that V M exp(h) is properly contained in 
V 1 A, for arbitrarily small V. Thus, there is a sequence {a,} in H converging 
to 1 such that a, ¢ exp(h) for all n. Choose a subspace m of g complementary 
to h. According to Lemma 5.6.7, for large enough n, a, defines unique sequences 
{X,} in h and {Y,,} in m by a, = exp(X,,) exp(Y,,) and both of these sequences con- 
verge to the respective origin. Since H is a subgroup, we obtain exp(Y,,) € H. Since 
an € exp(h), we conclude Y,, ~ 0. Now, choose a norm || - || on m. Since the cor- 
responding unit sphere in m is compact, by possibly removing some members we 
may assume that the sequence {Y, || ¥,||~!} converges to some Y € m with ||Y|| = 1. 
We show that exp(tY) € A for all t € R and hence Y € h (contradiction). Let tf €e R 
be given. Let {k,} denote the sequence of integer parts corresponding to {t||¥,||~'}. 
Then, ky, ||Y,|| — ¢t for n + o© and hence 


Yn 


Yn lk 


; : kn 
exp(tY) = jim. exp( ks Yn ll - ) = jim (exp(Yn)) eH, 


as H is closed. This completes the proof of the theorem. 


Remark 5.6.9 Instead of Proposition 1.7.3, one may use the following argument 
to prove that H, equipped with the relative topology induced from G, admits a 
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submanifold structure. The details are left to the reader (Exercise 5.6.1). By (5.3.14), 
h is a Lie subalgebra. Let D be the distribution on G generated by and let H be 
the connected Lie subgroup associated with h, that is, the maximal integral manifold 
through 1 of D. Since the subset H of G is invariant under the flow of the vector 
fields in h, for every a € H there holds L,(H) C H. Use the argument of the proof 
of Theorem 5.6.8 to show that closedness of H implies the existence of an open 
neighbourhood V of 1 in G such that HOV = HOV. Construct from this an open 
neighbourhood V of H in G such that H = VN H. Conclude that the maximal 
integral manifolds La(H ) of D, a € H, are the connected components of H. Since 
the latter is second countable, there are at most countably many of them. This yields 
the desired submanifold structure of H. 


Corollary 5.6.10 A Lie subgroup is embedded iff its image is a closed subset. 


Proof Denote the Lie subgroup by (H, ¢). If g(#) is closed, Theorem 5.6.8 pro- 
vides a Lie subgroup structure on g(H), where g(#7) carries the relative topol- 
ogy induced from G. By Proposition 5.6.4/3, the Lie subgroups (H, g) and g(#7) 
are equivalent. This means that g is a diffeomorphism (and hence a homeomor- 
phism) onto its image. Conversely, if (77, g) is embedded, Proposition 5.6.4/1 and 
Remark 3.5.19/1 imply that g(7) is closed. 


Example 5.6.11 


1. The kernel of a Lie group homomorphism G —> H is a subgroup (by the ho- 
momorphism property) and closed (by continuity) and hence an embedded Lie 
subgroup of G by Theorem 5.6.8. Using Proposition 5.3.6 it is easy to see that 
the Lie algebra of ker(@) is given by ker(dg) (Exercise 5.6.2). 

2. In view of Theorem 5.6.8, for the classical groups of Example 1.2.6 to be Lie 
groups it suffices that they are closed subsets of the appropriate general linear 
group GL(n, K). Proposition 5.6.4/3 implies that the smooth structure so ob- 
tained coincides with that provided by the Level Set Theorem 1.2.1. 


Exercises 

5.6.1 Work out the alternative proof of Theorem 5.6.8 sketched in Remark 5.6.9. 

5.6.2 Use Proposition 5.3.6 to prove that the Lie algebra of the kernel of a Lie group 
homomorphism is given by the kernel of the induced Lie algebra homomor- 
phism, cf. Example 5.6.1 1/1. 


5.7 Homogeneous Spaces 


Let G be a Lie group. Recall from elementary algebra that every subgroup H C G 
defines an equivalence relation on G by a ~y b iff a~'b € H. The equivalence 
classes are given by the subsets aH of G, a € G. They are called the left cosets of 
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H in G. Let G/H denote the set of equivalence classes, equipped with the quotient 
topology. This space will be referred to as the quotient of G by H. By definition of 
the quotient topology, the natural projection 


m:G—>G/H, m(a):=aH, 
is continuous. Every a € G defines a mapping 
[a:G/H > G/H, La (bH) :=abH, 


fulfilling 
Lgom=mnol, forallacG. (5.7.1) 


The topological structure of the quotient G/H may be quite strange, as for example 
in the case G = T’ and H = {(e!@", e!") : t E R} with ; irrational. If, however, H is 
closed, then the quotient G/H turns out to be a smooth manifold. To prove this, we 
need 


Lemma 5.7.1 Let G be a Lie group and let H C G be a subgroup. 


1. The natural projection x : G + G/H is open. 
2. The induced mappings La are homeomorphisms of G/H. 
3. If H is closed, then G/H is Hausdorff. 


Proof 1. Let U C G be open. The subset 7(U) is open in G/A iff a(x (U)) is 
open in G. The latter holds, because a—'(x(U)) = UaeH Rg(U) and the Rg are 
homeomorphisms of G. 

2. Due to La fo) C1 = idg/y, it suffices to show that La is open for alla € G. 
Thus, let U C G/H be open. By (5.7.1), La(U) = 1 0 La(x~!(U)). Since z is 
continuous and since Lg and z are open mappings, the assertion follows. 

3. Consider the mapping g: G x G > G, defined by g(a), az) := ay ap. We 
have z(a,) = (az) iff (aj, a2) € gy !(H). Since H is closed and since ¢ is con- 
tinuous, g~!(H) is a closed subset of G x G. Now, let G1, G2 € G/H be such 
that G@, 4 a. Choose a; € G such that Gj = m(a;). Then, (aj,a2) ¢ y~!(H). 
Since gy '(H ) is closed, there exist neighbourhoods V; of a; in G such that 
(Vi x V2) g'(H) = ©. This implies 7(V|) N 2(V2) = S. Since z is open, 2 (V;) 
is a neighbourhood of a;, i = 1, 2. This yields the assertion. 


Theorem 5.7.2 (Homogeneous space) Let G be a Lie group and let H C G bea 
closed subgroup. There exists a unique smooth structure on the quotient G/H such 
that the natural projection x : G — G/H is a smooth submersion. 


Proof It suffices to prove existence, because uniqueness holds for any submersion, 
see Remark 1.5.16. Choose a subspace m of g complementary to h. The manifold 
structure on G/#H will be modelled on this subspace. 
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According to Lemma 5.6.7, there exist open neighbourhoods Wy, and W,, of the 
origins in h and m, respectively, and an open neighbourhood V of 1 in G such that 
the mapping 


pg: Wm x Wy V, (X, Y)  exp(X) exp(Y), 


is a diffeomorphism. Since H is closed and hence embedded, exp(Wj,) is open with 
respect to the relative topology on H induced from G. As a consequence, Wy, may 
be shrunk so that (exp(Wm)* 1 H) C exp(Wy). Consider the mapping z 0 exp: 
Wn > G/H. It is 


(a) injective: if X, Y € Wi satisfy exp(—X) exp(Y) € H, then exp(— X) exp(Y) = 
exp(Z) for some Z € Wy, hence g(Y, 0) = g(X, Z) and injectivity of g implies 
X =Y (and Z=0); 

(b) open: if U is open in Wm, then U x Wy is open in Wm x Wy. Since g and x 
are open, 7 oexp(U) =m 0 y(U Xx Wy) is open in G/H. 


Thus, 7 o exp maps Wy, homeomorphically onto the open subset 
U:=1o0 exp(Wm) 


of G/H and hence induces a local chart ¢ := (a 0 exp)~!: U > mon G/H. Since 
the mappings L, are homeomorphisms, 


{(La(U), @ oL,-1):a€ G} 


is an atlas on G/H. The transition mappings can be expressed in terms of exp, 
exp” !, left translations and the projection g —> m defined by the vector space de- 
composition g = § @ m. Details are left to the reader. It follows that the atlas 
so constructed defines a smooth structure on G/H. To see that m7 is a smooth 
submersion, due to m(La(V)) = Ly (U ), it suffices to check that the restriction 
mz:Lg(V)> i (U ) is a smooth submersion for all a € G. The latter follows from 
the fact that the mapping kK o C,-1 omolgog~: Wm X Wy > Wm coincides with 
the natural projection of the direct product. 


Remark 5.7.3 According to Remark 1.5.16, being a submersion, 2 admits local 
sections. That is, for every Gd € G/H there exists an open neighbourhood U anda 
smooth mapping s : U — M such that 2 os = id;;. In the present situation, as a by- 
product of the construction of local charts on G/H in the proof of Theorem 5.7.2, 
one has the following distinguished class of local sections. For every subspace m 
complementary to h there exists an open neighbourhood W of the origin in m such 
that exp(W) C G is an embedded we es Ttexp(W) 18 injective and U:=m0 
exp(W) is open in G/H. Then s := 17 :U> exp(W) is a local section of zr, 
and so is (La (U), Lgoso Tye 1) for all a € G. This argument also shows that the 
natural vector space isomorphism g — TG, given by evaluation at 1, descends to 
an isomorphism 


9/6 = Ti (G/F). (5.7.2) 


5.7 Homogeneous Spaces 263 


The existence of local sections of z implies that the equality (5.7.1) can be re- 
solved locally for Lg. This entails 


Corollary 5.7.4 The mapping Ly isa diffeomorphism of G/H for alla eéG. 


In Sect. 6.2 it will become clear that Corollary 5.7.4 provides the justification 
of the term homogeneous space, referred to in the headlines of this section and of 
Theorem 5.7.2 (see Remark 6.2.10/3). Furthermore, in Sect. 6.5 we will see that the 
coset manifold G/#H can be interpreted as the quotient of the action of the Lie group 
H on the manifold G by right translation and, correspondingly, as the base manifold 
of a principal bundle with structure group H. Finally, we note that the construction 
of the coset manifold carries over in an obvious way to the space of right H-cosets 
in G. In this case, right translation induces diffeomorphisms R, for alla éG. 


Example 5.7.5 (Stiefel manifolds) Let K = R,C or H. The Stiefel manifold 
Sk(k, n) is defined to be the set of k-frames in IK” which are orthonormal with 
respect to the standard scalar product. The actual manifold structure is obtained by 
identifying Sx (k, n) with a certain homogeneous space as follows. Any orthonormal 
k-frame in K” can be obtained from the first k elements of the standard basis in KK” 
by a linear isometric transformation. This transformation is determined by the given 
frame only up to its action on the last n — k elements of the standard basis. That is, 
two transformations produce the same k-frame iff they differ by prior application of 


a block matrix 
I, O 
0 , (5.7.3) 


where a is an isometry of K"-*. Hence, as sets, 
Sr(k, n) = O(n)/O(n — k), 
Sc(k, n) = U(n)/U(n — k), 
Su(k,n) = Sp(n)/Sp(n — k), 


where O(n — k), U(n — k) and Sp(n — k) refer to the corresponding subgroups 
of matrices of the form (5.7.3), and these equalities define the smooth structure of 
SK(k, n). For k <n, a possible sign (if K = R) or phase (if K = C) in the determi- 
nant of the isometry producing the desired k-frame can be shifted to the irrelevant 
part which acts on the last n — k standard basis vectors. Therefore, 


Sp(k, n) = SO(n)/SO(n — k), Sc(k, n) = SU(n)/SU(n — k). 
An orthonormal 1-frame is just a vector of length 1. Hence, 
Sk(1,n)=S"-!, = d=dimp(X). 


We leave it to the reader to check that this is a diffeomorphism, indeed (Exer- 
cise 5.7.3). 
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Example 5.7.6 (Gra8mann manifolds) The Gra8mann manifold Gx(k,n) is de- 
fined to be the set of k-dimensional subspaces of K”. As for the Stiefel manifolds, 
the manifold structure of Gg(k,n) is obtained by identifying it with a homoge- 
neous space. Any k-dimensional subspace of KK” can be obtained from the subspace 
spanned by the first k elements of the standard basis in IK” by application of a lin- 
ear isometry. This transformation is determined by the given subspace up to prior 


application of a block matrix 
a 0O 
k >| ‘ (5.7.4) 


where a and b are isometries of K* and K"~*, respectively. Hence, 


Gr(k,n) = O(n)/(O(K) x Om — 4k), 
Gc(k, n) = U(n)/(U(k) x Un —k)), 
Gu(k, n) = Sp(n)/(Sp(k) x Sp(n — k)), 


where the subgroups consist of matrices of the form (5.7.4) and the smooth structure 
of Gx (k,n) is defined by that of the corresponding homogeneous space. The case 
k = 1 reproduces the projective spaces Gx(1,) = KP”! of Example 1.1.15. The 
proof that both sides are in fact diffeomorphic is postponed until Sect. 6.5 (Exam- 
ple 6.5.4/3). 


Example 5.7.7 (Flag manifolds) Let 0 < kj <---<k;_1 <n be integers. A flag of 
type (k,,...,k-—1) in K” is an ascending sequence of vector subspaces Vj C --- C 
V,—1 C K” where dim(V;) = k;. By defining W| to be V; and W;.-; to be the orthog- 
onal complement of V; in V;;1, with every flag one can associate a decomposition 
K" = W; @--- ® W, into mutually orthogonal subspaces, and this defines a bijec- 


tion from the set of flags of type (ki,...,k,-—1) onto the set of orthogonal direct 
sum decompositions of K” into subspaces of the dimensions (n1,...,;), where 
ny =ki,n- =n—k,_, and nj+1 = ki+1 — k; in between. Let Fx(11, ...,;) denote 


either of these sets. A similar argument as for the GraBmann manifolds shows that 
Fir (11,.-.,2r) = O(n)/(OM) x ++ x O(n»), 
Fe(1,..., Mr) = U(n)/(U(m1) x --- x Un,)), 
Fuxy(n1, ...,n-) = Sp(n)/(Sp(m1) x +++ x Sp(n,)), 


where the subgroups consist of block diagonal matrices 
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with a; being an element of, respectively, O(n;), U(m;) or Sp(m;), i= 1,...,r. As 
before, these equalities are used to define a smooth structure on Fx(n1,...,,). 
The case r = | and nj =k reproduces the Grafmann manifolds: Fx(k,n — k) = 
Gx (k,n). 


Next, we investigate the special case where H is a normal subgroup. This 
means that aH = Ha, or equivalently C,(H) = H, for all a € G. In this case, 
G/H inherits a group structure, defined by the condition that the natural projection 
a :G-— G/H bea group homomorphism. 


Proposition 5.7.8 (Quotient Lie group) Let G be a Lie group with Lie algebra g 
and let H C G be a closed normal subgroup with associated Lie subalgebra § C g. 


1. G/H with the induced group structure is a Lie group. 

2. The natural projection x : G > G/H is a Lie group homomorphism. 

3. We have ker(dz) = h and dx induces an isomorphism from g/h onto the Lie 
algebra of G/H. 


The Lie group G/H will be referred to as the quotient Lie group of G by H 
and the Lie algebra of G/H will be identified with g/h by means of the natural 
isomorphism induced by dz without further notice. Under this identification, dz 
corresponds to the natural projection g > g/b. 


Proof 1. Since z is a group homomorphism, the multiplication mappings yz of G 
and fi of G/H satisfy (10 (mw x w) =z om. Using pairs of local sections of zr, this 
equality can be locally resolved for (1. Hence, i is smooth and G/H isa Lie group. 
2. This holds by construction. 
3. Due to ker(z7) = H and Example 5.6.11/1, ker(dzr) = b and the assertion fol- 
lows from the homomorphism theorem for algebras. !4 


Example 5.7.9 


1. Let g: G— H bea Lie group homomorphism. Then, ker(¢g) is a closed normal 
subgroup of G and hence G/ker(¢) is a Lie group with Lie algebra g/ker(dg). 

2. The identity component Go of G is a normal subgroup: for every a € G, Cg is 
a homeomorphism of G, hence C,(Go) is a connected component of G. Since 
Ca(1) = 1, it coincides with Go. As a connected component, Go is also closed 
and hence G/Go is a Lie group. The cosets coincide with the connected compo- 
nents of G. Hence, the quotient topology on G/ Go is discrete. Thus, the quotient 
Lie group of G by Go is given by the group-theoretical quotient of G by Go, 
equipped with the discrete smooth structure. 


'4The kernel of an algebra homomorphism is an ideal of the domain and the mapping induced by 
passing to the quotient is an algebra isomorphism onto its image. 
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Proposition 5.7.10 (Quotient homomorphism) Let G; be Lie groups with Lie alge- 
bras g; and let H; C G; be closed normal subgroups with associated Lie subalge- 
bras hi C gj, i = 1,2. Let p: Gj — G2 be a Lie group homomorphism satisfying 
g(H) C Ap. 


1. There exists a unique mapping @ : G1 /H, — G2/ Hz such that for the natural 
projections 1; : G; > G;/Hj,i =1, 2, there holds @om, = 1209. This mapping 
is a Lie group homomorphism. 

2. There holds dy(h,) C hz and d@ is given by the homomorphism g,/6, > 92/62 
induced by dg on passing to the quotients. 


Proof 1. Due to g(H1) C Ho, the mapping ¢ is well-defined. It is obviously unique. 
A brief calculation, using that 21, 22 and g are group homomorphisms, shows that 
@ is a group homomorphism. Since 7 admits local sections, ¢ is smooth. 

2. By (5.2.9), there holds d@ o dz, = dz o dg. Since under the identification of 
the Lie algebras of G;/H; with the quotients g;/h;, dz; corresponds to the natural 
projection g; — g;/b;, this implies the assertion. 


Propositions 5.7.8 and 5.7.10 provide a Lie group analogue of the homomor- 
phism theorem of group theory which states that for a group homomorphism 
g:G — H, the image im(@) is a subgroup of H and that the induced mapping 
@: G/ker(y) > im(@) is a group isomorphism. 


Proposition 5.7.11 (Homomorphism theorem) Let 9: G > H be a Lie group 
homomorphism. Then, im(g) is a Lie subgroup of H and the induced mapping 
@: G/ker(y) > im(Qg) is a Lie group isomorphism. 


The assertion may be rephrased by stating that (G/ker(g), ¢) is a Lie subgroup 
of H. 


Proof By Propositions 5.7.8 and 5.7.10, G/ker(@) is a Lie group and the induced 
mapping @ : G/ker(y) > H is a Lie group homomorphism. By construction, @ 
is injective and hence an immersion by Corollary 5.3.7. Thus, (G/ker(g), @) is a 
Lie subgroup of H and so is the image im(@) with respect to the smooth structure 
transported by @. 


Example 5.7.12 


1. The exponential function y : R — C, defined by t + e?7" has kernel Z and 
image U(1). The induced mapping yields a Lie group isomorphism from R/Z 
onto U(1). This extends to an isomorphism from R” /Z” onto T” for every n. 

2. The Lie group homomorphism SU(2) — SO(3) of Example 5.1.11 is surjective 
and has the centre Z2 of SU(2) as its kernel, hence the induced mapping yields 
a Lie group isomorphism from SU(2)/Zz2 onto SO(3). Similarly, the Lie group 
homomorphism SL(2,C) > SO@, 1) of Example 5.1.13 induces a Lie group 
isomorphism from the quotient of SL(2, C) by its centre onto the identity com- 
ponent of SO(3, 1). 
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Exercises 

5.7.1 Prove Corollary 5.7.4. 

5.7.2 Compute the dimensions of the Stiefel, Graf’mann and flag manifolds, see 
Examples 5.7.5—5.7.7. 

5.7.3 Let K=R, C, H and let d = dimp K. Show that the Stiefel manifold Sx (1, 1) 
is diffeomorphic to the sphere S¢”—!, cf. Example 5.7.5. 


Chapter 6 
Lie Group Actions 


In Sect. 6.1, we define the notions of Lie group action and G-manifold and collect 
some of their elementary properties. There is a variety of derived notions fitting to- 
gether to a geometric structure which will be studied in this chapter. In particular, 
a Lie group action gives rise to a special type of vector fields, so-called Killing vec- 
tor fields,! see Sect. 6.2. These vector fields span an integrable distribution whose 
integral manifolds coincide with the orbits of the group action. This way, every orbit 
is endowed with the structure of an initial submanifold. Starting from Sect. 6.3, we 
limit our attention to the important special class of proper group actions. Under this 
additional regularity assumption, one can prove the Tubular Neighbourhood Theo- 
rem? which constitutes one of the basic tools of the theory of Lie group actions, see 
Sect. 6.4. It states that for every orbit there exists a G-invariant neighbourhood and a 
diffeomorphism identifying this neighbourhood G-equivariantly with a G-invariant 
neighbourhood of the zero section in the normal bundle of this orbit. In particular, 
we study the case of a free proper action in some detail, because it gives rise to in- 
teresting bundle structures. Next, in Sect. 6.6, we study elementary properties of the 
orbit space of a given Lie group action, and in Sect. 6.7 we discuss invariant vec- 
tor fields. The latter notion is of basic importance for the study of physical systems 
with symmetries, see Chap. 10. In Sect. 6.8, we make some elementary remarks on 
relative equilibria and relatively periodic integral curves. 


6.1 Basics 


Let M be a smooth manifold and let G be a Lie group. Let g be the Lie algebra of G. 
Starting from this chapter, elements of g will be denoted by A, B,.... Depending 
on the context, they will be viewed either as left-invariant vector fields on G or as 
elements of the tangent space T1G. 


1Or fundamental vector fields. 


2 Also known as the Slice Theorem. 


G. Rudolph, M. Schmidt, Differential Geometry and Mathematical Physics, 269 
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Definition 6.1.1 (Lie group action) 


1. An action of G on M is a smooth mapping W : G x M — M such that the 
induced mappings 


WwW: MM, W,(m) := W(a,m), 


satisfy W% = idy and either Yo W = Wy or Yo W = Y, for alla, be G. 
In the first case, W is called a left action and in the second case a right action. 
The triple (M, G, W) is referred to as a Lie group action and the pair (M, W) as 
a G-manifold. 

2. A morphism of Lie group actions (Mj, G1, yw!) and (M), Go, yw?) (both left or 
both right) consists of a smooth mapping g : M; — M2 and a Lie group homo- 
morphism @ : Gj — G2 such that 


goW!=Wo(ex). (6.1.1) 


One says that the mapping ¢ intertwines the actions W! and W?. If G; = G2 
and @ is the identical mapping, ¢ is also called equivariant or, equivalently, a 
morphism of the G-manifolds (M1, W!) and (Mo, W”). 


Since YW o W,-1 = idy, the induced mappings W% are diffeomorphisms of M. 
Hence, the assignment at W, defines a group homomorphism (in the case of a left 
action) or a group anti-homomorphism (in the case of a right action) from G to the 
group of diffeomorphisms of M. Condition (6.1.1) is equivalent to 


Wea) og=yowW} forallacG. (6.1.2) 
In particular, a mapping g : M; — Mp? is equivariant with respect to the G-actions 


W' on M; if and only if 
Wog=gow! forallaeG. (6.1.3) 


Analogously to the notion of morphism one defines the notion of anti-morphism 
from a left to a right Lie group action, or vice versa, by requiring that @ be a Lie 
group anti-homomorphism. If G; = G2 = G and @ is the inversion mapping, that 
is, if 

Ww ,og=yow! forallaeG, (6.1.4) 


a 


then @ is called anti-equivariant or an anti-morphism of G-manifolds. If G is 
Abelian, there is no difference between left and right actions. In the general case, 
a left action can be turned into a right action, and vice versa, by composing it with 
the mapping G x M — G x M given by (a,m) +> (a~!,m). Then, idy is anti- 
equivariant and hence an anti-morphism of G-manifolds. 


Example 6.1.2 We encourage the reader to check the axioms of Lie group action for 
each of the following examples (Exercise 6.1.1). 


1. Every representation @ : G ~ GL(V) of G on a finite-dimensional K-vector 
space V defines a left action of G on V by W(a, v) := o(a)v forallae G,ve V. 
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Since the induced diffeomorphisms W%, = o(a) are linear, one speaks of a linear 
action. A homomorphism of representations of G is the same as a morphism of 
the associated G-manifolds. 

2. The operations of left translation, right translation and conjugation, defined 
in (5.1.3), yield actions of a Lie group on itself. Each of them may be restricted 
in the first argument to a Lie subgroup H C G, thus producing the actions of H 
on G by left and right translation and conjugation, respectively. 

3. For every closed subgroup H C G, the induced left translations L,:G /H—> 
G/H, where a € G, define an action of G on the homogeneous space G/H of 
left cosets. The natural projection G — G/H is equivariant and hence a mor- 
phism of G-manifolds. A similar statement holds for the induced right transla- 
tions Ry on the homogeneous space of right cosets. 

4. The flow of a complete vector field on M is an action of R on M. More generally, 
according to Proposition 3.2.15, the flows of commuting complete vector fields 
X1,..., X; on M define an action of the vector group R’ on M by 


MH = OP! 0---0 Oe, (6.1.5) 


By Proposition 3.2.13/2, every diffeomorphism g of M intertwines W with the 
corresponding action defined by g,.X1,..., @x Xr. 

5. With every Lie group action (M, G, W) there come induced actions of G on TM 
and T* M, given by (a, X) +>» W/X for all X € TM and 


(4, &m) +> Em © Yaw, (m) 


for all €,, € T,M, respectively. Since the induced diffeomorphisms of TM and 
T* M are vector bundle automorphisms, these actions turn TM and T* M into G- 
vector bundles, see Remark 6.1.3 below. The natural projections TM — M and 
T*M — M are equivariant. All of this carries over in an obvious way to arbitrary 
tensor bundles over M. 

6. The direct product of Lie group actions (M;, G;, w'), i =1, 2, both left or both 
right, is defined to be the action of G; x Gz on M; x M2 given by 


((a1, a2), (m1, m2)) > (Yq (m1), Ye, (m2) 


For i = 1, 2, the natural projections M; x Mz — Mj; and G, x G2 > G; yield 
morphisms of Lie group actions. The direct product of G-manifolds (M;, ¥'), 
i = 1,2, both left or both right, is defined to be the action of G on M, x M2 
given by 


(a, (mj, m2)) > (HW) (m1), ¥2(m2)). 


The natural projections M, x Mz > M;, i = 1,2, are equivariant. 
7. Let (M,G,W) be a Lie group action, (H, @) a Lie subgroup of G and (N, ¢) 
a submanifold of M. One says that (N, @) is invariant under (H, o) if 


Wor) (g(N)) C @(N). 


In this case, and if additionally (N, @) is initial, the restriction W of W in domain 
to the submanifold (H x N, @ x g) of G x M and in range to the initial subman- 
ifold (N, ~) of M is an action of H on N, called the restriction of (M, G, W) to 
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(H, @) and (N, ¢). It is determined by the relation g o W=Wo (Q x g), that is, 
by the condition that (g, g) be a morphism of Lie group actions from (N, H, VW) 
to (M,G,W). 


Remark 6.1.3 If M is endowed with an additional structure, like a Riemannian met- 
ric, one may require the diffeomorphisms WY, to respect this structure. This way, 
one obtains, for example, the notion of Riemannian Lie group action and Rieman- 
nian G-manifold (with G acting by isometries), symplectic Lie group action and 
symplectic G-manifold (with G acting by symplectomorphisms, cf. Chap. 8) and 
G-vector bundle (with G acting by vector bundle automorphisms). 


Next, we introduce the basic notions associated with a Lie group action 
(M,G,W), collect some of their elementary properties and describe the geomet- 
ric structure arising. Two points m,,mz € M are said to be conjugate under W if 
mz = W,(m 1) for some a € G. Obviously, to be conjugate is an equivalence relation 
on M. 


Definition 6.1.4 Let (M, G, W) be a Lie group action. 


1. The above equivalence classes are called the orbits of G under W. The set of 
equivalence classes, equipped with the quotient topology, is called the orbit space 
of (M, G, W) and is denoted by M/G. 

2. Form € M, the induced mapping WY, : G > M is called the orbit mapping of m. 

3. For m € M, the subgroup G», := {a € G: Y(m) =m} is called the stabilizer? 
of m. 

4. The kernel of W is the subgroup ker(W) := {a € G: YW, = idy} of G. 


The kernel of W coincides with the kernel of the corresponding (anti-)homomor- 
phism from G to the group of diffeomorphisms of M. By definition, ker(W) = 
(mem Gm. The orbit through m € M is denoted by G - m. It is given by 


G-m={W(m)eM:aeG}. (6.1.6) 
More generally, for a subset N C M and a subgroup H C G we denote 
H-N:={W,(m)€M:meN,ae H}. (6.1.7) 


Assume that W is a left action. Since for a € G, the orbit mapping YW, satisfies 
W,, (ab) = YW, (a) iff b € Gm, it induces an injective mapping 


Dn :G/Gm > M, Dn (aGm) := Yn (a), (6.1.8) 


where G/G,, denotes the homogeneous space of left cosets. Since the natural pro- 


A 


jection G > G/G,» is a surjective submersion, by Remark 1.5.16, Y%, is smooth. 


3 Another common name is isotropy group of m. 
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Moreover, it is equivariant with respect to the action of G on G/G,, by the induced 
left translations La and its image coincides with the orbit of m.* Thus, for every m, 
Dr, yields an equivariant bijection between G/G,, and the orbit through m. In the 
next section, this bijection will be used to induce a smooth structure on G - m. 


Proposition 6.1.5 Let (M, G, W) be a Lie group action. 


The action W is an open mapping. 

The natural projection m : M — M/G is an open mapping. 

M/G is locally compact and second countable. 

Gm is a closed subgroup of G for allm € M. The mapping a+> (W,)j,, defines a 
representation” of Gi on TM, called the isotropy representation at m. 

5. The kernel of © is a closed normal subgroup of G. The mapping 


alee a oe 


W :G/ker(W) x M> M, W o(o x idy) =, 


where 9 : G — G/ker(W) denotes the natural projection, is an action of 
G/ker(W) on M and (idy, ©) is a morphism of Lie group actions. 


Proof 1. This follows from ¥(V x U) =U, cv Ya(U) and the fact that the map- 
pings Y% are homeomorphisms. 

2. Let U C M be open. Then, Y%(U) is open for all a € G and hence 
a '(x(U)) = UeG Ya(U) is open. By construction of the quotient topology, then 
x (U) is open. 

3. Since M is locally compact, every m € M possesses a compact neighbour- 
hood U. Since z is open, 2(U) is a neighbourhood of z(m) in M/G. Since the 
image of a compact space under a continuous mapping is compact, 7(U) is com- 
pact. Hence, M/G is locally compact. Second countability holds by definition of 
the quotient topology. 

4. For every m € M, the stabilizer G,, is the preimage of m under the orbit 
mapping W,,,. The rest follows from points 5 and 7 of Example 6.1.2. 

5. Being the kernel of a group (anti-)homomorphism, ker(W) is normal. Being 
the intersection of the stabilizers of all points of M, it is also closed. Hence, by 
Proposition 5.7.8, G/ker(W) is a Lie group. Since YW, = idy for all a € ker(W), 17 
is well-defined. Since 9 is a submersion, W is smooth by Remark 1.5.16. The rest is 
obvious. 


Next, we observe that for every a € G, 


Gw,(m) =aGma_! (left action), Gw,(m) =a~'Gma (right action). (6.1.9) 


“Tn the case of a right action, Wj, (ba) = W,(a) iff b € Gm. Thus, Wn is defined on the homoge- 
neous space of right cosets by Vin (Gma) := (a). It is equivariant with respect to the induced 
right translations R,. 


5More precisely, a left (right) representation in case W is a left (right) action. 
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That is, the stabilizers of any two points lying on the same orbit are conjugate.® 


Definition 6.1.6 (Orbit type) Let (M@,G,W) be a Lie group action. The type of 
an orbit O is defined to be the conjugacy class of subgroups of G given by {Gi : 
m € O}. The set of conjugacy classes of subgroups of G which appear as types of 
orbits under W is called the set of orbit types of (VM, G, W). For a given orbit type 
[H], the subset of M of orbit type [H] will be denoted by My. 

We note that 


Mtn = {m € M:Gm =aHa' for somea € G}. (6.1.10) 


Definition 6.1.7 For a closed subgroup H C G, define 
My :={m € M:H =Gpy}, (6.1.11) 
M" :={méM:H C Gy}. (6.1.12) 
My and M® are called the subset of isotropy type H and the subset of fixed points 
under H, respectively. 
By construction, 
MH) =G- My. (6.1.13) 
Moreover, if H is compact, one has (Exercise 6.1.2) 
My = M" 29 Mpy}. (6.1.14) 


Finally, we introduce the following algebraic properties of Lie group actions. 


Definition 6.1.8 A Lie group action is called 


1. effective if the kernel is equal to {1} 
2. free if all stabilizers are equal to {1}, 
3. transitive if it has a single orbit. 


According to Proposition 6.1.5/5, by passing to the induced action of G/ ker(W), 
every action can be replaced by an effective action which has the same orbits. More- 
over, it is obvious that free actions and transitive actions have just one single orbit 


type. 


Example 6.1.9 Proofs are left to the reader (Exercise 6.1.3). 


®Subgroups H and H> of G are said to be conjugate if Hy = aH\a~! for some a € G. To be 
conjugate is an equivalence relation on the set of subgroups of G. The equivalence classes are 
called conjugacy classes. The conjugacy class of a subgroup H in G will be denoted by [#7]. 
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1. The kernel of the Lie group action associated with a representation coincides 
with the kernel of the representation. Hence, the action is effective iff the repre- 
sentation is faithful. The action is neither free nor transitive, because the origin 
is invariant. As an example, for the fundamental representation’ of G = SO(3) 
on R?, the orbits are given by the spheres about the origin and the origin itself. 
Therefore, the orbit space may be identified with the closed half-line [0, 00). The 
stabilizer of a nonzero x € R? consists of the rotations about x, hence it is iso- 
morphic to SO(2). Since it does not depend on the norm of x, all stabilizers of 
nonzero vectors belong to the same orbit type. The stabilizer of the origin is the 
whole group. 

2. The actions of G on itself by left and right translations are effective, free and tran- 
sitive. The restrictions of these actions to a proper Lie subgroup H C G are still 
effective and free but no longer transitive. The orbits are the right and left cosets 
of H, respectively. According to Theorem 5.7.2, the orbit space G/H carries in 
addition a smooth structure. The action of G on itself by inner automorphisms 
is neither free nor transitive, because the unit element is invariant. The kernel 
is given by the centre. Hence, the action is effective iff the centre is trivial. The 
orbits are called the conjugacy classes of elements of G. The stabilizer of a e G 
is the centralizer of a in G. Therefore, the orbit types are given by the conjugacy 
classes of those subgroups of G which are centralizers of elements of G. The 
orbit space is usually referred to as the adjoint quotient of G. 

3. The action of G on the homogeneous space G/H, where H C G is a nontrivial 
closed subgroup, by the induced left translations Lg is transitive but not free. The 
stabilizer of a coset aH, a € G, is given by Ggy =aH ath, Consequently, the 
kernel of the action is given by (),<g aH a~'. It lies between the intersection of 
the centre of G with H and H itself, the latter being the case when H is normal. 

4. For the action of G = R on M defined by the flow ® of acomplete vector field X, 
the orbit mapping of a point m € M is given by the maximal integral curve ®»,. 
The stabilizer is G,, = R in case X» = 0, Gm = ZT in case the maximal in- 
tegral curve ®,, is periodic with period 7, or G,», = {0} otherwise. Since G is 
Abelian, the orbit types correspond bijectively to the subgroups which appear as 
stabilizers. 


Remark 6.1.10 Every Lie group action (M,G,W) induces a representation of 
G on C%(M) by the algebra automorphisms f t Se f,a¢€G. A function 
f €C™(M) is called invariant under W if it is a fixed point of this representa- 
tion, that is, if W* f = f for all a € G. This means that f is equivariant with respect 
to the trivial action of G on R. The invariant functions form a subalgebra which 
will be denoted by C°®(M)®. Analogously, Y induces representations of G on the 
various types of smooth tensor fields by T +> W%,.T. T is said to be invariant if it 
is invariant under this representation. This is equivalent to being equivariant as a 
mapping from M to the corresponding tensor bundle, where the latter carries the in- 
duced action of Example 6.1.2/5. More generally, a section of a G-vector bundle is 


7Given by matrix multiplication of SO(3)-matrices with elements of R>. 
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said to be invariant if it is invariant under the transport operators of the vector bundle 
automorphisms provided by the G-action, and this is equivalent to being equivariant 
with respect to the action of G on the bundle manifold and the induced action on 
the base manifold. 


Exercises 

6.1.1 Verify the axioms of a Lie group action for the examples of 6.1.2. 

6.1.2 Prove Formula (6.1.14). Try to find a counterexample for noncompact H. 
6.1.3 Verify the statements of Example 6.1.9. 


6.2 Killing Vector Fields 


Throughout this section, let (MW, G, W) be a Lie group action and let g be the Lie 
algebra of G. For A € g, we have 


Wexp(td) 0 Wexp(s) = Vexp(ta) exp(sA) = Vexp((t-+ts)A)> 


hence the assignment (7m, t) +> Wexpira)(m) defines a one-parameter group of dif- 
feomorphisms of M, that is, a complete flow. 


Definition 6.2.1 (Killing vector field) The vector field on M defined by the flow 
WexpcrA) is called the Killing vector field generated by A. It will be denoted by A,. 


The value of A, at m € M may be expressed in several equivalent ways: 


d d 
(Agm = =. Wexpray(m) = — Wn (exp(tA)) = Wf (A). (6.2.1) 
dt fp dt fp 


Proposition 6.2.2 Let (M, G, W) be a Lie group action. 


1. For every a € Gand A € g, there holds Wy,.Ax = (Ad(a) A) in case W is a left 
action and Wy, Ay = (Ad(a~!)A), in case W is a right action. 

2. The mapping g > X(M) given by At> A, is an anti-homomorphism of Lie al- 
gebras if W is a left action and a homomorphism if W is a right action. The kernel 
of this mapping coincides with the Lie algebra of ker(W) C G. In particular, if 
W is effective, the mapping At> A, is injective. 

3. Form € M, the Lie algebra of Gy is given by gm = {A € g:(Ax)m = O}. In 
particular, if W is free, then A, cannot have critical points. 


Proof 1. Due to Proposition 3.2.13/2, the flow of Wj,A, iS Wyo Wexpay 0 Yy-1, 
which is equal to Wexpraa(ayA) if W is a left action and to Yep adia-!yay If WY isa 
right action. 

2. We give the proof for the case where W is a left action. Linearity is obvious 
from (6.2.1). According to Proposition 3.3.2/2 and point |, for A, B € g, we have 


d d 
[Ax, Belm = (La, By)n = att (Wexp(—1A)« Ba )n = Wi, (=, (Aa{exp(—rA)}3)). 
0 0 


dt dt 
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Now, (5.4.2) and Proposition 5.4.3 yield [A,, B,] = —[A, B],.. Furthermore, A be- 
longs to the Lie algebra of ker(W) iff Wexpa.a) = idy for all t. Since Wexp¢) 18 the 
flow of A,., this is equivalent to A, = 0. 

3. Let A € g. By Theorem 5.6.8, A belongs to the Lie algebra g,, of G,, iff 
Wexp(ta)(m) = m for all t, that is, iff m is an equilibrium of A,. 


Remark 6.2.3 According to point 3, m is an equilibrium of A, for all A € g,,. Thus, 
by Formula (3.6.2), the representation of g,, on T,,M induced by the isotropy rep- 
resentation of G,, is given by 


dt 


It is referred to as the isotropy representation of g,,. 


d 
Gm > End(TnM), Are dt} (Wexp(r))m = Hess, (Ax). 
0 


Proposition 6.2.4 (Transformation properties) 


1. Let (M;, G;, ¥'), i = 1, 2, be Lie group actions and let p: M, + M2,@:G\ > 
G2 define a morphism. The Killing vector field on M, generated by an element A 
of the Lie algebra of G, is y-related to the Killing vector field on Mz generated 
by do(A). - 

2. Let (Mj, ¥'), i = 1,2, be G-manifolds and let g : M, > Mp2 be equivariant. For 


every A € g, the Killing vector fields AM on M; generated by A are o-related. 


If g is a diffeomorphism, points | and 2 yield, respectively, 
(do(A)), = @xAx, AM — 9AM, (6.2.2) 


Proof According to Proposition 5.3.6, for m € Mj, we have 


d 
(@' 0 As)(m) =F 9 2 erp, cay ™) 


d 2 

= dt to Po(expa, (tA)) ° g(m) 
a 2 

— Ar to Pexpg, (tdg(A)) fe) g(m). 
This yields the first assertion. The second one follows by letting Gj = Gz = G and 
Q=idg. 


Example 6.2.5 Proofs are left to the reader (Exercise 6.2.1). 


1. For the left action of G associated with a representation 0 : G > GL(V) ona 
finite-dimensional K-vector space V, the Killing vector field A, is the linear 
vector field on V corresponding to the endomorphism do(A). In particular, for 
the adjoint and the coadjoint actions we obtain 


AS¢ = ad(A), ASM —ad*(A), A€g, (6.2.3) 


and for the identical representation of a classical group G C GL(V), under the 
natural identification of g with a Lie subalgebra of gI(V), we have A, = A. 
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. For the action of G on itself by right translations we get A, = A. That is, 


the homomorphism g — X(G) of Proposition 6.2.2/2 coincides with the natu- 
ral inclusion mapping. For the action of G on itself by left translations, A, is 
the right-invariant vector field on G generated by A € T,G. According to Re- 
mark 5.2.21/3, this means that the anti-homomorphism g > X(G) is given by 
the natural inclusion mapping, composed with — inv... For the action of G on 
itself by inner automorphisms, A,, is the difference between the right and the 
left invariant vector fields generated by A and hence the anti-homomorphism 
g— X(G) is given by —(1 +inv,). 


. Let H CG be a closed subgroup and let 7 : G ~ G/H denote the natural 


projection. For the action of G on the homogeneous space G/H by induced 
left translations, Proposition 6.2.4/2 and the previous result yield (A)z(a) = 
x’ oR’A forallae Gand A eg. 


. For the action of G = R on M defined by the flow of a complete vector field X, 


under the identification of the Lie algebra of R with R itself, the homomorphism 
g— X(M) is given by sts, =sX. 


By analogy with Lie group actions and their morphisms one defines 


Definition 6.2.6 (Lie algebra action) 


Il. 


An action of a Lie algebra g on a smooth manifold M is a mapping w : g > 
X(M) which is either an anti-homomorphism or a homomorphism of Lie alge- 
bras. In the first case, w is called a left action and in the second case a right 
action. The triple (M, g, y) is referred to as a Lie algebra action and the pair 
(M, w) as a g-manifold. 


. A morphism of Lie algebra actions (M1, 91, 71) and (M2, go, w2) (both left or 


both right) consists of a smooth mapping g : M; — Mp and a Lie algebra ho- 
momorphism @ : g; — gz such that the vector fields (A) and wW2 0 @(A) are 
g-related for all A € gj. If g1 = go = g and oc = id, ¢ is called equivariant or, 
equivalently, a morphism of g-manifolds. 


Note that the induced mapping g x M > TM, (A,m) ® ((A))m, is automati- 


cally smooth. Using the notion of Lie algebra action, Propositions 6.2.2/2 and 6.2.4 
may be restated as follows (Exercise 6.2.3). 


Corollary 6.2.7 


1. 


2. 


The Killing vector fields of a left or right Lie group action (M, G,W) induce a 
left or right action, respectively, of the Lie algebra g of G on M. 

If (g, @) is a morphism of Lie group actions, then (y, dQ) is a morphism of the 
corresponding Lie algebra actions. If p is a morphism of G-manifolds, then it is 
also a morphism of the corresponding g-manifolds. 


Next, consider the distribution D§ on M spanned by the Killing vector fields, 


that is, 


De = {(Ax)m €T,»M:A€g}. 
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According to points | and 3 of Proposition 6.2.2, we have 


De (n) = w' Ds (6.2.4) 


for all a € G, and dim(D$) = dim(G) — dim(G,,), respectively. In particular, D§ 
need not be regular; this is so, however, if W is free or transitive. We will show that 
D® is integrable and that the integral manifolds are given by the orbits. For that 
purpose, recall the mapping Yn iG /Gm — M, defined by (6.1.8). 


Theorem 6.2.8 (Orbit Theorem) Let (M,G,W) be a Lie group action. For every 
méM, (G/Gy, Yn) is an initial submanifold of M whose connected components 
are maximal integral manifolds of D8. In particular, D9 is integrable. 


Proof We give the proof for the case of a left action. Let m € M and let r° : G > 
G/G,» denote the natural projection. First, we show that Vin is an injective immer- 
sion. Injectivity was already noted before. Since Vin on? =W,,, for (Gn), Gn, 1 be 
injective for all aG,», € G/G, it suffices to show that 


G / 
ker(Wm)i, C ker(x) | 
for all a € G. To see this, let A € g be such that (%,,)/ Ag = 0. Due to 
(Win)' Aa = (Wn) ° (La)'A1 = (Wa) ° (Wn) At = (Wa) (Ax), (6.2.5) 
and since (W,)’ is bijective, we obtain (A,.)_ = 0 and hence A € g,, by Proposi- 
tion 6.2.2/3. Then, exp(tA) € G,» for all tf € R and hence 
d 


(x)! (Ag) = — 2% (aexp(tA)) =0. 
¢ dt to 


This proves that Ww, is an injective immersion. 

Second, we show that the connected components of (G/Gin, Din) are maximal 
integral manifolds of D9. For simplicity, we may assume that G and hence G/G, 
are connected. Due to 1@ being a submersion, for (G/Gim, Dn) to be an integral 
manifold of D9 it suffices that im((W»)/,) = De tn) for all a € G. The latter is true, 
indeed, because using (6.2.5) and Proposition 6.2.2/1, for A € g one finds 


(Yn)' Aa = (Wa)! (Ax) = (Wax Ax)W,(m) = ({Ad@)A}.) yom 


and hence (Ax)y,(m) = (Gn) (Ad(a~!)A)m.- Maximality follows from the fact that 
the images of the connected components of the submanifolds (G/Gi, Dn), mewM, 
establish a disjoint decomposition of M. Finally, since m was arbitrary, we can 
conclude that D¥ is integrable. Hence, Theorem 3.5.15 yields that (G/Gyn, Dn) is 
initial. 


As a consequence of Theorem 6.2.8, according to Remark 1.6.13/5, the smooth 
structures induced on an orbit O by the submanifolds (G/Gm, Yn), m € O, coin- 
cide for all m, so that we can view O itself as a submanifold.’ Then, Theorem 6.2.8 
implies 


8There is also a direct argument proving this, see Exercise 6.2.4. 
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Corollary 6.2.9 The orbits of W are initial submanifolds of M whose connected 
components are maximal integral manifolds of D8.° 


Remark 6.2.10 


1. Let (M, G, W) be a Lie group action, let O be an orbit and let m € O. Consider 
the isotropy representation of G,, on TM, cf. Proposition 6.1.5/4. By Proposi- 
tion 6.2.2/1, 


(WaYin (Ax) = (Wax Ax) = ((Ad(a)A),),,.- (6.2.6) 


Therefore, the subspace T,, O of T,,M is invariant and the isotropy representa- 
tion descends to a representation on N,, O, which for reasons that will become 
clear later is called the slice representation at m. By construction, the natural 
projection T,,M — N,, O intertwines the isotropy representation with the slice 
representation. 

On the other hand, the isotropy representation restricted to T,,O can be 
brought to the following normal form. By Theorem 6.2.8, the mapping (Wn) : 
g— TnM has image T,, O. By point 3 of Proposition 6.2.2 it has kernel gm. 
Hence, this mapping induces a vector space isomorphism 


Tn O = 9/9m- (6.2.7) 


In view of (6.2.1), Formula (6.2.6) implies that this isomorphism intertwines the 
representation of G,, on g/Gm induced by the adjoint representation with the 
isotropy representation on T,,, O (Exercise 6.2.5). 

2. Let (M, G, W) be a Lie group action and let O be an orbit. Since O is invariant 
under W and since it is an initial submanifold of M, W restricts to a transitive 
action of G on O. Since the connected components of O are integral manifolds 
of DS, (6.2.4) implies that the action of G on TM induced by W leaves invariant 
the submanifold TO.!° The restriction of this action to the invariant submanifold 
(TM);o induces an action WN of G on the normal bundle NO = (TM) to/TO 
by the vector bundle automorphisms! ! 


Ww) (LX) = [Y(X)], (6.2.8) 


where a € G and X € TM. The natural projections (TM);9 — NO — O are 
equivariant. Equivariance of the second projection implies that WN restricts to 
an action of G,, on N,,O for every m € O. This action coincides with the slice 
representation at m. 

3. A manifold carrying a transitive Lie group action is said to be homogeneous. 
The Orbit Theorem 6.2.8 yields that every homogeneous manifold is diffeomor- 
phic to G/H for some Lie group G and some closed subgroup H of G. The 


°In particular, according to Proposition 3.5.21, they form the foliation associated with D8. 
‘OBoth of these statements follow as well by viewing O as the image of the submanifold 
(G/Gmn, Win) and using that Wn is G-equivariant. 


'l Smoothness follows from the existence of local sections of the submersion (TM) to > NO. 
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diffeomorphism provided by this theorem is obviously equivariant with respect 
to the action of G on G/H by the induced translations La (in the case of a left 
action) or Ry (in the case of a right action). Hence, it is an isomorphism of Lie 
group actions. In this sense, all of the homogeneous spaces discussed in Sect. 5.7 
constitute homogeneous manifolds. This explains their name. In the spirit of Re- 
mark 6.1.3, the notion of homogeneous manifold may be further specialized, e.g., 
to that of homogeneous Riemannian manifold (a Riemannian manifold carrying 
a transitive Lie group action by isometries). 


Exercises 

6.2.1 Provide proofs for the statements of Example 6.2.5. 

6.2.2 Calculate the Killing vector fields for the natural action 
(a) of SO(2) on R?, 
(b) of SO(3) on R3, 
(c) of SU(2) on C?. 
For SO(3) and SU(2), use the bases given in Example 5.2.8. Read off the 
Killing vector fields on the corresponding unit spheres in R? and C?, respec- 
tively. 

6.2.3 Verify the statements of Corollary 6.2.7. 

6.2.4 Let (M,G,W) be a left Lie group action. Show that for every m € M and 
aeéG, R,-1 descends to a diffeomorphism g : G/Gy, > G/ Gy, (m) satisfy- 


ing Wy, (m)°9= Din. Use this for showing that the smooth structures induced 
on an orbit O by means of the submanifolds (G/Gy, Din) m € O, do not 
depend on the choice of m. (This proof does not use the fact that orbits are 
initial submanifolds.) 

6.2.5 Use Proposition 6.2.2/1 and Formula (6.2.1) to show that the vector space 
isomorphism g/gm = TO induced by the mapping (Win): g > TM in- 
tertwines the representation of G,, on g/g induced by the adjoint represen- 
tation with the isotropy representation on T,, O, cf. Remark 6.2.10/1. 


6.3 Proper Actions 


Lie group actions possessing the following topological property behave particularly 
well. For example, they allow for local models in terms of vector bundles. This has 
important consequences for the structure of both the manifold carrying the action 
and the orbit space. 


Definition 6.3.1 (Proper action) 


1. A continuous mapping of locally compact Hausdorff spaces (in particular, of 
manifolds) is proper if the preimages of compact subsets are compact. !? 


More generally, this definition applies when the domain is Hausdorff and the range is locally 
compact Hausdorff [53]. Compactness of subsets is understood with respect to the relative topol- 


ogy. 
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2. A Lie group action (M, G, W) is called proper if the following mapping is proper: 
Vox: Gx M>Mx M, Wexi(a,m) := (Walm),m). (6.3.1) 


We give two equivalent characterizations of proper mappings. Note that since 
manifolds are metrizable, one may use the criterion that a subset is compact iff 
every sequence contains a subsequence converging in that subset. 


Lemma 6.3.2 For a continuous mapping g : M — N between manifolds, the fol- 
lowing statements are equivalent. 


1. The mapping ¢ is proper. 

2. Any sequence in M whose image converges in N contains a convergent subse- 
quence. 

3. The mapping ¢ is closed and y~'(p) is compact for all p € N. 


Proof 1 = 2: Let {my,} be a sequence in M such that {g(m,)} converges in N. 
Since N is locally compact, there exist ng € N and a compact subset K C N such 
that y(m,) € K for alln > no. Then, my € gy”! (K) for all n > ng. Since ¢ is proper, 
g7! (K) is compact, hence {m,,} contains a convergent subsequence. 

2 = 3: The mapping ¢ is closed: let K C M be closed. Let { p,} be a sequence in 
gv(K) which converges to some p € N. For every n there is m, € K such that p, = 
g(m,). By point 2, {m,} contains a subsequence converging to some m € M. Since 
K is closed, m € K. Then, p = g(m) € g(K). Thus, g(K) is closed. Compactness 
of g~!(p) for p € N is obvious. 

3 = 1: Let K C N be compact and let {U; : i € I} be an open covering of 
gy !(K). Let p € K. Since {U; :i € J} is an open covering of go '(p) and since 
the latter is compact, there is a finite subcovering {U; :i € Ip}. Let V, denote the 
complement in N of the image under g of the complement of Lj. i. U; in M. By 
construction, p € Vp. Since @ is closed, V, is open. Hence, {V, : p € K} is an 
open covering of K. Since K is compact, there is a finite subcovering labelled by 
Pi,-++, pr. Then, {Uj :i € Ip, U--- U Tp,} is a finite subcovering of {U; :i € J}. 


Lemma 6.3.2 implies the following equivalent characterizations of a proper ac- 
tion. 


Corollary 6.3.3 For a Lie group action (M,G,W), the following statements are 
equivalent. 


1. The action is proper. 

2. If {an} is a sequence in G and {my} is a sequence in M such that {m,} and 
{Wa, (Mn)} converge, then {an} contains a convergent subsequence. 

3. The mapping Wex, is closed and all stabilizers of VY are compact. 


Proper actions have the following elementary topological properties. 


Proposition 6.3.4 Let (M, G, W) be a proper Lie group action. 
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1. The restriction of W to the product of a closed subgroup H C G with an H- 
invariant initial'* submanifold is proper. 

2. The orbit mapping Wp is proper for everym € M. 

3. The orbit space M/G is Hausdorff. 


Proof 1. We apply Corollary 6.3.3/2. Without loss of generality, we may assume 
that the subgroup and the submanifold are given by subsets. Thus, let H C G bea 
closed subgroup and let N C M be an H-invariant initial submanifold. Let {a,} and 
{mn} be sequences in H and N such that {m,} and {Y%,, (mn)} converge in N. Since 
N is a submanifold, the natural inclusion mapping N — M is smooth and hence 
continuous. Therefore, {m,} and {Wz, (mn)} converge in M. Since W is proper, {ay} 
contains a subsequence which converges in G. Since # is closed, the limit belongs 
to H. 

2. We apply Lemma 6.3.2/2. Let {a,} be a sequence in G such that {W%,(a,)} 
converges. Then, the sequences {a,} and {m, = m} satisfy the assumption of Corol- 
lary 6.3.3/2. Since W is proper, it follows that {a,} contains a convergent subse- 
quence. 

3. The argument is analogous to that for homogeneous spaces, cf. Lemma 5.7.1/3. 
Denote the natural projection by 7: M > M/G. Let m,,m2 € M be such that 
(m1) A (m2). Then, (m1,m2) ¢ im(Wext). Since im(Y%xt) is closed in M x M, 
there exist open neighbourhoods U; of m; in M such that (U; x U2) Nim(Wext) = S. 
Then, 2(U1) A w(U2) = ©. Since, by Proposition 6.1.5/2, the mapping z is open, 
z(U;) is a neighbourhood of z(mj;), i = 1, 2. 


The Orbit Theorem 6.2.8 and point 2 of Proposition 6.3.4 imply 


Corollary 6.3.5 The orbits of a proper Lie group action are closed embedded sub- 
manifolds. 


Proof Letm € M. By Proposition 6.3.4/2 and Lemma 6.3.2/3, WY, is closed. Hence, 
the image W,,(G) is closed and the induced mapping Din :G/Gm — M is closed. 
Since the latter is bijective onto W,,(G), it is open onto W,,(G) in the relative topol- 
ogy induced from M. 


Remark 6.3.6 Let (M, G, W) be a proper Lie group action, let O be an orbit and let 
m € O. The fact that G,, is compact has the following immediate consequences for 
the isotropy representation. 


1. By Proposition 5.5.6, T,, M admits a G,,-invariant scalar product. 

2. The orthogonal complement (T,, o)+ of T,, O in T,, M with respect to this scalar 
product is invariant under the isotropy representation. Hence, the isotropy repre- 
sentation decomposes into the direct sum 


TmM =TmO ® (TmO)*. 


'3The assumption that the submanifold be initial is made to ensure that the restricted action is 
smooth, cf. Example 6.1.2/7. 
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3. The natural projection T,, M — N,,O restricts to an isomorphism (T,, o)t => 
Nm O which intertwines the isotropy representation induced on (T,, O)+ with the 
slice representation. 

4. Points 2 and 3 imply that there exists a vector space isomorphism 


TnmM =TnO @®NnO (6.3.2) 


intertwining the isotropy representation with the direct sum of the isotropy rep- 
resentation on T,, O and the slice representation. 


Next, we discuss the tool of averaging. Depending on whether W is a left or 
a right action, choose a left or a right-invariant volume form da on G, cf. Corol- 
lary 5.5.4. Let T be a smooth tensor field on M of type (p, g) with compact support. 
Due to Proposition 6.3.4/2, for every m € M, the smooth mapping G > (14M dm 
given by at (WT) m has compact support (Ww! (supp(T)))7! Cc G. Hence, one 
can define a mapping T° : M > T,M by 


18 = | oT mda, 
G 


where the integrand is a differential form on G of maximal degree with values in the 
finite-dimensional real vector space (4, M)m. The reader may convince himself that 
the local representative of T° with respect to a chart (U,«) on M and the induced 
chart on 14M is given by a system of real-valued functions on «(U) each of which 
is obtained by integrating a compactly supported smooth function on G x «(U) 
over the first variable. Thus, 7 is a smooth tensor field of the same type as T. By 
construction, T is invariant and supp(T°) = G - supp(7). We call T© the average 
of T with respect to W. Since the choice of da is unique up to a nonzero real factor, 
so is the average. If G is compact, one may fix this factor by requiring that G have 
unit volume. Note that taking the average is a linear operation but it is obviously not 
compatible with the multiplication of tensor fields. 


Proposition 6.3.7 Let (M, G, W) be a proper Lie group action. 


1. The invariant smooth functions separate the points of M/G.'* 

2. Every covering of M by invariant open subsets admits a countable subordinate 
partition of unity by invariant smooth functions. 

3. There exists an invariant Riemannian metric on M. 


Proof Let x : M — M/G denote the natural projection. 

1. Let m,, m2 € M be such that 2 (m 1) 4 2(mz). Since the orbit G - mz is closed, 
using a local chart at m; one can construct a smooth function f > 0 on M with com- 
pact support such that f(m,) = 1 and supp(f) N (G - m2) = S. Then, the average 
satisfies f%(m,) > 0 and f°% (m2) =0. 


14That is, for all m,,m2 € M such that G- m; 4% G- mz, there is f € C?(My)S with f(m,) 4 
f(mz). 
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2. Let {U;} be a covering of M by invariant open subsets. Then, {7 (U;)} is an 
open covering of M/G. Since, by Propositions 6.1.5/3 and 6.3.4/3, M/G is locally 
compact, second countable and Hausdorff, there exists a countable subordinate open 
covering {V; :i € J C N} which is locally finite, see for example [302, Lemma 1.9]. 
Then, {~!(V;) :i € J} is a locally finite covering of M by invariant open subsets, 
subordinate to {U;}. According to Proposition 1.3.7, there exists a partition of unity 
{fi : G,a@) € ACI x N} with compact supports such that supp( fia) C a !(V;) 
for all (i, a) € A. By passing to the averages ce we obtain a family of invariant 
smooth functions. However, these functions need no longer add up to 1 and the 
family of their supports need no longer be locally finite. This can be remedied by 
an appropriate summation as follows. For f € C°(M), let f(m) denote the k-th 
order tangent mapping at m, viewed as a k-linear mapping T,,M x ---x TnM —> R, 
and let || f“ (m)|| denote the operator norm of this mapping with respect to a chosen 
Riemannian metric on M, cf. Proposition 4.4.2. Define 


[o,@) 
Cia = sup{| f2@n)|:0<k<a}, fim):= > °2-°C FS (m). 
a=0 
The latter series converges absolutely for all m € M, and so do all formal deriva- 
tives, hence f; is well-defined and smooth.!° By construction, f; is invariant and 
supp( fi) C «7 !(V;). In particular, the family {supp(f;) : i € 1} is a locally finite 
covering of M. Hence, }°;.; f;(m) is well-defined for all m and by dividing f;(m) 
by this sum one finally obtains the desired partition of unity. 

3. Choose a countable atlas {(U;, «;) : i € J} on M such that the closures of the U; 
are compact and carry out the construction of the symmetric second-order covariant 
tensor fields g;, i € J, as in the proof of Proposition 4.4.2. Averaging yields invariant 
tensor fields or. As noted in the proof of point 2, the open covering {7 (supp(g;))} 
of M/G admits a subordinate countable, locally finite covering {V; : j € J}. For 
each j, choose i; so that V; C supp(gi,) and define g(m) := es gy (m). This is 
an invariant smooth second-order covariant tensor field on M. Since on the interior 
of their supports, the g; are Riemannian metrics, and since averaging does not affect 
this property, g is a Riemannian metric.!° 


Example 6.3.8 


1. Due to Corollary 6.3.3/2, for an action to be proper it suffices that G be compact. 
If M is compact, then this condition is also necessary. In particular, the action 
induced by the flow of a complete vector field on a compact manifold cannot be 
proper. 

2. The action associated with a representation of a Lie group G on a K-vector space 
V is proper iff G is compact, because the stabilizer of the origin is G. This 
applies, in particular, to the adjoint representation. 


'STn fact, the construction is designed so that the sequence of smooth functions, given by the partial 
sums, converges in an appropriate topology on C%°(M). For the latter, see e.g. [211]. 


'6For an alternative proof, see Exercise 6.3.1. 
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3. The action of G on itself by left or right translation is proper: let {a,,} and {b,} be 
sequences in G such that b, — b and a,by > b. Then, a, = (anbn)b;! —> bo), 
The action of G on itself by inner automorphisms is proper iff G is compact, 
because the stabilizer of the unit element is G. By Proposition 6.3.4/1, these 
statements carry over to the corresponding induced actions of a closed subgroup 
on G. 


Remark 6.3.9 Let (M,, ¥') and (Mo, ¥?) be G-manifolds and let g : Mj > Mp be 
equivariant. If W is proper, then W! is proper, too: let {a,} and {m,} be sequences 
in G and M,, respectively, such that {m,} and {Wi (my)} converge. Then, {g(m,)} 
and {We (g(m,))} also converge and thus {a,} contains a convergent subsequence. 
This has the following consequences. 


1. If an action is proper, so is the induced action on the tensor bundles. 
2. The direct product of G-manifolds is proper if one of the factors is proper. 


Exercises 

6.3.1 Prove assertion 3 of Proposition 6.3.7, using the following ingredients: a Rie- 
mannian metric on M, a partition of unity { f; : i €¢ J C N} such that supp( fj) 
is compact and contained in the interior of supp(fj+1) for all 7, and an in- 
variant partition of unity subordinate to the covering of M by the interiors of 
We (supp( fi). 

6.3.2 Let (M,G,W) be a Lie group action with G compact and let m € M bea 
fixed point of W, 1.e., G,, = G. Show that every neighbourhood of m contains 
an invariant open neighbourhood. 

6.3.3 Let (M, G, W) be a proper Lie group action. Show that for every m € M, the 
isotropy representation at m admits an invariant scalar product. 


6.4 The Tubular Neighbourhood Theorem 


The Tubular Neighbourhood Theorem, or Slice Theorem, is one of the basic tools 
in the theory of proper Lie group actions. It is due to Montgomery and Yang [215] 
and Mostow [227] for compact Lie group actions and to Koszul [172] and Palais 
[233, 234] for proper Lie group actions. !7 

Recall that a G-vector bundle is a vector bundle endowed with an action of G by 
vector bundle automorphisms. 


Definition 6.4.1 (Tubular Neighbourhood and Slice) Let (MW, G, W) be a Lie group 
action and let O C M be an orbit. 


'TThe special case of a free compact Lie group action is due to Gleason [107]. 
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1. A tubular neighbourhood of O is an equivariant diffeomorphism x : U > E 
from an open invariant neighbourhood U of O in M onto an open neighbourhood 
V of the zero section so in a G-vector bundle E over O such that x10 = So. 

2. For m € O, the G,,-invariant neighbourhood V,, := VM E,, of the origin in Ej, 
is called the linear slice of x at m and the G,,-invariant embedded submanifold 
Un = gol) of M is called the slice of x at m. 


One says that the G-vector bundle E provides a local normal form for W near O. 


Remark 6.4.2 


1. Recall from Remark 6.2.10/2 that W induces an action WN of G on the bundle 
manifold of the normal bundle NO, turning NO into a G-vector bundle. We will 
show below that one may always choose E = NO. 

2. Let m € O. Since the orbit mapping G/G,, — O induced by m is an equivari- 
ant diffeomorphism, one might as well assume FE to be a G-vector bundle over 
G/G,. In this case, the requirement x;o = so reduces to x (m) = so([1]). 

3. The slices U;,, m € M, of a tubular neighbourhood x : U > E of O have the 
following properties. 

(a) G-U, =U and Uy, is closed in U. 

(b) Ifa eG satisfies W(Um) Um 4 S, then a € Gm. 

(c) Uj, intersects every orbit in U transversally. 

The proof consists in showing that the fibres E,,, of the G-vector bundle E have 
these properties. We leave the details to the reader (Exercise 6.4.1). 


Theorem 6.4.3 (Tubular Neighbourhood Theorem) Every orbit of a proper Lie 
group action admits a tubular neighbourhood. 


Proof To be definite, we give the proof for a left action. As noted above, we will 
construct a tubular neighbourhood by means of the normal bundle NO and the ac- 
tion WN, Choose a point m € O. The plan of the proof can be summarized in the 
following commutative diagram, 


G x M<2—G x Um —202" + G x Nn O—<+G x NO 
“| “| we (6.4.1) 
M = U us NO = NO 


whose ingredients will be constructed step by step and which will ultimately de- 
fine the desired diffeomorphism x. We start with constructing the mapping ¢ : 
Um — NmO. According to Remark 6.3.6/1, we may choose a G,,-invariant scalar 
product on T,,M. Let (T;, O)+ denote the corresponding orthogonal complement!® 
of TO. 


'8 Any G,,-invariant complement will do. 
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Lemma 6.4.4 There exists a G,,-invariant embedded submanifold Uj, of M con- 
taining m such that 


Tn Un = (Tn O)+ 


and a Gm-equivariant diffeomorphism @ from Um onto a Gy -invariant open neigh- 
bourhood of the origin in Ny O such that p(m) = 0. 


Proof of Lemma 6.4.4 Clearly, there exists a diffeomorphism g; from an open 
neighbourhood W;, of m in M onto an open neighbourhood of the origin in T,,M 
satisfying g|(m) = 0 and 


(91) in = idt,,M - (6.4.2) 


According to Exercise 6.3.2, W,; may be chosen to be G,,-invariant. Using the 
unique invariant volume form da of unit volume on G,,, cf. Corollary 5.5.9, we 
take the average 


eo” :Wi>TnM, gen (-) ‘= i ((Wa)in ogio W-1 (p)) da. 


m 


Using (6.4.2), for X € TM, we obtain 
Cage (X)= / (Wain © (D1) )y © (Yy-1)\n (X)) da = X. (6.4.3) 


Hence, by the Inverse Mapping Theorem 1.5.7, yg,” restricts to a diffeomorphism 
from an open neighbourhood W2 of m in M onto an open neighbourhood V) of the 
origin in N,, O. According to Exercise 6.3.2 again, W2 and hence V> can be chosen 
to be G,,-invariant. We define 


=] S Ss = 
Un = (9f") (V2 (TmO)*), gpa) :=[9P"(H)], HE Um. 
Obviously, U,,, is G,,-invariant and g(m) = 0. Moreover, using (6.4.3), we calculate 
TnUm = (v'")", (TnUm) = To(y (Um)) = To(Tmn O)* = (Tn Oy: 


Finally, according to Remark 6.3.6/3, the natural projection T,;, M — Nj, O restricts 
to a G»-equivariant vector space isomorphism (T,, o)t — NO. Hence, ¢g is a 
Gm-equivariant diffeomorphism. This proves Lemma 6.4.4. 


Next, by restriction, the action YW induces the mapping 


w:GxUn— M, w(a,m) := W,(m). 


Lemma 6.4.5 U;, can be shrunk so that 


1. w is a submersion, 
2. Ifa,,a2 € Gand m, m2 € Uy satisfy (a1,m1) = (a2, m2), then ana; ' E Gn 
and m2 = Pala, (m1). 
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Proof of Lemma 6.4.5 Since (a, m) = W(wW(4, m)) for all a € G and m € Up, to 
prove point | it suffices to show that y is a submersion at (1, m). Since 


im Via) = Tn O+ Tn Un, 


this follows from T,,U», = (T, O)+. For point 2, we first observe that (a ,,™m ,) = 
wW(a2,mz2) implies m2 = ete (m1). Hence, the point to prove is that the latter 
2 


implies a, lay € Gm» if only m; and mz are close enough to m. Assume, on the 
contrary, that this is false. Then, there exists a sequence {m,} in U,, converging to m 
and a sequence {a,} in G\G,, such that Y%, (m,) belongs to U;, and converges to m. 
We will show that this implies a, € Gj» for large enough n, which is a contradiction. 

By Corollary 6.3.3/2, {a,} contains a subsequence converging to some a € G. By 
passing to this subsequence, we may assume that a, —> a. Then, Y%,, (mn) converges 
to W,(m), so that Y,(m) = m and hence a € G,,. Using a vector space complement 
of gm in g and the exponential mapping of G, we construct a submanifold N of G 
of dimension dim G — dim G,,, which intersects G,, transversally in 1. The Inverse 
Mapping Theorem implies that the mapping N x G,, — G induced by the group 
multiplication of G restricts to a diffeomorphism between open neighbourhoods 
of (1,1) in N x G,, and of 1 in G, respectively. Thus, for large enough n, we can 
decompose ana! = bac with by, € N and cy € Gm, where both the sequences {b,} 
and {c,} converge to 1. On the other hand, W restricts to a diffeomorphism from an 
open neighbourhood of (1, m) in N x Uj, onto an open neighbourhood of m in M. 
Indeed, since U;, is G,,-invariant, we have 


Yam) (TiN ® Tn Um) = Yam) (T1G ® Tn Um) = TnM. 
Hence, the restriction of W/ 


(1,m) t© the subspace T1 N @ Tn Um of T1G ® Tn Um 
is surjective. By counting dimensions one finds that it is also injective. Hence, the 
assertion follows from the Inverse Mapping Theorem. Now, since the sequences 
{by} and {c,} both converge to 1, for large enough n, Vaan! (my) is the image of 
(bn, %,,(™n)) under the diffeomorphism just discussed. Since Wuan! (my) € Um, 
we conclude b, = 1. It follows that a,a~! € Gj» and hence ay, € Gm, which is the 
desired contradiction. This proves Lemma 6.4.5. 


Finally, by restriction, the action WN induces the mapping 


WN:GxNnO>NO, — WN(a,[X]):= YN([X]). (6.4.4) 


Lemma 6.4.6 


1. The mapping WN is a surjective submersion. 
2. If ay,a2 € G and [X,],[X2] © NyO satisfy the relation WN (ay, (Xi) = 
WN (a, [X2]), then [X2]= WN, (Xi) and ay tay € Gn: 
2 


mr 


Proof of Lemma 6.4.6 1. Surjectivity is due to the fact that YN covers the action of 
G on O, which is transitive. Since wN(a, [X]) = WN(WN (a, [X])) and since WN is 
linear in the second variable, for proving that yr is a submersion it suffices to show 
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that it is a submersion at (1, 0). The latter follows by observing that with respect to 
the decomposition 
TyN(1,0)NO =TnO ®NnO, 
provided by Remark 6.3.6/4, the tangent mapping is given by 
(WN) 4.0) (A: [X1) = ((Aadm [X1). 


2. Since the projection NO —> O is equivariant, wN(a;, [X1]) = WN(a, [X2]) 
implies YW, (m) = W,(m) and hence a, lay € Gy. The rest is obvious. 


Proof of Theorem 6.4.3 As a consequence of Lemmas 6.4.5/2 and 6.4.6/2 and the 
Gn-equivariance of @, there exists a unique injective mapping x : U — NO such 
that the central square in the diagram (6.4.1) commutes. Since w is a submersion, x 
is smooth. Since WN is a submersion, x has open image V and the inverse mapping 
is smooth, too. Thus, x is a diffeomorphism onto V. Since y intertwines the action 
of G on G x Uy, by left translation on the first factor with W and since an analogous 
statement holds for wN, x is equivariant. This completes the proof of the theorem. 


Remark 6.4.7 (General Tubular Neighbourhood Theorem) The notion of tubular 
neighbourhood extends to arbitrary submanifolds P C M: a tubular neighbour- 
hood of P consists of an open neighbourhood U of P in M and a diffeomorphism 
x :U — NP onto an open subset, mapping P onto the zero section. There holds a 
general Tubular Neighbourhood Theorem, stating the existence of a tubular neigh- 
bourhood for every embedded submanifold. The main difference in the proof is that 
in the general situation one does not have a group action to transport a diffeomor- 
phism defined in a neighbourhood of some point of P to all of P. Instead, one 
chooses some Riemannian metric g, identifies NP with the orthogonal complement 
of TP in TM;p and uses the exponential mapping exp, associated with g to con- 
struct x. The exponential mapping exp, maps an open neighbourhood of the zero 
section in TM to M. It is defined by expg (Xm) := y(1), where y is the solution 
of the geodesic equation of g with initial conditions y (0) = m and y (0) = X,. For 
a thorough discussion of the geodesic equation and the corresponding exponential 
mapping, we refer to [166] or to volume 2 of this book. Let us add that if one has 
already proved the general Tubular Neighbourhood Theorem, it is more natural to 
prove Theorem 6.4.3 by the same method, but using a G-invariant Riemannian met- 
ric. The latter exists due to Proposition 6.3.7/3. 


Exercises 
6.4.1 Verify the properties of the slices stated in Remark 6.4.2. 


6.5 Free Proper Actions 


In the case of a free proper Lie group action, Theorem 6.4.3 allows for constructing 
an atlas on the orbit space by means of local slices. This yields 


6.5 Free Proper Actions 291 


Corollary 6.5.1 The orbit space M/G of a free proper Lie group action (M,G, V) 
admits a unique smooth structure such that the natural projection M — M/G isa 
submersion. 


Accordingly, if W is free and proper, M/G will be referred to as the orbit mani- 
fold of W. 


Proof Uniqueness follows from Remark 1.5.16/4. To prove existence, let 7 : M —> 
M/G denote the natural projection to equivalence classes. By Proposition 6.3.4/3, 
the orbit space M/G is Hausdorff. By Proposition 6.1.5/3, it is second countable. 
Let m € M be arbitrary and let O denote the orbit of m. Since the action W is 
free, Lemma 6.4.6 implies that the mapping %,:G x Ny,O — NO, defined by 
Formula (6.4.4), is a diffeomorphism.!? Taking a tubular neighbourhood x : U > 
NO of O and composing x with y,, ' we obtain a G-equivariant diffeomorphism X 
from U onto an open G-invariant neighbourhood of G x {0}, which by G-invariance 
must have the form G x V,, for some open neighbourhood V,, of the origin in 
NmO. By embedding V,, as the subset {1} x Vj, into G x Vin, we obtain a bijective 
continuous mapping 


a4 . 
Vn +> GX Vn > US 0, 


where U := z(U) is an open neighbourhood of z(m) in M/G. This mapping is 
open, and hence a homeomorphism, because it maps an open subset W of Vj, to 
the open subset 1(%~!(G x W)) of U. Thus, the inverse of this mapping defines 
a local chart (U,<) on M /G at z(m). The transition mapping between two such 
local charts is given by 


K2 0 i ki (Uy N U>) > Ro(U ia) Up), 
~  a—] ~~] 
0%, ([X]) =pry 0X20 X, (1, [X]). 
Hence, it is smooth. Thus, we have constructed a smooth manifold structure on 


M/G. In this structure, 2 is a submersion, because via x and <, its restriction to U 
corresponds to pr. 


Remark 6.5.2 Let G be a Lie group and let H C G be aclosed subgroup. The action 
of H on G by left or right translation is free and proper, cf. Example 6.3.8/3. Thus, 
in this case, Corollary 6.5.1 reproduces Theorem 5.7.2. 


Corollary 6.5.1 implies (Exercise 6.5.1) 


Corollary 6.5.3 Let (M, G, W) be a free proper Lie group action. 


1. If H C Gis aclosed normal subgroup, then W descends to a free proper action 
of G/H on M/H and idy induces a diffeomorphism between (M/H)/(G/H) 
and M/G. 


!°More precisely, Wm is an isomorphism of G-vector bundles; in particular, NO is globally trivial. 
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2. If (N, @) is a G-invariant embedded submanifold, then the induced action of G 
on N is free and proper and (N/G, @), with @: N/G — M/G being induced by 
g on passing to the quotients, is an embedded submanifold of M/G. 


Example 6.5.4 


1. The action of G = Z on M = Rx (—1, 1) defined by Y% (x, t) := (x +k, (—1)*t) 
is free. It is also proper:7” let {k,} and {(%pn, ty)} be sequences in Z and R x 
(—1, 1), respectively, such that {W%, (Xn, tn)} and {(xn, tn)} converge. Since 


kim — Knl < || (%n + kn. (-D*" th) — (%n + kms (—D*" ty) | 
< | Wi, (Xn, tn) — om (Xm, tm) |+| (Xm, tn) — (Xn, tn) 


there exists a number no > O such that k, = ky, for all n > no. Hence, {k,} con- 
verges, and W is proper. The quotient manifold coincides with the Mobius strip 
constructed in Example 1.1.12: the underlying topological spaces coincide by 
construction, and the smooth structures coincide, because the natural projection 
is a submersion with respect to both of them. Let us add that application of Corol- 
lary 6.5.3 to the closed normal subgroup H = 2Z yields that the Mobius strip is 
diffeomorphic to the quotient manifold of the free proper action of G/H = Zz 
on 


’ 


M/H =R/2Z x (-1,1)=S! x (-1, L), 


where on S! x (—1, 1), the generator of Z2 maps (a, f) to (—a@, —f). 

2. Let K=R, C, H and let K; denote the Lie group given by the unit sphere in K, 
that is, Ry = O(1), Cy = U(1) and Hy = Sp(1). The action of G = K; on M= 
K” is proper, because K, is compact. Since this action is isometric with respect to 
the natural scalar product, it restricts to an action of K, on the unit sphere gdr-1 
where d = dimp K. The latter action is in addition free. The quotient manifold 
S@"—! /IK, coincides with the projective space KP”~!, cf. Example 1.1.15; the 
argument is the same as under point |. 

3. Let M = O(n). For k <n, application of Corollary 6.5.3 to G = O(n —k) x O(k) 
and H = O(n — k) x {1} yields that O(k) acts freely and properly on the Stiefel 
manifold Sp(k, n), cf. Example 5.7.5, and that the quotient manifold of this ac- 
tion is diffeomorphic to the GraBmann manifold Gp(k,n), cf. Example 5.7.6. 
Similar statements hold in the complex and in the quaternionic case. Combining 
this with the diffeomorphism between Sx(1,7) and gan —!. as well as point 2 
above, one obtains that Gx(1,n) is diffeomorphic to KP” —!_ This has already 
been asserted in Example 5.7.6, but the proof was postponed there. 


Corollary 6.5.1 implies the following important bundle structures. 


Definition 6.5.5 (Principal bundle) Let (P, G, W) be a free Lie group action, let M 
be a manifold and let x : P + M be asmooth mapping. The tuple (P, G, M, WY, 1) 


204 proper action of a discrete group is called a properly discontinuous action. 
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is called a principal bundle, if for every m € M there exists a local trivialization 
at m, that is, there exist an open neighbourhood W of m and a diffeomorphism 
x :a~!(W) > W x G such that 


1. x intertwines W with the G-action on W x G by translations”! on the factor G, 
2. pry ox(p) = 2(p) for all p ¢27!(W). 


Denoting x := pr,ox :2~!(W) => G, for a right action WY, property | can be 
rewritten as 


k(Wa(p)) =K(p)a, pe x '(W), aeG. 


The group G is called the structure group of P. If G is fixed, P is called a principal 
G-bundle. The existence of local trivializations implies that 7 is a submersion.” 
Hence the subsets x7! (m), m € M, are submanifolds, called the fibres of P. 

Now, let (P, G, W) be a free proper Lie group action. Let M be the orbit space, 
equipped with the smooth structure provided by Corollary 6.5.1, and let 7: P ~ M 
be the natural projection to orbits. Every tubular neighbourhood of an orbit defines 
a local trivialization over a neighbourhood of the corresponding point of M. Hence, 
the Tubular Neighbourhood Theorem 6.4.3 implies that (P, G, M, W, zr) is a princi- 
pal bundle. Conversely, if (P, G, M, W, zr) is a principal bundle, (P, G, W) is a free 
proper Lie group action, M is diffeomorphic to the orbit space P/G and z corre- 
sponds, via this diffeomorphism, to the natural projection to orbits (Exercise 6.5.2). 

Now, in addition, let (F, &) be a G-manifold. Without loss of generality, assume 
both W and & to be left actions. The direct product action of G on P x F, given by 


(a, (p, f)) > (Val), Fal f)), (6.5.1) 


is obviously free. Due to Remark 6.3.9/2, it is also proper. Hence, by Corollary 6.5.1, 
the orbit space 


PxGgF:=(P x F)/G 
inherits a unique smooth structure. 


Definition 6.5.6 (Twisted product) The manifold P x g F is called the twisted prod- 
uct of (P, YW) with (F, &). 


There are two bundle structures inherent in the twisted product. First, P xg F 
is the base space of the principal bundle associated with the direct product action 
(6.5.1) of G on P x F. Second, the natural projection P x F — P is equivariant 
and hence induces a smooth mapping 


trp: PxgGF— P/G=M. 


217 eft (right) translations if W is a left (right) action. 


221 general, the converse does not hold. A local trivialization is more than a local section, because 
it is global along the fibre. 
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If s: U > P isa local section of the natural projection P — M, the mapping 
Ux Fo te) (m, f)'tPe [(s(m), f)]; (6.5.2) 


is a diffeomorphism projecting to the identical mapping of U (Exercise 6.5.3). 
Hence, P xq F is the total space of a locally trivial fibre bundle?? over M with 
typical fibre F’. 


Definition 6.5.7 (Associated bundle) The locally trivial fibre bundle (P xg F, 
M, mf) is said to be associated with the principal bundle (P, G, M, W, zr) and the 
G-manifold (F, &). 


In the special case where F is a finite-dimensional K-vector space and & is a 
representation of G on F, the associated bundle is a K-vector bundle over M of 
dimension dim F. 


Remark 6.5.8 Let (M, G, W) be a proper Lie group action, without loss of general- 
ity assumed to be a left action. Let O be an orbit and let m € O. The action of G,, on 
G by right translation defines a right principal bundle. After having turned this into 
a left principal bundle, we can form the associated real vector bundle G xG,, NmO, 
where G,, acts on Nj, O by the slice representation. The action of G on G x N;,O 
by left translation on the first factor induces a natural G-vector bundle structure on 
G xq,, Nm@O. As a consequence of Lemma 6.4.6, in this structure, G xg,, Nm O is 
isomorphic to NO (Exercise 6.5.4). Thus, every orbit admits a tubular neighbour- 
hood taking values in G xg,, NmO. 

More generally, one can prove the following. Let W be a real vector space car- 
rying a representation of G,, and let 4: W — T,,M be an equivariant injective lin- 
ear mapping onto a subspace complementary to T,, O. Then, there exists a tubular 
neighbourhood x : U + G xq,, W of O such that, under the natural identification 


Ti(1,0))(G XG, W) = 9/am ® W, 


one has 


(x7")iea.o (Al, w) = (Aad +2(W), (6.5.3) 
see Exercise 6.5.5. This will be used in Sect. 10.4. 


Exercises 

6.5.1 Prove Corollary 6.5.3. 

6.5.2 Let (P,G,M,W,7r) be a principal bundle. Prove that (P,G,W) is a free 
proper Lie group action, M is diffeomorphic to the orbit space P/G and x 
corresponds, via this diffeomorphism, to the natural projection to orbits. 

6.5.3 Show that the mapping (6.5.2) is a diffeomorphism. 


3The definition of locally trivial fibre bundle is obtained from that of vector bundle by replacing 
“vector space” by “manifold” and “linear mapping” by “smooth mapping”. 
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6.5.4 Use Lemma 6.4.6 to show that the G-vector bundles G xg,, Nn,O and NO 
are isomorphic. 

6.5.5 Prove the existence of a tubular neighbourhood x : U > G xg,, W satisfying 
(6.5.3), see Remark 6.5.8. 
Hint. Choose G,,-invariant scalar products on T,,O and A(W) and define the 
scalar product on T,,, M as their direct sum. Use this scalar product in the proof 
of Theorem 6.4.3 to obtain a tubular neighbourhood xo : U > NO. Next, 
show that A extends to a G-vector bundle morphism G xg,, W > (TM) 0 
and that composition with the natural projection (TM);o — NO yields a 
G-vector bundle isomorphism x; : G xg,, W > NO. Then, x := x, a xo 
has the desired properties. 


6.6 The Orbit Space 


The Tubular Neighbourhood Theorem 6.4.3 implies that the orbit space admits a 
disjoint decomposition into manifolds. Let (M, G, YW) be a Lie group action and let 
a:M>M=M /G denote the natural projection. Without loss of generality, as- 
sume that W is a left action. Recall that M and M decompose into orbit type subsets, 
that is, subsets made up by the orbits of a fixed type. The connected components of 
the orbit type subsets of M will be referred to as the strata of M. Let S$ denote the 
set of strata. Elements o € S will be viewed as pairs consisting of a conjugacy class 
of subgroups of G and a label for the connected component of the respective orbit 
type subset. The actual stratum of M eoespouae to o will be denoted by Mg. 
For o € S and for a subgroup H C G representing” o, define 


Mz :=17! (Mz), Mo. = Mg My. 


The subsets M, and M,, 4 will be referred to as the orbit type strata of M and the 
isotropy type strata of M, respectively. Unless G is connected, M, or Mo, need 
not be connected. Obviously, 


Ms =G- Mao,H, My C Min}, MoH C Mu. 
By restriction, 7 induces mappings 
1M, > Mo, To, ? Mo. > Mo. 
Next, recall that the normalizer of a subgroup H C G is defined to be 
Ng(H) := {ae G:aHa' =H}. (6.6.1) 


This is the maximal subgroup of G containing H as a normal subgroup. If H is 
closed, so is NG(#) (Exercise 6.6.1). Theorem 5.6.8 yields that NG(#) is an em- 
bedded Lie subgroup of G. Hence, it is a Lie group and H C Nc(/Z) is a closed 
normal subgroup. Denote the quotient Lie group by 


Ty :=No(A)/H. 


>4That is, representing the conjugacy class of subgroups of G corresponding to a. 
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Due to Ha = aH for alla € NG(A), Ty acts naturally from the left on G/H: 
ly x G/H > G/H, (aH, bH)+> ba7'H. 


Since the natural projection NG(H) x G > Iq x G/H is a direct product of sub- 
mersions, this action is smooth. 


Proposition 6.6.1 (Structure of strata) Let (M,G,W) be a proper left Lie group 
action. Let o € S and let H be a subgroup of G representing o. 


1. Mo and Mg, are embedded submanifolds of M. Form € Mg,H, one has 
Tm Mo.H =(TmM)", TinMo = (TmM)"# +T(G-m). 


2. W induces a free and proper left action of 'y on Mg. 7. 
3. The twisted product Mo,H Xr, G/H carries a left G-action, given by 


(a, [(m, bH)}) > [(m, abH)], 
and W induces a G-equivariant diffeomorphism 
Ww: MoH Xry G/H > Mz, v([(m, aH)]) = Yam). 


4. There exists a unique smooth structure on Mo such that the natural projection 
Ig is a submersion. With respect to this structure, the natural projection 1o,H is 
a submersion, too, and the natural inclusion mapping Mg, — Mg descends to 
a diffeomorphism from Mg, /I'y onto Mo. 


Proof 1. According to Remark 1.6.13/3, it suffices to prove the assertion for the 
intersections of the subsets M,, 7 and M, with a tubular neighbourhood of the orbit 
of any of their points. Thus, according to the Tubular Neighbourhood Theorem 6.4.3 
and Remark 6.5.8, it is enough to consider the case M = G x y V, where V is a 
finite-dimensional real vector space carrying a representation 9 of H. To determine 
the stabilizers, let a € G and v € V. One has Y¥%([(a, v)]) = [(ba, v)] = [(a, v)] iff 
ba =ac~! and v= o(c)v for some c € H. Hence, Giq,y)| = aH,a~', where H, 
denotes the stabilizer of v under 9. This implies 


M,=GxyV", Mo.u =No(A) xu V4, 


where V“ denotes the subspace of fixed points under 9. That is, Mz is a verti- 
cal vector subbundle of the vector bundle M = G xy V and M, 7 is the restric- 
tion of this vertical subbundle to the subset 7 C G/H, which is easily seen to 
be an embedded submanifold. Hence, the assertion follows from Examples 2.7.2 
and 2.7.3. This argument yields in addition that the mapping G x Mo.4 — Mo 
defined by W is a submersion. This will be needed in the proof of assertion 3. To 
prove the formulae for the tangent spaces, we write tangent vectors of G xy V 
at [(a, v)] € NG(A) xq V" in the form pr’(L/,A,u) with A € g, u € V and with 
pr: G x V > G xy V denoting the natural projection. One can show that such 
a tangent vector is invariant under the isotropy representation of Griq,y)] = A iff 
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uéV and A belongs to the Lie algebra of NG(#) (Exercise 6.6.2). This yields 
TnMo.n = (TnM ji . The formula for T,, M, then follows. 

2. Since M,.y is invariant under Ng(A), W induces an action of NGg(H) on 
Maz.H. By Proposition 6.3.4/1, the latter is proper. Since it has kernel H, by Propo- 
sition 6.1.5/5, it induces an action of 7 on M,,47. Using Corollary 6.3.3/2, it is 
easy to see that the latter remains proper. By construction, it is also free. 

3. According to point 2, we can build the twisted product Moy xr, G/H. Since 
the action of G on M,. 4 x G/H by left translations on the second factor commutes 
with the action of "77, it descends to an action of G on Mg, Xr, G/H, given by the 
asserted formula. Now, consider the mapping y. It is well-defined: if [(™m1, a; H)] = 
[(m2, a2H)], then mz = Y%(m1) and ay = a,b~! for some b € NG(A), and hence 
Wa, (m2) = Wa, (m1). Moreover, y is G-equivariant. By construction, we have the 
commutative diagram 


Mo,n X G——>M,, 


v 
MoH Xry G/H 


where the horizontal arrow is given by the action W. Since the vertical arrow is a 
submersion, w& is smooth. Since, as noticed under point 1, the horizontal arrow is a 
submersion, y is a submersion, too. For dimensional reasons, it is also an immersion 
then. Hence, it remains to show that y is bijective. Surjectivity is obvious. To prove 
injectivity, let m,,m2 € Mo,” and a), a2 € G be such that YW, (m1) = Wa, (m2). 
Then, b := dy ‘ay € NG(B#), because both m, and Y%(m,) = mz have stabilizer H. 
Thus, a2 =a 57! and hence [(m,, a, H)] =[(m2, a2H)). 
4. The natural inclusion mapping M,,4 — M, descends to a mapping 


W Mon /TH > Mo. 


Since two points of M,.y are conjugate under G iff they are conjugate un- 
der NG(A), v is a bijection. According to assertion 2 and Corollary 6.5.1, 
Mo.H/I'H carries a unique smooth structure such that the natural projection 
Mo,H — MoH /Ty is a submersion. We have the commutative diagram 


Mex Xr_ G/H ——*—> M, 


G/H |r 


A 


Mo,4 / lH —————> Ma, 


where zG/y is the natural projection in the corresponding associated fibre bundle. 
Thus, it is a submersion, and hence it is open. Since M, is an embedded subman- 
ifold, 2, is open, too. Thus, since yw is a homeomorphism, so is Vv. We define a 
smooth structure on M, by requiring v to be a diffeomorphism. In view of the 
above diagram, with respect to this structure, 77, is a smooth submersion, and so is 
Io,H. Uniqueness is then obvious. 
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Remark 6.6.2 


1. The finer decomposition of the orbit type subsets of M into connected compo- 
nents is necessary because the latter may have different dimensions, see [275] 
for an example. 

2. Since the smooth structure of Mz is unique, assertion 4 of Proposition 6.6.1 
implies that different choices of the subgroup H C G representing o lead to 
diffeomorphic orbit manifolds Mo, /Iy. This assertion also generalizes Corol- 
lary 6.5.1 to the case where the action is not free, but has a single orbit type. 

3. The diffeomorphism v turns M, y into a principal bundle over M,, with struc- 
ture group I"y and projection 7,4. Similarly, the diffeomorphisms y and W 
turn M, into a locally trivial fibre bundle over M, with typical fibre G/H and 
projection z,, associated with that principal bundle. 

4, Since 1g : Mg > Ms is a submersion, it admits local sections. Tubular neigh- 
bourhoods provide a distinguished class of local sections in the following way. 
Let m € M,, let x: U > E be a tubular neighbourhood of the orbit G -m and 
let U, be the corresponding slice atm. We have Mz N Um = Mo,H Um, where 
H =G,,. Since this subset is mapped under x onto Vi = EE AV», it is an 
embedded submanifold of M.7> The standard properties of slices stated in Re- 
mark 6.4.2/3 imply that any two distinct points in M, M Um belong to different 
orbits. It follows that the restriction of the surjective submersion 2, to Mg NUm 
is injective and hence a diffeomorphism onto an open neighbourhood of z(m) 
in M,. Then, the inverse of this mapping yields a local section of 27, at 1,(m), 
taking values in U,,, and composition of this section with x yields a local chart 
on Mz at 25 (m), taking values in the open subset ye of the vector “Space EP 

5. The family {Mo :o € S} establishes a disjoint decomposition of M into mani- 
folds. This decomposition has several additional properties reflecting how these 
manifolds fit together. In fact, it is a so-called Whitney stratification, see, for ex- 
ample, [238]. A similar statement is true for M and the family {M, :o € S}. 
If the orbit space M/G is connected, there exists an orbit type which is mini- 
mal in the sense that one (and hence any) of its representatives is conjugate to 
a subgroup of the stabilizer of an arbitrary point. The corresponding orbit type 
subset of M/G is open and dense in M/G and connected [238, Thm. 4.3.2], 
[150, Thm. 2.1], see also [54, Thm. IV.3.1]. It is therefore referred to as the prin- 
cipal stratum of M/G. Correspondingly, all the other strata are referred to as the 
secondary strata. 

6. It is sometimes”° more convenient to work with the connected components of the 
submanifolds M, and M,. 7. For a chosen point mo € Mz, 1, let Mg’? and Me 
denote the connected component of M, or MoH, respectively, containing mo. 
One can show that 


G" := {a € G: Wa(mo) € MP} (6.6.2) 


>5This follows also from the Transversal Mapping Theorem 1.8.2. 


26.9. in the theory of singular symplectic reduction. 
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is a closed subgroup of G (Exercise 6.6.3). Since it contains H, we can form the 
quotient Lie group 


Ty) = Nom (H)/H. 


Now, points 1-3 of Proposition 6.6.1 remain true if My, Mo,4, G and I'y are 
replaced by, respectively, Mz, M3"),, G"° and I77°. Point 4 can be replaced by 


the statement that the natural inclusion mappings Mz'° > M, and Mr, — M, 
induce diffeomorphisms M”’°/G""° — M, and MeV? > Mz. respectively, 


where the smooth structure of M. is the one inherited from M, according to 
point 4 of the original proposition. 


Next, we discuss the concept of a smooth function on the orbit space. Let 
(M, G, W) be a free proper Lie group action. Recall that, in this case, M is asmooth 
manifold and 2 : M — M is a smooth mapping. Since z is surjective, the pull- 
back z* : C™ (M ) > C%(M) is injective. Its image is contained in the subalgebra 
C°(M \G of C °(M ) of G-invariant functions. Conversely, every f € C (Myo 
defines a function 7 on M by 2* f= jf. Since z is a submersion, a is smooth. 
This shows that 2* induces an algebra isomorphism from C CCM ) onto C°(M)®. 
If (M, G, W) is proper but not free, M does not inherit a smooth structure from M ; 
hence a priori we do not have the concept of a smooth function on M. However, the 
above observation and the fact that, due to Proposition 6.3.7/1, C°(M he separates 
the points of M, motivate the following generalization. 


Definition 6.6.3. A continuous function f on M is said to be smooth if the function 
f oz on M is smooth. The set of smooth functions on M is denoted by C°(M). 


Obviously, C® (M ) is an associative algebra, with operations being defined 
pointwise. By construction, the pull-back c* defines an algebra isomorphism from 
C™(M) onto C~(M)°. 

Proposition 6.6.4 For every f € C™(M) and everyo €S, Fe €C®(M,). 


Proof Let H C G be a subgroup representing o. By assumption, (x* f )iMon = 
7S ay f [Mo ) is a smooth function on M,,77. Hence, the above argument for proper 


free actions yields that an , Is a smooth function on Mg. 


Example 6.6.5 (Adjoint representations of U(n) and SU(n)) Let G be a compact 
Lie group, let M = g and let W be given by the adjoint representation of G. For 
subsets S C G ands C g, define the centralizers 


Cg(S) = {A e€g:Ad(a)A=A forallae S}, 
Co (s):= {ae G: Ad(a) js = ids}. 


We will restrict attention to the cases G = U(n) and G = SU(n), starting with U(n). 
For a positive integer r, let S, denote the group of permutations of r elements. 
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Let t denote the subspace of diagonal matrices in u(m). Being skew-Hermitian, the 
elements of t have purely imaginary entries ix1,...,ix,. Let ¢ C t denote the cone 
defined by x; <--- < x,. By elementary linear algebra, every element of g = u(n) 
is conjugate under Y to some A € c. The stabilizer of A is given by Cy) (A), that 
is, by the subgroup 


Ay = {diag(ay, ..e5 Up) 1 aj € Uk)} 


of U(n), labelled by the sequence k = (k,,...,k,) of the multiplicities of the 
eigenvalues of A. Let K denote the set of all sequences arising this way, that is, 
sequences k = (kj,...,k,-) of length r = 1,2,... of positive integers satisfying 
ky +--- +k, =n. Subgroups Hy and Hj are conjugate in U(n) iff the sequences 
k and | differ by a permutation. Hence, the orbit types of W correspond bijectively 
to partitions n = kj + --- + k,. Now, let k € K. Obviously, My, is given by the 
subset 


= {diag(iy; 1g, »-+-5 Ly, 1%,): y; € R, pairwise distinct} ct. 


It follows that Mj, consists of the elements of u() having r distinct eigenvalues 
with multiplicities k;,...,k-. Next, we determine Nujm) (Hk). For o € S;, define 
Ok € Sy, to be obtained by dividing (1,...,7) into the r subsequences (1,..., ki), 
(ki t+1,...,k1 +k2),..., ki t---+k--1+1,...,) and permuting these subse- 
quences according to 9. Since a € Nun) (Ax) iff Ad(a) My, C Mp,, the normalizer 
Nuq) (Aik) is generated by Hy and the permutation matrices of 0, for all @ € S; sat- 
isfying o(k) =k. It follows that I", coincides with the stabilizer (S,)x of k under 
the action of S, and that it acts on M;,, via the corresponding permutations 0, of 
the entries. To determine the strata, for g € S,, denote 


te = {diag(iy;1,,, eves iy, Ly, ) et: Ye) <1 < Yor}. 


These subsets are the connected components of t, and hence of My,. Two con- 
nected components ee and a get identified under the action of ly, iff 0105 Rg 
(S;)x. It follows that the strata correspond bijectively to the elements of K and that 
the stratum Mx, k € K, is given by the image of is under the natural projection 7 
to the orbit space. Since is obviously injective on t'4, it induces a diffeomorphism 


io = My. 


Moreover, Mx 7, consists of the connected components rd with @ € (S;)_x and My 
consists of the elements of u(m) whose eigenvalues, in ascending order, have the 
multiplicities kj,...,k,. Finally, the submanifolds ne k € K, of t fit together to 
form the cone c. By restriction, m7 yields a bijection from this cone onto M. It is not 
hard to see that this bijection is in fact a homeomorphism. Thus, via this homeomor- 
phism, the stratum M. (1,...,1) Corresponds to the interior of c, given by x1 <---< Xp, 
and the other strata form the boundary of c, consisting of lower-dimensional cones. 

For the case G = SU(n), we observe that restriction of the above action of 
U(m) on u(n) to the subgroup SU(m) yields the same orbits. Since the subspace 
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Fig. 6.1 The curves T!, T?, 1 
T? and the central 

elements 1, 51 and 571, 

where 6 = ela, in the 

subgroup T ~ U(1) x UC) 

of diagonal matrices in 

SU(3), see Example 6.6.6 


su(m) C u(m) is invariant under this action, the results for U(7) carry over to SU(n) 
as follows. The orbit types are the same. They are represented by the stabilizers 


SA, :=SU(n)N Ak, keK. 


Since up to a phase factor, every permutation matrix may be chosen from SU(n), 
the normalizers Nsuin)(SHx) and the quotient groups Is, can be characterized 
in the same way as for G = U(). We have Msy, = tk M su(7), with connected 
components i Msu(n), o € S;. The strata correspond bijectively to the elements of 
K and for every k € K, z restricts to a diffeomorphism from M su(n) onto the 


stratum Mx. The strata fit together to form the cone ¢M su(7) which is an (n — 1)- 
dimensional simplex with one face moved to infinity.*’ Note that intersection with 
su(n) is achieved by imposing the additional condition x; + ---+ x, = 0, which for 
t Osu(n) reads kj y; +---+k,y, = 0. For a description of the orbit space, including 
the strata, in terms of discriminants we refer to [143]. 


Example 6.6.6 (Inner automorphisms of SU(3)) Consider the action of G = SU(3) 
on the group manifold M = SU(3) by inner automorphisms. Let T denote the sub- 
group of diagonal matrices. We have T = U(1) x U(1). Since every element of 
SU(3) can be diagonalized and since two elements of SU(3) are diagonal iff they 
have the same eigenvalues, the orbit space M of this action may be identified with 
the orbit space of the action of S3 on T by permuting the matrix entries. The latter 
can be described as follows, see Fig. 6.1. Each of the three equations a2 = a33, 
a33 = ay, and aj, = a22 defines a closed curve in T, respectively denoted by (er 
T? and T?. These curves intersect in the centre Z of SU(3) and separate six open 
subsets in 7. The closures of these subsets form 2-simplices whose edges are given 
by one piece of each T’ and whose vertices correspond to the elements of Z. Let 
T° be one of these open subsets. It is easy to see that the natural projection M>M 
restricts to a bijection, and hence a homeomorphism, from T° onto M. Thus, we 
can read off the stratification of M from T°. The interior T° forms a single two- 
dimensional stratum. It consists of the points whose stabilizer under the action of 


27Tn the theory of semisimple Lie algebras, this cone is called a closed Weyl chamber. 
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SU(8) is T or, equivalently, whose stabilizer under the action of S3 is trivial. There 
are three one-dimensional strata given by (T'M T°) \ Z, i = 1,2, 3. The stabilizers 
under the action of SU(3) are isomorphic to U(2) and consist, respectively, of the 
matrices 


det a 0 0 ay1 0 aj2 ai, ay2 0 
0 ail an ; 0 det a 0 " ex a22 0 , (6.6.3) 
0 ar, an ar 0 a2 0 O deta 


where a € U(2). The corresponding stabilizers under the action of S3 are generated, 
respectively, by the transpositions (23), (31) and (12). Since these subgroups are 
conjugate in SU(3) and S3, respectively, all one-dimensional strata belong to the 
same orbit type. Finally, there are three point strata formed by the elements of Z. 
Their stabilizers are SU(3) and S3, respectively. As a result, the stratification of M 
coincides with the natural stratification of the 2-simplex by its open cells. 


Remark 6.6.7 The result of Example 6.6.6 generalizes to SU(m) with arbitrary n 
(Exercise 6.6.5). In lattice gauge theory, the orbit space of the diagonal action of 
G on G x --- x G by inner automorphisms is relevant. We refer to [61, 62] for a 
detailed analysis including a description of the stratification in terms of invariants 
for a specific example. Correspondingly, the unreduced phase space of such models 
is given by T*(G x --- x G) with the induced diagonal action of G. Under the 
identification of T*G with G x g, G acts diagonally by inner automorphisms on 
the factors G and by the adjoint representation on the factors g. It is possible to 
work out the reduction with respect to this action explicitly. In lattice gauge theory, 
however, the Gau8 law yields a reduction to a level set of the momentum mapping, 
see Sect. 10.7. 


Exercises 

6.6.1 Show that the normalizer of a closed subgroup of a Lie group is closed. 

6.6.2 Complete the proof of Proposition 6.6.1/1 by showing that, in the no- 
tation used there, a tangent vector 2/(L/,A,u) of G xq V at the point 
[(a, v)] €NG(A) x V“ is invariant under the isotropy representation of H 
iff u<¢ V" and A belongs to the Lie algebra of Ng (H). 

6.6.3 Show that the subset G’"° defined by (6.6.2) is a closed subgroup of G. 

6.6.4 Determine the structure of the orbit space and its strata for the action of SO(7) 
on R”. 

6.6.5 Work out Example 6.6.6 for arbitrary SU(n). 


6.7 Invariant Vector Fields 


Throughout this section, let (M, G, Y) be a proper left Lie group action and let z : 
M — M=M/G denote the natural projection. We will discuss the basic properties 
of the following type of vector fields. 
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Definition 6.7.1 A vector field X on M is called invariant if Y,,.xX = X for all 
aeG. 


This is equivalent to the requirement that X be equivariant as a mapping M > 
TM. Therefore, it is also common to refer to invariant vector fields as equivariant 
vector fields. The invariant vector fields form a Lie subalgebra of X(M/), denoted by 
X(M)®. 

Now, let X € ¥(M)© and let 6 : J > M be the flow of X. By Proposi- 
tion 3.2.13/2, for every a € G, we have W%(Y,) = F, for all tf € R and 


@, 0 Wa(m) = Go &;(m) (6.7.1) 
for all (t,m) € ZY. That is, W restricts to an action of G on Y, for all t and the 


diffeomorphism &; : J, > GY_; is equivariant. This implies that ® projects to a 
continuous mapping &:Y—- M, where J := (idp x)(Y), defined by 


B(t,1(m)) := 1 0 D(t,m). (6.7.2) 


It will be shown below that @ is a flow?® on the topological space M. Itis, therefore, 
referred to as the projected flow, or the projection of ®. Next, by equivariance, 


Ge, (mn) S Gm (6.7.3) 


for all (t,m) € Y, that is, the flow leaves invariant the stabilizers and hence the 
orbit types. Thus, for every o € S and every subgroup H C G representing o, the 
submanifolds M, and M,,4 of M are invariant under @. It follows that X is tangent 
to M,, so that it restricts to a vector field X° on M, whose flow ®° : Y° > M is 
given by the restriction of ®, 


QD =DO(RX Mz), 87 =Hiq0, (6.7.4) 


cf. Proposition 2.7.16 and Remark 3.2.9/2. A similar statement holds for Mo, H. 
Since 


e(Xt,(m)) = Mo (Xm) 
for all a € G, the vector field X° on M, induces a mapping X°:M, > TM, by 
Se (6.7.5) 
Since zg is a submersion, this mapping is smooth and hence a vector field on Mz. 


Denote the corresponding flow by Bo: GF > M,. By construction, X° and xe 
are 71, -related. 


Proposition 6.7.2 Let X be an invariant vector field on M. If y : (a,b) > Mg isan 
integral curve of X°, then 1, 0 y : (a,b) > Mg is an integral curve of X° . Every 
integral curve of X° is of this form. 


8By an obvious modification of Definition 3.2.5, the notion of flow extends to the category of 
topological spaces and continuous mappings. 
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Proof That 7, oy is an integral curve of X° is obvious. Conversely, let y : (a, b) > 
My be an integral curve of X°. Choose a point m € M, fulfilling 7, (m) = y (0) and 
denote H = G,,. According to Remark 6.6.2/3, Mg, is a principal bundle over M, 
with structure group (7 = Ng(#)/H. By the help of local trivializations of this 
bundle we construct a curve y : (a,b) > Mo, through m fulfilling 7 0 py = y. If 
Iq is discrete, y is uniquely determined by this condition. It is an integral curve 
of X, because z is a local diffeomorphism in this case. 

Now, consider the case where I"y is not discrete. Here, it is enough to show that 
there exists a curve a : (a, b) > Iy such that 


y:(a,b)—> Mon, V(t) = Yan (VO), 


is an integral curve of X°. Here, by an abuse of notation, we have written W for the 


action of Ty on Mg. 4. Applying W’ ate! to the equation 


Xo =? y= Ws y(t) + ¥ Lay @) 


and using the invariance of X°, together with Wy(-1 0 Weir = Wyn) o Leyy)-1, we 
obtain 


aa Li gy 1(4O) = XZq) — YO. (6.7.6) 


For every f, the right hand side is a tangent vector at y(t) which is annihilated by 
mc! ott and hence contained in Ty ()(I'y - y(t). Since the action of I"y on Mg, x is 
free, CZ is a bijection from T1/"y onto T;()('H - y(t)). Thus, there exists a 
unique curve A; : (a,b) > TI’ such that 


(Yom ApO) =X -¥O, ted), 
and (6.7.6) implies that @ must satisfy the ordinary first-order differential equation 
a(t) = LiwAr@), t € (a,b). 


For the initial condition a (0) = 1, the solution is given by the path-ordered exponen- 
tial mapping, a(t) = Texp a A;(s)ds for t > 0 and a(t) = Texp(— es A;(s) ds) 
for t < 0. 


Proposition 6.7.2 is the main ingredient in proving (Exercise 6.7.1) 


Corollary 6.7.3 


1. The mapping é:9-> Mis a flow on the topological space M. 
2. The flow 6° of Xx is given by the restriction of ®, 


GQ? =9O(Rx Mo), f° = 9- (6.7.7) 
As a further consequence of Proposition 6.7.2, we note 
Corollary 6.7.4 Let X be an invariant vector field on M. For every integral curve 


y of X, the subset G - y is a submanifold of M. It is embedded iff y =(y) is an 
embedded submanifold of the corresponding stratum of M. 
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Proof Let o € S be such that y C M,. Since M, is an embedded submanifold 
of M, it is enough to prove the assertion for M,. By Proposition 6.7.2, the pro- 
jected integral curve 7, 0 y is the integral curve of a vector field on Mz and thus 
its image y is a submanifold of Mg. By Remark 6.6.2/3, we can cover p by a 
system of local trivializations {(U;, x;)} of the fibre bundle 2, : Mz > Ms, map- 
ping m5! (Uj) to U; x G/H, where H C G is a subgroup representing o. Then, 
xiC(G-y)N m5 '(U;)) =(y NU;) x G/H is a submanifold of U; x G/H. Hence, 
(G-y)N a, (U;) is a submanifold of the open subset a, 1(U;) of M,. Since G- y 
is covered by the subsets 75 ! (U;), this shows that it is a submanifold of M@,. By 
Remark 1.6.13/3, 7 is embedded iff so is G- y. 


Remark 6.7.5 Let us analyse how the projected flow @ of an invariant vector field 
X can be characterized in terms of the algebra C° (M) of smooth functions on M. 
Since, for every f € C°(M)°, 


W(X (f)) = ((Wr'), X)(WES) = XP), 


X induces a derivation of the subalgebra C° (M )& of C°(M). By the algebra iso- 
morphism z* : C°(M) > C~(M )©, this derivation is mapped to a derivation X 
of C°(M). By definition, 


n*(X(f)) = X(x* f) (6.7.8) 


for all ‘i € C™(M). Now, the projected flow satisfies 
d,s, » ine en 
qo fe = (X(f) 0 ®p)(t) (6.7.9) 


for all (t, p) € and f € C%(M). Since according to Proposition 6.3.7/1, C°(M) 
separates the points of M, the projected flow is uniquely determined by this equation 
and thus by X. 


In the remainder of this section, we show that every tubular neighbourhood of 
an orbit allows for a decomposition of invariant vector fields adapted to the bundle 
structure given by the slices. Let O be an orbit of W, let x :U — E be a tubu- 
lar neighbourhood of O and let S denote the regular distribution on U spanned 
by the tangent spaces of the slices U,,, m € O. Note that S is invariant, that is, 
WS = Sw,(m) for all a € G and m € U. We are going to construct a complemen- 
tary invariant distribution on U. Recall from Remark 2.2.10 that the tangent map- 
pings W/, : TG + TM combine to a smooth mapping TG x M > TM. Restriction 
of the latter to g x U yields a smooth mapping 


A:Uxg—>TU, = dA(m,A)=W' (A). 


Equip U x g with the action of G on U x g by (a, (m, A)) B® (Y(m), Ad(a)A) 
(direct product of G-manifolds). It is easy to see that 4 is equivariant with respect to 
this action. Choose mg € O. Since Gm, is compact, according to Proposition 5.5.6, 
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g admits an Ad(G,,,, )-invariant scalar product. Let m be the corresponding orthog- 
onal complement of g,, in g. By construction, m is Ad(G,,,)-invariant. View U x g 
as a trivial vector bundle and consider the invariant subset 


M := {(Wa(m), Ad(a)A) EU x g:mE Um, aE G, Aen}. 


Lemma 6.7.6 (20, Ajo) is an invariant vertical subbundle of TU complementary 
to Sin TU. 


Proof We start with proving that 9M is a vertical subbundle of U x g. For m € U, the 
subset tn = STN ({m} x g) is given by the union of all subspaces Ad(a)m with 
a eéG such that m € Y(U,,,). By Remark 6.4.2/3, any two such group elements 
differ by right translation by an element of G»,.. Since m is Ad(Gj, )-invariant, 
all these subspaces coincide. Hence, Jt is a linear subspace of {m} x g, given 
by It, = Ad(a)m whenever m € W(Um,). By Proposition 2.7.5, it suffices to find 
local r-frames, where r = dimm, in U x g spanning Jt. By invariance, it suffices to 
find such a local r-frame in a neighbourhood of the slice U,,,. Consider the mapping 


m x Um, > U, (A,m) > Wexpa(m). 


Since its tangent mapping at (0, mo) is bijective, it restricts to a diffeomorphism 
from an open neighbourhood of (0, mo) in m x Um, onto an open neighbourhood 
U of Um, in U. The inverse of this diffeomorphism induces smooth mappings py : 
U — mand P2: —_ Um, such that 


Wexpopi(m)(p2m)) =m, meu. 


Now, every basis {e;,..., e-} in m defines a local r-frame {5,,...,5,} in U x g over 
U by 
si(m) = Upon 


and this r-frame spans 93 over U. Thus, 0 is a vertical subbundle of U x g. 

Now, consider the mapping A;9x : 9t > TU. For every a € G, Ad(a)m is a vec- 
tor space complement of Ad(a)g,. and the latter is the Lie algebra of the subgroup 
Gw,(mo)» Which is the invariance group of the slice Uy, (mo). First, this implies that 
Aton is fibrewise injective and hence, by Proposition 2.7.4, (90, Atm) is a vertical 
subbundle. Secondly, since slices are transversal to orbits, this implies 


(Mm) + Sin = TnM 


for all m € U. Finally, since A(Nng) A Smy = 0, by counting dimensions we obtain 
MMmn) A Sm = 0 for allm € U. 


Lemma 6.7.6 yields that the image A (0) is a regular invariant distribution on U, 
contained in D8 and satisfying 


AN) @ S = TU. (6.7.10) 


This implies the following natural decomposition of invariant vector fields. 
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Proposition 6.7.7 For every invariant vector field X on M there exists an invariant 
vector field Z € I(S) and a smooth equivariant mapping A:U — g such that, for 
allmeéeU, 


Xm = yi (A(m)) + Zin. 


Proof According to (6.7.10), X decomposes as X = Y + Z with uniquely defined 
Y € FAQ) and Z € '(S). Since A(t) and S are invariant, Y and Z are equiv- 
ariant. Since vertical subbbundles are embedded, Y induces a section A of I by 
Y =A0A. Since A is equivariant, A is invariant. Writing A(m) = = (m, A(m)) witha 
smooth equivariant mapping A: U — g we obtain the assertion. 


The pair (A, Z) is called a slice decomposition of X on U and Z is called the 
slice component of X. Let % denote the flow of Z. We show that 7 projects to 
the restriction of ® to 7(U). Let m € U. We seek a curve t > a(t, m) such that 


y(t) = Vor, m) © oF (m) 


is an integral curve of X through m. As in the proof of Proposition 6.7.2, one can 
show that this curve must satisfy 


G(t,m) = Ly my A(P/ (m)) (6.7.11) 


(Exercise 6.7.2). Hence, it is given by the path-ordered exponential 


t 
atm) =Texp| [ a(@Zom)as} 
0 
and we have 
PD, (m) = Wayy,m) 0 PZ (m). (6.7.12) 


This shows that @7 projects to &, indeed. The mapping (t, mm)» a(t, m) is called 
the phase mapping of the slice decomposition (A, Z). 


Exercises 


6.7.1 Prove Corollary 6.7.3. In particular, show that the mapping db: D> M de- 
fined by (6.7.2) fulfils the conditions of Definition 3.2.5, with C® replaced 
by continuous. 

6.7.2 Prove Formula (6.7.11). 


6.8 On Relatively Critical Integral Curves 


In the last section of this chapter we give a brief introduction to relative equilib- 
ria and relatively periodic integral curves, as well as the corresponding stability 
concepts. In parts, we will stay informal. Standard references for this material are 
[65, 81, 90] and, more historically, [174]. 
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Let (M, G, W) be a proper Lie group action, let X be an invariant vector field on 
M and let ® : Y— M be the flow of X. For an integral curve y of X, let G, denote 
the stabilizer of the points of y, let g, be its Lie algebra and let 7 = 7 o y denote 
the projection to M. Extending the terminology of Chap. 3, we say that an integral 
curve of @ is critical if it is an equilibrium, that is, it consists of a point, or if it is 
periodic. 


Definition 6.8.1 An integral curve y of X is called a relative equilibrium if its 
projection 7 to M is an equilibrium. It is called relatively periodic if y is periodic. 


If y is relatively periodic, the period of 7 is called the relative period of y. Rel- 
ative equilibria and relative periodic integral curves are subsumed under the notion 
of relatively critical integral curve. According to Corollary 6.7.4, if y is relatively 
critical, then G - y is an embedded submanifold of M. Moreover, it is clear that rel- 
atively critical integral curves are defined for all times. We start with characterizing 
relatively critical integral curves in terms of the group action. 


Proposition 6.8.2 Let X be an invariant vector field on the G-manifold (M,W). 


1. An integral curve y of X is a relative equilibrium iff there existm ey and A€g 
such that 


(Axim = Xm. (6.8.1) 


In this case, ®;(m) = Wexpir a) (m). The set of solutions A of (6.8.1) forms a coset 
with respect to gy in the normalizer Ng (gy). It does not depend on the choice of 
mey. 

2. An integral curve y of X is relatively periodic iff there exist T > 0,m € y and 
aeéG such that ®;(m) ¢ G-m for all0 <t <T and 


Pr (m) = W(m). (6.8.2) 


In this case, T is the relative period of y. The set of solutions a of (6.8.2) forms a 
G,-coset in the normalizer NG(G,,). It does not depend on the choice ofm € y. 


The solutions of (6.8.1) and (6.8.2) are called drift velocities and relative phases, 
respectively. 


Proof 1. First, let y be an arbitrary integral curve of X. If y is a relative equilibrium, 
then y = 2(m) and hence y C G-m for all m € y. It follows that X,, € Tn (G-m), 
so that Xm = (Ax)m for some A € g. If, conversely, Xm = (Ax)m With A € g, invari- 
ance implies that X is tangent to the orbit G - m and hence induces a vector field on 
G-m whose integral curves coincide with those of X. We conclude that y C G-m 
and thus 7 = 2(m), that is, y is a relative equilibrium. 

Now, let y be a relative equilibrium. If A is a solution of (6.8.1), then for all 
mey andt € R we have 


d 
ay (exper) (m)) = Pexp(ta) (Ax) _ Wexptayrm = X Woxpitay(m) 
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and hence Wexpi a) (m) = ®; (m). Moreover, (6.7.1) yields 
(Ax) 6,~m) = (®,)\, (Ay) m = (D,)), Xm = X¢,(m)> 


that is, A is also a solution for ®;(m). We conclude that the set of solutions is 
independent of the choice of m € y. Since the left hand side of (6.8.1) is linear 
in the variable A, the solutions form a coset in g with respect to the linear sub- 
space of solutions of the homogeneous equation (B,.),, = 0. By Proposition 6.2.2/3, 
this subspace is given by g,,. It remains to show that the solutions A of (6.8.1) are 
contained in Ng(g,), that is, that they satisfy [A, B] € g, for all B € g,. Using 
Proposition 6.2.2/2, for f € C°(M) we find 


(IA, Ble) n(P) ~ [Bs Ax|m(S) =—-Xm (B.(f)) 
ae Ay oe or 
aa exp(sB) ° :(m)) = } 


because Wexp(sp) 0 ®;(m) = &;(m). Thus, ([A, B])m = 0 and hence [A, B] € gy. 
2. The first assertion is obvious. Thus, let y be relatively periodic and let m € y. 
If a is a solution of (6.8.2) for m, it is also a solution for ®;(m), because 


Pr (,(m)) = B, 0 Pr (m) = D, o Wa(m) = Ya(P;(m)). 


Hence, the set of solutions of (6.8.2) does not depend on m € y. It is contained in 
NG(Gy,), because 


Gy = Gorm) = Gum) = Gua — aG,a"'. 


Finally, the set of solutions is obviously a (left and right) coset with respect to G,. 


For an analysis of the global properties of relatively critical integral curves like 
periodicity, quasi-periodicity or the escaping behaviour, the reader may consult Sec- 
tions 7 and 8 in [81]. 

The concept of linearization extends in the following way from critical to rela- 
tively critical integral curves. Let y be relatively critical and let m € y. 


(a) If y is arelative equilibrium and A is a drift velocity, then m is an equilibrium of 
the (not necessarily invariant) vector field X — A, and we can form the Hessian 
endomorphism 


Hess,, (X — Ay). 


It will be referred to as the Hessian endomorphism of X at m associated with A. 
(b) If y is relatively periodic with relative period T and if a is a relative phase, then 


W -10P7(m) =m, (W,-10Pr),Xm=Xm (6.8.3) 
(Exercise 6.8.1). Hence, (W,-1 o ®r)/,, descends to an automorphism P“ of 
Nm/y, called the period automorphism associated with a. 


Thus, for relatively critical integral curves, instead of a single characteristic linear 
mapping, one has a whole family of characteristic linear mappings, labelled by the 
drift velocities or the relative phases. The algebraic structure of the latter, described 
in Proposition 6.8.2, carries over to these families. 
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Corollary 6.8.3 Let y be a relatively critical integral curve of X and letmey. 


1. [fy is a relative equilibrium, the Hessian endomorphisms Hess,,(X — Ax) form 
a coset in End(Tj,M) with respect to the image of the isotropy representation. 

2. If y is relatively periodic, the period automorphisms P% form a coset” in 
Aut(Nmy) with respect to the image of the isotropy representation induced on 
Nave 


Proof 1.1f Aj, Az are drift velocities, then Az — A, € gy and 
Hess, (X — Aj.) — Hess,,(X — A2,.) = Hess,, ((A2 _ A1)«) (6.8.4) 


(Exercise 6.8.2). Hence, the Hessian endomorphisms form a coset with respect to 
the image of g, under the mapping B +> Hess,,(B,.). By Remark 6.2.3, this is the 
isotropy representation. 

2. If a), az are relative phases, then a; ap € Gy and the automorphism Palo 
(Px?)—! of Nmy is induced by the automorphism (Wy ay) of T,, M on passing to 


Nin Y ‘4 


/ 
m 


It remains to extract those properties of the spectra of the characteristic linear 
mappings which do not depend on the particular choice of the latter. For point 2 of 
the following proposition, we have to assume that G is a subgroup of GL(n, R) or 
GL(n, C) defined by algebraic relations. 


Proposition 6.8.4 Let X be an invariant vector field on the G-manifold (M,V). 


1. For a relative equilibrium y, the real parts and the multiplicities of the elements 
of spec(Hess,,(X — Ax)) are independent of the point m € y and of the drift 
velocity A. 

2. For a relatively periodic integral curve y of X, the absolute values and the mul- 
tiplicities of the elements of spec(P“) are independent of the point m € y and 
the relative phase a. 


Proof 1. Since the set of drift velocities is independent of m € y and since 
-1 
Hessq, (m) (Xx —A,)= (D,)i, o Hess(X — Ay) 0 ((®;)}) 


for all t € R, the spectrum of Hess,,(X — A,) is independent of m. To prove the 
assertion we need the following standard relation, valid for compact subgroups (Ex- 
ercise 6.8.3) 


Ng (Gy) = 9y + Cg(Gy)- (6.8.5) 


Since, by Proposition 6.8.2/1, the drift velocities form an affine subspace of Ng (g,) 
with underlying vector subspace g,,, (6.8.5) implies that there exists a drift velocity 


29Since the relative phases lie in NG(G,,), this is both a left and a right coset. 
30Which exists due to (Wa)),, Xm = Xw,(m) = Xm for alla € Gy. 


6.8 On Relatively Critical Integral Curves 311 


B commuting with all elements of g,,. For given drift velocity A, denote C := B—A 
and write Hy = Hess ,(X — Ax) and Hg = Hess», (X — B,.). By (6.8.4), we have 


Ha = Hz + Hess (C,). (6.8.6) 


Since C € g, and hence [B,C] = 0, Proposition 6.2.2/2 implies [C,, By] = 
[B, C]. = 0. Moreover, by the invariance of X and by Proposition 3.2.15, [C,, X] = 
0. Hence, this proposition implies that the flows of C, and X — B, commute. Then, 
Hess,(Cx.) commutes with Hg and hence, by (6.8.6), it also commutes with H4. 
For the argument to follow, view H,4, Hg and Hess,,(C,.) as endomorphisms of the 
complexification of T,, M. Then, since G,, is compact, Proposition 5.5.7 yields that 
Hess, (C,.) is diagonalizable and possesses purely imaginary eigenvalues. Since Hg 
and H4 commute with Hess,,(C..), they map every eigenspace of Hess, (Cx) to it- 
self. Thus, according to (6.8.6), on a given eigenspace, Hz and Hy differ by the 
identical mapping, multiplied by the corresponding eigenvalue of Hess, (C,.). This 
yields the assertion. 

2. It suffices to prove the assertion for the automorphism (W,-1 0 ®r)’ of TM 
which induces P“ on passing to N,v. By an abuse of notation, we denote this auto- 
morphism by the same symbol, P*. Since the set of relative phases is independent 
of m € y and since 

PS on) = (Pt) © PRO ((idin) 
the spectrum of P“ does not depend on m. To prove the assertion, we need that there 
exists an integer n > O fulfilling a” = ch with c € Cg(G,) and h € G,. Under the 
assumption made on G, this follows from Proposition 1.2 in [316]. Then, for every 
relative phase b, we have b = akg for some ko € Gy and hence 


b” = (ako)" = ch(a~"Vkga”“') --- (a7 kya) ky = ck 


with k € G,. Consider the automorphism t := (W,-1 0 Pury}, Of TmM. Since it 
commutes with (W,-1)/, and (WY%-1)/,,, it maps the eigenspaces of these automor- 
phisms to themselves. According to Proposition 5.5.7, (W,-1)), and (W%-1)), are 
diagonalisable and their eigenvalues lie on the unit circle. Since 


(Pe) =U, a), 09, (PZ) = 1), or, 


the eigenvalues of (P°)” and of ( pey differ from those of t, and hence from one 
another, by phase factors only. Since this carries over to the n-th roots, the assertion 
follows. 


As a result of Proposition 6.8.4, the notions of characteristic exponent and 
characteristic multiplier extend in the following way to relatively critical integral 
curves. 


(a) Let y be a relative equilibrium of X. A real number 4 is called a reduced 
characteristic exponent of y if there exists a point m € y, a drift velocity 
A and an element A of spec(Hess,,(X — Ax)) such that A = Re(A). In this 
case, the multiplicity of 4 is defined as the sum of the multiplicities of all 
i € spec(Hess»(X — A,)) with A = Re(A). 
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(b) Let y be a relatively periodic integral curve of X. A positive number A is called 
a reduced characteristic multiplier of y if there exists a point m € y, a relative 
phase a and an element A of spec(P“) such that 4 = |A|. In this case, the mul- 
tiplicity of A is defined to be the sum of the multiplicities of all 4 € spec(P") 
with A = |A|. 


Remark 6.8.5 


1. The existence and uniqueness results for Poincaré mappings along periodic inte- 
gral curves have an analogue for relatively periodic integral curves, see e.g. [65], 
providing equivariant Poincaré mappings. Using a tubular neighbourhood, the 
corresponding slice decomposition of X and a relative phase, from an equivari- 
ant Poincaré mapping one can construct a relative Poincaré mapping. The latter 
can be interpreted as a Poincaré mapping along the periodic integral curve y with 
respect to the projected flow &. It can be used to analyse the flow & near y. 

2. The reduced characteristic exponents and the reduced characteristic multipli- 
ers may be defined as the elements of the so-called reduced spectrum, see 
[90]. The reduced spectrum of Hess,,(X — Ax) is defined as the quotient of 
spec(Hess,,(X — A,.)) with respect to the action of the additive group iR. The 

reduced spectrum of P“ is defined as the quotient of spec(P“) with respect to 
the action of U(1). In both cases, the multiplicity of an equivalence class is the 
sum of the multiplicities of its elements. 

3. The analysis of the characteristic linear mappings of a relatively critical integral 
curve y can be refined as follows. Choose m € y. Using a tubular neighbourhood 
x:U — E of the orbit G - m and a G,-invariant scalar product on T,, M, one 
can decompose 

G Gyy 
TM =Tn(G-m) ®TnUm” ® (Tx Um ”) : (6.8.7) 
Here, U,,, denotes the slice of the tubular neighbourhood x through m, ies Y de- 


notes the submanifold of G,-invariant points of Uj, and (Tin we vyt denotes the 


orthogonal complement of T,, ope ” in Tin Um. Under x, uct projects to an open 
neighbourhood of z(m) in its stratum. One can prove that, with respect to the 
decomposition (6.8.7), the characteristic linear mappings are given by upper tri- 
angular block matrices. Accordingly, their spectra decompose into a symmetry 
part (corresponding to the subspace T,,(G -m)), an isotypic part (corresponding 
to the subspace T,, Un ”) and an aliotypic (i.e. type-transversal) part (correspond- 
ing to the subspace (T,, i )+). This partition does not depend on the choice of 
E and x nor on the scalar product on T,,M. It turns out that the isotypic part 
coincides with the spectrum of the ordinary characteristic linear mapping associ- 
ated with the ordinary critical integral curve y inside the corresponding stratum. 
In particular, this part of the spectrum does not depend on the particular charac- 
teristic linear mapping chosen. The partition so constructed allows for an analysis 
of how the flow near y behaves in orbit direction (symmetry part), along the stra- 
tum (isotypic part) and transversal to the stratum (aliotypic part). On the other 
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hand, the latter two parts characterize the projected flow @ near y. Therefore, 
the corresponding characteristic exponents or multipliers may be interpreted as 
the characteristic exponents or multipliers of ~ with respect to the topological 
flow &. 

4. The stability concept adapted to relatively critical integral curves is the following. 
For a closed subgroup H of G, a ®-invariant subset S of M is called H-stable if 
for every H-invariant neighbourhood U of S in M there exists a neighbourhood 
V of S in M such that ®;(m) is defined and contained in U for all m € V and 
t > 0. It is called asymptotically H-stable if there exists a neighbourhood V of S$ 
in M with V x R4 C Y such that for every m € V and every H-invariant neigh- 
bourhood U of S in M there exists fo € R fulfilling ®,(m) € U for all t > to. 
The special case H = G recovers the usual stability concepts of Definition 3.8.1 
for the projected flow?! &: S is G-stable (asymptotically G-stable) iff (S$) is 
stable (asymptotically stable) under the projected flow & (Exercise 6.8.4). As a 
consequence of this and of the observation that the proof of Proposition 3.8.5 
carries over word by word to the topological flow @ and the periodic integral 
curve /, one finds that a relatively periodic integral curve is G-stable iff one of 
the following equivalent conditions holds: 

(a) itis G-stable under the G-equivariant local diffeomorphism ®; of M, 

(b) it is G-stable under an equivariant Poincaré mapping, 

(c) itis stable under a relative Poincaré mapping. 

A similar result holds for asymptotic G-stability. Finally, the concept of Lya- 

punov function can be adapted to H-stability as follows. An H-Lyapunov func- 

tion is a continuous H-invariant function f : U > R on an H-invariant neigh- 

bourhood U of y in M with the following properties: 

(a) f(m) =0 for allm ey and f(m) > OforallmeU\(A-y), 

(b) f(®:(m)) < f(m) for allm € U \ (H- y) and t > 0 satisfying ®;(m) € U 
for all s € [0, f]. 

If the second condition holds with f(®;(m)) < f(m), the function f is called 

an H-Lyapunov function in the strong sense. If f is differentiable, the second 

condition is equivalent to X»(f) <0 or Xm(f) < 0, respectively. Under the 

assumption that the Lie algebra of H contains all the drift velocities of y (in 

case y is a relative equilibrium) or that H contains all relative phases (in case y 

is relatively periodic), one can show the following. If there exists an H-Lyapunov 

function for y, then y is H-stable. If there exists an H-Lyapunov function in the 

strong sense, then y is asymptotically H-stable, see Exercise 6.8.5. 


Exercises 

6.8.1 Prove the relations in (6.8.3). 
6.8.2 Prove Formula (6.8.4). 

6.8.3, Prove Formula (6.8.5). 


3! These concepts are of topological nature and thus carry over word by word to topological flows. 
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Hint. Use an Ad(G,,)-invariant scalar product on g to decompose Ng(g,) = 
Gy ® (95; ONg(g;)) and show that gy ONg(gy) C Cg Gy). 

Prove that a subset S of M is G-stable (asymptotically G-stable) under the 
flow ® of an invariant vector field iff 7(S) is stable (asymptotically stable) 
under the projected flow on the orbit space M. 

Hint. Use that the domain J of is given by the orbit space of the action of G 
on Y and that, via z, G-invariant neighbourhoods of S C M in M bijectively 
correspond to neighbourhoods of (S$) in M. 

Prove the criterion for H-stability in terms of an H-Lyapunov function stated 
in Remark 6.8.5/4. Here is an outline: the assumptions imply that y is rela- 
tively critical with respect to the action of H. Hence, it is enough to prove the 
statement for the case H = G. Since the G-stability of y is equivalent to the 
stability of 7 under &, it suffices to observe that Definition 3.8.15 and Theo- 
rem 3.8.16 carry over literally from differentiable to continuous flows and to 
check that f is a Lyapunov function for y in the topological sense. 


Chapter 7 
Linear Symplectic Algebra 


In this chapter, we present linear symplectic algebra, starting with a discussion of 
the elementary properties of subspaces of a symplectic vector space and of the sym- 
plectic group. We also present linear symplectic reduction. In the second part of 
this chapter, we come to some more advanced topics, all related to the study of the 
space of Lagrangian subspaces of a given symplectic vector space. In particular, the 
Maslov index and the Kashiwara index will be presented in some detail. These topo- 
logical invariants will play an essential role in Chap. 12, in the context of geometric 
asymptotics. 


7.1 Symplectic Vector Spaces 


Let V be a finite-dimensional vector space over R and let w € . V*. The bilinear 
form w induces a linear mapping 


weve, (w’(v), u) :=a(v,u), v,UuEeV. 


We define the kernel and the rank of w to be the kernel and the rank of w”, re- 
spectively.! The form w is called non-degenerate if w is an isomorphism. This is 
equivalent to rank wm = dim V or ker w = 0, that is, vanishing of w(u, v) forall v € V 
implies u = 0. If w is an isomorphism, we denote its inverse by w*. If there is no 
danger of confusion, we often write v’ =@"(v) and p= wt (p). 


Remark 7.1.1 Let {e;} be a basis in V and let {e*'} be the dual basis in V*. Then, 


1 . . 
w= saije" Aen, @ (e)) = wije*!, rank(w) =rank(w;j), (7.1.1) 


' This is consistent with the definition of the kernel of a multilinear form given in (4.2.12). 
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where @;; := w(e;,e;). Let V=V / ker w and denote the induced bilinear form 


V x V > R by @. Obviously, ® is antisymmetric and non-degenerate. Let 
{é1,..., @} be a basis in V and let @;; = @(é;, é;). Since 


det(@j;) = det(—@ ji) = (—1)’ det(@;;) = (—1)’ det(@j;), 
we have (—1)" = 1, that is, r = dim V =rankw is even. 
For a subspace W C V, the w-orthogonal subspace is defined by 
W® := {ve V:a@(v,u) =0 forallu € W}. GAZ) 


Proposition 7.1.2 For every antisymmetric bilinear form w on a finite-dimensional 
real vector space V, there exists an ordered basis {e;} in V such that 


n 
=) ene, (7.1.3) 
i=l 
that is, w;; has the form 
Jn O 
=O" 20 
with 
Jn= ES a ; (7.1.4) 


Proof We carry out the following iterative procedure, starting with Vo := V. Ifw= 
0 on Vo, we can choose any basis in Vo and thus we are done. Otherwise, there 
exist vj, U1 € Vo such that w(v), u,) = 1. By bilinearity, v; and wu; are nonzero. By 
antisymmetry, they cannot be parallel. Hence, they are linearly independent. Let E 
be the subspace spanned by v; and u;. We show 


Eyez; = 9, E,+ EP = Vo. 


For the first equation, we decompose v € FE, M EP? as v=av, + Bu, and calculate 
B = —o(v, v1) = 0 and a = w(v, u;) = 0. For the second equation, let v € Vo. 
Write 


v= (o(v, uj)vy — wv, v1)M1) + (v — (VU, 41)vj + @(v, v1)u1) 


and check that the second term is contained in E’’. 

Next, we replace Vo by V; := E/? and iterate the argument until we arrive at a 
subspace V,, (possibly trivial) on which w vanishes. This yields 27 linearly indepen- 
dent vectors e; := Uj, €n4i = Uj, i= 1,...,n, satisfying 


w(e, €j)) =@(Enti, ent j) =9, O(E, €ntj) = 45i;- 
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In case V, #0, we choose a basis {€2741,-..-,@dimv} in V,. Then, {e),..., edimv} 
is a basis in V with the desired properties. 


Definition 7.1.3 (Symplectic vector space) 


1. A symplectic vector space is a pair (V, @) consisting of a real vector space V 
and a non-degenerate bilinear form @, called the symplectic form. A basis in V 
for which w has the canonical form (7.1.3) is called symplectic or canonical. 

2. Let (Vj,@,) and (V2,@2) be symplectic vector spaces. A linear mapping 
f: Vi > V2 is called symplectic if f*w2 = @1. A bijective symplectic mapping 
is called a symplectomorphism. 


Note that the inverse of a symplectomorphism is symplectic. Hence, if one takes 
the symplectic mappings as morphisms of symplectic vector spaces, the symplecto- 
morphisms are the corresponding isomorphisms. 


Example 7.1.4 Let V = R”. The bilinear form wo(x, y) = x’ Jny obviously de- 
fines a symplectic vector space structure and the standard basis is symplectic. We 
call J, the standard symplectic matrix of R*” and the pair (R?”, wo) the canonical 
symplectic vector space structure in dimension 27. 


Proposition 7.1.2 yields 


Corollary 7.1.5 Let (V,@) be a symplectic vector space of dimension 2n. Every 
symplectic basis of V defines a symplectomorphism onto (R2" , a). 


Example 7.1.6 Let W be a vector space and let W* be its dual space. Then, V = 
W @ W* endowed with the bilinear form 


owews(v ® p,u@®o):= p(u) — o(v) (7.1.5) 


is a symplectic vector space (Exercise 7.1.1). This form is referred to as the canoni- 
cal symplectic form on W @ W*. 


The following proposition yields another criterion for the non-degeneracy of an- 
tisymmetric bilinear forms. 


Proposition 7.1.7 An antisymmetric bilinear form w on a real vector space V is 
non-degenerate iff dim V = 2n and w" :-=wA-:-Aw¥40. 


Proof If w is non-degenerate, by Proposition 7.1.2, there exists a basis {e,} in V 
such that w= )*"_, e* A e*"+") Then, 


U 


n(n—1) 
ow" =n! . (-1) 2 . el A et Axia A et 2n—-l) A et2h #0. 
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Conversely, assume that dim V = 2n and w” 4 0 and let w be degenerate. Then, 
there exists a vector 0 4 e; € V with w’ (e}) = 0. If we extend e; to a basis 
{e1,..., an}, then w”(e1,..., €2,) = 0, in contradiction to the first assumption. 


Definition 7.1.8 (Canonical volume form) Let (V,@) be a symplectic vector space. 
The 2n-form 
1 nin-l) yp 
Qo = rc Zw (7.1.6) 
n! 
is called induced canonical volume form on V. 


In a symplectic basis {e;} of V, one has Q, = e*! A--- A e*?", 


Exercises 
7.1.1 Let W be a vector space and let W* be its dual. Prove that the bilinear form 
wew*« defined by (7.1.5) is symplectic. 


7.2 Subspaces of a Symplectic Vector Space 


Let (V, w) be a symplectic vector space. To start with, we present a number of im- 
portant relations characterizing subspaces of a symplectic vector space, their anni- 
hilators and their symplectic orthogonal complements. For that purpose, recall that 
the annihilator W° C V* of a subspace W of V is defined by 


W® := {ne V*: (n,u) =0 for allu € WH. 
From linear algebra we recall (Exercise 7.2.1) 
dimW+dimwW°=dimv, (W°)°=w, = (V/wy*= Ww, (7.2.1) 


Proposition 7.2.1 For subspaces W,W, and W2 of a symplectic vector space 
(V, @), one has 


1. (wey =w?, 
2. dimW + dim W® = dim V, 
3. (W®)®=W, 


4. If W1 C Wo, then wy cCWw?, 
5. WP Ws = (Wi + W2)® and (W, N W2)® = Ww? + Ws. 


Proof 1. Obviously, (W)” C W® and (W°)? Cc W?. 

2. This follows from point | and from (7.2.1). 

3. The inclusion relation W Cc (W®)® is immediate. Since application of point 2 
to both W and W”® yields dim(W®)® = dim W, the assertion follows. 

4. This is obvious. 
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5. It suffices to prove the first equation, because the second one follows by replac- 
ing W; by W,” and by using point 3. Due to W; C W; + W2 and W,;? Cc W)? + W3’, 
point 4 implies 


(Wi + Wo)? CWE We, (Wi? + W2)° CWI Wo. 
Hence, to prove the assertion it suffices to show that 
dim(W; + W2)° + dim(W/? + W3’)® = dim(W, N W2) + dim(W?? N W3’). 


Using point 2 and dim(W; + W2) = dim W; + dim W2 — dim(W, M W2), we can 
write 


dim(W, + W)® = dim V — dim W, — dim W2 + dim(W; N W>), 
dim(W? + W3’)° = dim V — dim W/? — dim W3’ + dim(W?? N WS’). 


Addition of these two equations yields the desired equality. 


The following types of subspaces of a symplectic vector space are important. 


Definition 7.2.2 Let (V,@) be a symplectic vector space and let W C V be a sub- 
space. W is called 


. isotropic if W c W®, 

. coisotropic if W® Cc W, 

. Lagrange if W° = W, 

. symplectic if WM W® = {0}. 


BRWNrFe 


Let W be a subspace of a symplectic vector space (V,@) and let wy be the 
bilinear form on W induced by restriction of w. In terms of wy, the several types of 
subspaces can be characterized as follows. 


Proposition 7.2.3 A subspace W of a symplectic vector space (V, w) is 


. isotropic iff aw = 0, 

. coisotropic iff W® is isotropic, that is, iff wwe = 0, 
. Lagrangian iff ow = owe =0, 

. symplectic iff ww or wwe is a symplectic form. 


BRWN Re 


The key for the analysis of the algebraic properties of W is the study of the rank 
of ww which is called the symplectic rank of W. By Proposition 7.1.2, this is an 
even number. Since 


kerww =ker(ww)’ =Waw?, (7235 


we have 


rank ww = dim W — dimkerwyw = dim W — dim(W a) w”), (7.2.3) 
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and a similar equation for mw. Subtracting these two equations, we get 
rank@y — rankw@yo = dim W —dimW® = 2dim W — dim V. (7.2.4) 


This implies 
max(0, 2dim W — dim V) < rankwy < dimW. (7.2.5) 


From (7.2.5) and (7.2.4) we read off that W and W® reach their minimal or maximal 
rank always simultaneously. Moreover, Eq. (7.2.4) yields the following 


Proposition 7.2.4 Let (V,@) be a symplectic vector space of dimension 2n. 


1. The dimension of an isotropic (coisotropic) subspace is at most (at least) equal 
to n. The dimension of a Lagrangian subspace is equal to n. 

2. Every subspace of dimension | (of codimension 1) is isotropic (coisotropic). 

. A subspace of dimension m > n is coisotropic iff its symplectic rank is 2(m — n). 

4. Every subspace of an isotropic subspace is isotropic and every subspace which 
contains a coisotropic subspace is coisotropic. 


io’) 


Example 7.2.5 Let V = R2”", wo(x, y) =x! Jny and let {e;}, i = 1,...,2n, be the 
canonical basis. 


1. Subspaces which are spanned by vectors e; with i < n or by vectors e; with i > n 
are isotropic. 

2. Subspaces which contain all vectors e; with i <n or all vectors e; with i > n are 
coisotropic. 

3. The subspace spanned by e,..., €, and the subspace spanned by ey+1,..., €2n 
are Lagrangian. 

4. Subspaces which are spanned by pairs e;, en +; with i < n are symplectic. 


The following two propositions characterize Lagrangian subspaces. 


Proposition 7.2.6 Let W be a subspace of a symplectic vector space (V, @). The 
following statements are equivalent. 


1. W is a Lagrangian subspace. 
2. W is isotropic and dim W = 5 dim V. 
3. W is isotropic and possesses an isotropic complement in V . 


Proof | = 3: Of course, W is isotropic. An isotropic complement can be con- 
structed by the following procedure. Choose a nonzero v; € V \ W and let Vj := 
Rv,. We check 


(a) Vj is isotropic: this follows from Proposition 7.2.4/2. 
(b) W+ V/? = V: by Proposition 7.2.1/5, we have W + V)? = (W®1 V1). Hence, 
the assertion follows from W® = W and WN V; = 0. 
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If now W + V; = V, we are done. Otherwise, by point (b), there exists a nonzero 
v2 € Vi? \ (W + Vj). Let V2 := Vj + Ruz. We check that (a) and (b) hold for V2. 
For (a), this follows from the isotropy of V; and from the fact that v2 € VP . The ar- 
gument for (b) is the same as before. Now replace V; by V2 and iterate the argument 
to finally arrive at an isotropic subspace V,, satisfying W + V,, = V. By construction, 
V, 18 an isotropic complement of W in V. 

3 = 2: This follows from Proposition 7.2.4/1. 

2=> 1: Since W Cc W® and dim W® = dim V — dim W = dim W, we conclude 
Ww=w”. 


Points 2 and 3 of Proposition 7.2.6 imply that every Lagrangian subspace has a 
Lagrangian complement. 


Remark 7.2.7 The above result can be generalized as follows (Exercise 7.2.3). For 
every tuple (L1,..., Z,) of Lagrangian subspaces there exists a Lagrangian sub- 
space Lo which is transversal to all elements of the tuple, that is, 


faites (Shes (7.2.6) 


Example 7.2.8 For the canonical symplectic vector space W @ W%, the subspaces 
W @ {0} and {0} © W* are obviously complementary Lagrangian subspaces. 


Proposition 7.2.9 Let (V,q@) be a symplectic vector space and let V = W ® W' 
be a decomposition into complementary Lagrangian subspaces. Then, this decom- 
position induces a symplectomorphism onto the canonical symplectic vector space 


Wow. 
Proof Let x: W’ — W* be the natural mapping induced by w, 
i (u), v) = ol(u, v). (7.2.7) 


Note that this is an isomorphism of vector spaces. Thus, lw ® x: W@W’ > 
W @ W* is an isomorphism, too. Finally, for u, v € W and u’, v’ € W’, we have 


o(u tu',u+ v’) = alu’, v) -o(v’, u) = (x (u’), v) = (x (v'), u), 


that is, lw @ x is symplectic. 


Exercises 
7.2.1 Prove the relations stated in (7.2.1). 
7.2.2 Let (V, w) be a symplectic vector space of dimension 2n. Prove the following 
statements. 
(a) Every basis of an isotropic subspace can be extended to a symplectic basis 
of V. 
(b) Every isotropic subspace is contained in a Lagrangian subspace. Corre- 
spondingly, every coisotropic subspace contains a Lagrangian subspace. 
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(c) If (W,, W2) and (U;, U2) are pairs of complementary Lagrangian sub- 
spaces, there exists a symplectomorphism which transforms W; into Uj. 
7.2.3 Prove the statement of Remark 7.2.7. 


7.3 Linear Symplectic Reduction 


Let W bea subspace of the symplectic vector space (V, w). Since kerww = WNW®, 
the quotient 


W := W/(WOW?) (7.3.1) 


carries a natural symplectic structure @ defined by 
&([u], [v]) = ou, v) 
for all u, ve W. Let p: W > W be the canonical projection. Then, 
ow =p (7.3.2) 
and wy and @ have the same rank. If W is coisotropic, then 
W=Ww/w?. (7.3.3) 


Proposition 7.3.1 Let (V,qw) be a symplectic vector space. Let W C V_ be 
coisotropic and let L C V be Lagrange. Then, the image of WL under the canon- 
ical projection p: W > W isa Lagrangian subspace of (W, @). 


Proof Since L is isotropic, so is WM L. Hence, (7.3.2) implies that p(W M L) is 
isotropic, too. Let us calculate the dimension. Since W® C W, we have p(WNL) = 
(WO L)/(W® 1 L) as vector spaces. Using points 2 and 5 of Proposition 7.2.1, we 
get 


dim(W NL) = dim V — dim(W 0 L)® = dim V — dim W® — dim L + dim(W° 0.) 


and a similar equation for W® N L. Subtracting these equations and using (7.3.3), 
we obtain 


dim(W NL) — dim(W® NL) = —(dim W — dim W®) = ; dim W. 


Nle 


Thus, Proposition 7.2.6 yields the assertion. 


Lemma 7.3.2 Let W be a subspace of the symplectic vector space (V, w). If E C W 
is a subspace complementary to W 1 W®, that is, W=E @(WOW?”), then E is 
maximally symplectic in W. 
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Proof Since py¢g is injective and since ® is non-degenerate, (7.3.2) implies that E 
is symplectic. Suppose there exists a symplectic subspace E’ such that E Cc E’ CW 
and dim E’ > dim E. Then, for dimensional reasons, E’ 1 (WM W”) SF {0}. Choose 
anonzero v € E/N (WO W?”). Then, v € W® C E’® and therefore w(v, v’) = 0 for 
all v’ € E’. This contradicts the assumption on E’ to be symplectic. 


As an important conclusion we obtain that every subspace of a symplectic vec- 
tor space (V, w) induces a decomposition of V into a direct sum of w-orthogonal 
symplectic subspaces. 


Theorem 7.3.3 (Witt-Artin decomposition) Let (V, w) be a symplectic vector space 
and let W be a subspace of V. Let E and F be subspaces such that 


W=E®@(WNW®) and W°=FO@(WOW?%). 


Then, E, F and (E ® F)® are symplectic and V decomposes into the w-orthogonal 
direct sum 


V=E@FO(ESGF). (7.3.4) 
Moreover, WM W® is a Lagrangian subspace of (E ® F)®. 


We will see that the above decomposition plays an important role in symplectic 
reduction of systems with symmetries, cf. Chap. 10. 


Proof The subspaces E and F are symplectic due to Lemma 7.3.2. Since EN F C 
WOW® and EN (WN W®) = {0}, we obtain EN F = {0}. Since F C W®, we 
have W c F® and thus E C F®. Consequently, F C E®. Therefore, E and F are 
w-orthogonal. 

Let us denote Z = (E © F)®. Since E @ F is symplectic, Z is symplectic, too. 
By Proposition 7.2.1/5, we have Z = F° 1 E® > WN W®. Moreover, by Proposi- 
tion 7.2.3/1, WM W® is isotropic. Finally, due to 


dim V = dim W + dim W® = dim E + dim F + 2dim(WN W®), 


we obtain dim(W N W®) = 5 dim Z. Therefore, WM W® is a Lagrangian subspace 
in Z. 


Example 7.3.4 Consider R® with the standard symplectic structure. Denote the stan- 
dard basis elements by e1, €2, €3, f) = e4, fo = es, f3 = e6 and choose the subspace 


W =Re, @ Reo 6 Rf. 
Then, 
W°®=Re @Re3 ORR, WOW? =Re. 


We choose E = Re; @ Rf; and F = Re3 @ Rf3. Then, (E @ F)® = Reo @ Rf2 and 
(7.3.4) holds, indeed. 


324 7 Linear Symplectic Algebra 


Remark 7.3.5 


1. The Witt-Artin decomposition (7.3.4) induces the following decomposition of 
the symplectic form: 


o=0" +o" oO”, (7.3.5) 


2. Applying Proposition 7.2.9 to the Lagrangian subspace WM W® of (E @ F)®, 
we obtain 


(E@F)°=WnWw’ea(wnw?)’. (7.3.6) 
3. In particular, let W be a Lagrangian subspace of the symplectic vector space 


(V,q@). Then, we have E = {0} = F and therefore (E @ F)® = V, thus, in this 
case the Witt-Artin decomposition induced by W is trivial. 


7.4 The Symplectic Group 


In this section, we discuss symplectic mappings. The following proposition collects 
their elementary properties. 


Proposition 7.4.1 Let (V,w) and (W,,p) be symplectic vector spaces and let 
f: V— W be asymplectic mapping. 


1. The mapping f is injective. If dim V = dim W, then f is a symplectomorphism. 
2. The image of f is a symplectic subspace of W. 
3. Every symplectomorphism preserves the canonical symplectic volume form. 


4. If (V, @) = (W, p), then det f = 1. 
Proof 1. Let v € V such that f(v) = 0. Then, 
w(v,u) = f* pv, u) = p(f(v), f(u)) =0 


for all u € V. Since w is non-degenerate, we conclude v = 0. 
2. Let we imf NM (mf)? and let v € V such that w = f(v). Then, 
p(f(v), f@)) =a, u) = 0 for all u € V. It follows that v = 0 and hence w = 0. 
3. By f*p =a, we have 


1 n(n—1) 
f°2p= (I) fp r+ A fp = Qo. 
4. By definition of the determinant, f*2,, = (det f)@,. Hence, point 3 implies 
det f = 1. 


Remark 7.4.2 


1. As a generalization of point 4 of Proposition 7.4.1, let V and W be vector spaces 
of the same dimension, dim V = dim W = k, with volume forms 92y and yw. 
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Let f € L(V, W) be a linear mapping. Then, there exists a unique constant 
det(f) such that 


f* Qw =det(f)- Qy, 


because he V is one-dimensional. Thus, f is volume preserving iff det(f) = 1. 

2. Since on a 2-dimensional symplectic vector space the canonical volume form 
coincides with the symplectic form, every volume preserving mapping is auto- 
matically symplectic. However, in higher dimensions this is not the case as the 
following examples shows. Let V = R*, let {e,, €2, €3, e4} denote the canonical 
basis and consider the canonical symplectic form w = e*! A e*? + e*? A e*4. The 
linear mapping f: V — V defined by 


(€1, €2, €3, €4) H> (—e], —e2, e3, €4) 


43 


preserves the volume 2, = e*! A e*? A e*? A e**, but since f*w = —a, it is not 


symplectic. 


The automorphisms of a symplectic vector space (V, w) form a closed subgroup 
of GL(V), called the symplectic group of (V, w) and denoted by Sp(V, w). Accord- 
ing to Theorem 5.6.8, Sp(V, w) is a Lie group and a Lie subgroup of GL(V). Let 
sp(V,w) denote the Lie algebra of Sp(V, w). Recall that in a symplectic basis {e;} 
of V we have w = J, cf. Example 7.1.4. Let f € Sp(V, w) and let a be the matrix 
representing f in this basis, f (e;) = a/ ié;. The condition f*@ = o takes the form 


a’ Ina = Jn. (7.4.1) 


Thus, every symplectic basis induces an isomorphism of the Lie groups Sp(V, w) 
and Sp(n, R), cf. Example 1.2.6. 

For the analysis of the stability of equilibria or periodic integral curves of Hamil- 
tonian systems, the properties of the spectrum of symplectomorphisms are relevant. 
The following proposition puts strong limits on how eigenvalues of symplectomor- 
phisms are located in the complex plane. 


Proposition 7.4.3 (Symplectic Eigenvalue Theorem) Let (V,q@) be a symplectic 
vector space. 


1. If dX is an eigenvalue of a € Sp(V, w) with multiplicity k, then both the complex 
conjugate i and A~' are eigenvalues of a with multiplicity k. 

2. If d is an eigenvalue of A € sp(V, @) with multiplicity k, then both i and —i are 
eigenvalues of A with multiplicity k. 


Proof It suffices to prove the assertion for V = R*” with the canonical symplectic 

structure. The assertions about A are due to the fact that a and A are real matrices. 
1. First, note that A 4 0, because a is invertible. Let x,(z) := det(a — z1) be the 

characteristic polynomial of a. We rewrite (7.4.1) in the form (a')"!=J,a 1 ie to 
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obtain 
Xa(z) = det(J, (a — zi) J!) =det((aT)' — zt) =det(a7! — zt). 


1 _ 2g = (—za~!)(a — z~!11) and using det(a) = 1 we arrive at 


Writing a— 
UGH e kale): 


Thus, if 4 is a zero of x, of multiplicity k, so is 7 ae 
2. Similarly, due to A = —1, from (7.4.1) we conclude Al =J,AIn. Using this 
and det J, = 1, we obtain 


XA(—z) = det(A + zl) = det(J,(A + 21) Jn) = det(AT — z1) = xa(z). 


Thus, if 4 is a zero of x4 of multiplicity k, so is —2. 


Corollary 7.4.4 Let (V, w) be a symplectic vector space. 


1. If +1 is an eigenvalue of a € Sp(V, w), it has even multiplicity. 
2. If 0 is an eigenvalue of A € sp(V, w), it has even multiplicity. 


Proof Let a € Sp(V, w). Unless 4 = +1, the sum of the multiplicities of the eigen- 
values A, A, 27! and a is even. It follows that the sum of the multiplicities of the 
eigenvalues +1 and —1 (if present) is even, too. A similar argument for A yields 
that, there, the eigenvalue 0 can occur with even multiplicity only. Moreover, un- 
less A = —1, the family of eigenvalues 1, 2, AW! and a contributes a unit factor 
to deta. Since deta = 1, the eigenvalue —1 can occur with even multiplicity only. 
Hence, this holds for the eigenvalue +1, too. 


It follows that the eigenvalues of a symplectomorphism must be located symmet- 
rically with respect to reflections about the unit circle and the real axis. For a given 
A, the following cases can occur. 


1. If |A| Al andIma 0, then A, 2,47! and 7! are all distinct (4 different eigen- 
values). 

2. If |A| 41, Ima =0, then A =2 and A~! = 17! (pairs of eigenvalues on the real 
axis). 

3. If |A| = 1, Ima 0, then A =A! and A = A7! (pairs of eigenvalues on the unit 
circle). 

4. If |A]J = 1, Ima =0, then’ =A =A~! =~! =4+1 (One eigenvalue, +1 or —1). 


Figure 7.1 illustrates these statements for the case V = R* endowed with the 
canonical symplectic structure. 

It turns out that every linear symplectomorphism can be brought to a normal 
form, see Sect. 3 in [225]. The following special case will be useful in the sequel. 
The proof is left to the reader (Exercise 7.4.2). 
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Bog 


complex saddle generic center 
(2) 
(2) (2) (4) 
(2) 
degenerate saddle identity degenerate center 
real saddle saddle center 


Fig. 7.1 Possible eigenvalue configurations for symplectomorphisms of R* 


Proposition 7.4.5 Let f € Sp(V, @), dim V = 2n. If the eigenvalues i, of f are all 
distinct and lie on the unit circle, 0%. = e'*, there exists a symplectic basis in V such 
that f is given by the matrix eA where A= diag(a1,...,Q@y,Q@1,...,Q@y). 


Finally, we prove that for a compact subgroup H C Sp(V, @), the subspace V7 
of H-invariant elements is symplectic. Later on, this will be applied to the stabilizers 
of proper Lie group actions. 


Lemma 7.4.6 Let (V,@) be a symplectic vector space and let H C Sp(V,@) be a 
compact subgroup. Then, V" is a symplectic subspace. 


Proof We have to show that V4 1 (V%)® =0. Since H is compact, Proposi- 
tion 5.5.6 implies that V admits an H-invariant scalar product g. Clearly, it suffices 
to show that 


Gass gen cee (7.4.2) 
where (V)+ denotes the g-orthogonal complement of V” in V. By Proposi- 


tion 7.2.1/1, (V4) = ((V“)’)®. Note that H acts on V* from the left via (h~!)7, 
h € H, and that H C Sp(V, ) implies that the mappings and b intertwine this ac- 
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tion with the original action of H on V. It follows that (V4)? =(V*)7 and hence 
(V4)2 = ((V*)4)°. On the other hand, since gis H-invariant, the induced isomor- 
phism g: V > V* maps V” onto (V*)”. This implies 


((v*)")"=(e(V"))=(V")" 


and thus (7.4.2). 


Exercises 


7.4.1 Show that a = ie al belongs to Sp(n, R) iff B'D and C'E are symmetric 
and B? E — D?C =1. Here, B, C, D, E are quadratic real matrices. 
7.4.2 Prove Proposition 7.4.5. 


7.5 Compatible Complex Structures 


In symplectic algebra and geometry, complex structures play an important auxiliary 
role. 


Definition 7.5.1 A complex structure on a real vector space V is an endomorphism 
J with the property J* = —id. 


If a complex structure exists, V has even dimension and acquires the structure of 
a complex vector space of dimension 5 dim V with scalar multiplication by complex 
numbers defined by 


(x+iy)v:=xu+yJv, veEV, x,yER. 


In what follows, we do not distinguish in notation between the real and the complex 
vector space structure on V. Note that a real linear mapping of V is complex linear 
iff it commutes with J. Denote by Endc(V) the vector space of complex linear 
mappings and by GLc(V) the group of complex linear automorphisms of V. 

Now, let (V, @) be a symplectic vector space. 


Definition 7.5.2 A complex structure J on V is said to be compatible with w if the 
bilinear form 
gy:VxVOR, gy(u,v)=alu, Jv) (7.5.1) 
is a (real) scalar product. 
We will prove below that compatible complex structures exist on every sym- 


plectic vector space. The isometry group of g,; will be denoted by O(V, g,). 
Furthermore, we denote the space of w-compatible complex structures on (V, w) 
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by 4(V,«@) and endow it with the topology induced from End(V). If J is o- 
compatible, it is both symplectic and isometric with respect to g;: 


o(Ju, Jv) =g7(Ju, v) = g7(v, Ju) = —o(v,u) = alu, v), 
gy(Ju, Jv) = —o@(Ju, v) =v, Ju) = g7(v,u) = g7(u, v). 


Due to J* = —id, the first equation implies A (V,@) C sp(V, w). The proof of the 
following proposition is left to the reader (Exercise 7.5.1). 


Proposition 7.5.3 Let J «¢ Y(V,«). 


1. J defines a scalar product” on the complex vector space V by 
hy(u, v) :=g7(u, v) +iw(u, v). (7.5.2) 
For the corresponding isometry group U(V, hz), one has 


UV, hy) = Sp(V, a) NGLe(V) = O(V, g7) NGLc(V) = O(V, £7) NSp(V, w). 
(7.5.3) 

2. For every Lagrangian subspace L of (V, w), the image of L under J is a com- 
plementary Lagrangian subspace, that is, V = L @ JL. It coincides with the 
gy-orthogonal complement L+ of L in V. 

3. Every endomorphism A of a Lagrangian subspace L extends uniquely to a com- 
plex linear endomorphism of V by A(u+ Jv) := Au + JAv, where u,v eé L. 
Every complex linear endomorphism of V is uniquely determined by its values 
on L. 

4. Every gy;-orthonormal basis {e;} of a Lagrangian subspace L constitutes an hy - 
orthonormal basis in V, viewed as a complex vector space. Moreover, the ele- 
ments e; and f; := Je; form a gy-orthonormal symplectic basis in (V, @). 


Remark 7.5.4 


1. Point 2 of Proposition 7.5.3 has the following converse. Let L; and L2 be com- 
plementary Lagrangian subspaces. According to Proposition 7.2.9, w induces an 
isomorphism L} — L2. Using this isomorphism and a basis in L;, one can con- 
struct an w-compatible complex structure J such that Lz = JL, (Exercise 7.5.2). 

2. Due to J? = —1, the eigenvalues of J are +i, both with multiplicity 5 dim V. 
Hence, for any two complex structures J and J’, there exists a € GL(V) such 
that J’ = aJa™~!. If both J and J’ are w-compatible, a can be chosen from 
Sp(V, @). 


Example 7.5.5 The matrices +J, are complex structures on V = R2”. The matrix 
J = —Jy is compatible with the standard symplectic structure of R7” defined by Jy. 
The corresponding scalar product g, coincides with the standard scalar product on 


?That is, a positive definite Hermitian form. 
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IR”, With respect to the complex vector space structure induced on R2” by J, the 
mapping R~” = R” @ R” > C”, given by (x, y) + x + iy, is an isomorphism of 
complex vector spaces. Via this isomorphism, hy, corresponds to the standard scalar 
product on C”. The induced mapping M,, (C) > Mo, (R), defined by 


A+iBR E a 


, A,BeM,(R), (7.5.4) 
yields an isomorphism of real algebras from M,,(C) onto the subalgebra of Ma» (R) 
of elements commuting with —J,,. This isomorphism maps U(7) onto U(V,h,). 


Under the identification of M,,(C) with its image in M2, (R), the identities (7.5.3) 
read 


U(n) = Sp(v, R) N GL(, C) = O(2n) N GL(, C) = O(2n) N Spm, R). 


The Lagrangian subspace of IR?” defined by the first n standard basis vectors is 
mapped under J to that spanned by the last n basis vectors. The counterparts of 
these Lagrangian subspaces in C” are given by the real subspaces R” and iR”, 
respectively. 


Denote the space of symmetric bilinear forms on V by S?V* and the convex 
open subset of positive definite elements by St v*. 


Proposition 7.5.6 Every symplectic vector space (V, w) admits a compatible com- 
plex structure. More precisely, there exists a surjective and continuous mapping 
F:S7V*—> Y(V,o) satisfying F(gy) = J. 


Proof Since w is non-degenerate, every g € sty defines an endomorphism a of 
V by 
g(u,v) =u, av). 


Since g is non-degenerate, a is invertible. Let a* denote the adjoint of a relative to 
g, defined by g(u,a*v) = g(au, v) for all u,v € V, and let a = J|a| be the polar 
decomposition of a relative to g, that is, ja] = Va*a and J = ala|~!. The anti- 


symmetry of w implies a = —a*. It follows that J = a|a|~! = |a|~!'a and hence 
P= a*(\a|*)~! = a?(—a*)~! = —id. Consequently, J is a complex structure on 
(V, @). Due to 


atu, Jv) = o(u, ala\‘v) — g(u, |a|~'v) — g(|al~2u, la|~?v), 
it is w-compatible. Thus, 4 (V, w) # @ and we can define a mapping 
F:SiV*> ZV,o), = F(g):= J. 


Since J* = —id, we have F(g,) = J. In particular, F is surjective. 
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It remains to show that F is continuous. Since J = ala|~! and since multiplica- 
tion and inversion in GL(V), as well as the assignment g +> a, are obviously con- 
tinuous, it suffices to show that for arbitrary A €¢ End(V), the assignment g +> |A| 
is continuous. To see this, fix a reference scalar product go € SV". By means of 
orthonormal bases, one with respect to g and another one with respect to go, one 
can construct b € GL(V) such that g(u, v) = go(bu, bv) for all u, v € V. Consider 
the algebra homomorphism 


Cy : End(V) > End(V), Cp(A) := DAD. 


It satisfies Cpy(A*) = Cp(A)*, where A*° denotes the adjoint of A with respect to 
go (Exercise 7.5.3). This implies that C, maps endomorphisms which are positive 
relative to g to endomorphisms which are positive relative to gg. Thus, Cp,(V A* A) 
is positive relative to go. Since it satisfies Cp(/ A*A)* = Cy(A)*°Cp (A), it is the 
square root of Cp(A)*°C, (A). Hence, 


|A| =C, ' (VC, (A)*C; (A)). 


Since C,(A) depends continuously on b and b can be chosen? to depend continu- 
ously on g, the assertion follows. 


Proposition 7.5.7 The space Y (V, w) is contractible. 


Proof By Corollary 7.1.5, we may assume that (V, w) = (R2”, wo) with wo(x, y) = 
x- (Jny). According to Example 7.5.5, —Jn € A(R”, @o). We are going to con- 
struct a homotopy between the constant mapping J +» —J,, and the identical map- 
ping of 4 (R", wo). By wo-compatibility, for every J € Y (IR*”, wo), the matrix 
ay := JnJ is positive and symmetric. Hence, we can take a} for every s € IR and 
define a mapping 


H :[0,1] x Y£(R”",@0) > Man(R), (8, J) = — Ina. 


Clearly, H is continuous and satisfies H(O, J) = —J, and H(1, J) = J. Thus, 
in order to prove that it yields the desired homotopy, we have to check that 
His, J)e J (R*",@o) for all 0 < s < 1. Compatibility with wo follows from 
wo(x, H(s, J)y) =x- (ayy) and the fact that a‘, is positive symmetric for all s. Since 
H(s, J)? =In (as) Jas; to see that H(s, J)” = —1, it suffices to show that a‘, is 
symplectic. Since a, is positive symmetric, it possesses a basis {e;} of eigenvectors 
with eigenvalues a; > 0. Since — J, and J are wo-compatible, they are symplectic. 
Hence, so is a;. Therefore, 


wo (€@;, €j) = Wo(ayej, ase; ) = aja j;H0(e;, ej) 


3By means of a local section of the submersion GL(V) — GL(V)/O(V, go) (right cosets). 
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for all i, 7. It follows that aja; = 1 or wo(e;, ej) = 0. Then, 


wo(aye;, ae; ) = (aja) wo(e;, ej) = wo(e;, ej). 
Jer Gy j 


Thus, a‘, is symplectic for all s. This proves the proposition. 


Exercises 

7.5.1 Prove Proposition 7.5.3. 

7.5.2 Prove the statement of Remark 7.5.4/1. 

7.5.3 Let V be a real vector space. Let gj, g2 € 7. V* and let b € GL(V) such that 
gi(u, v) = go(bu, bv) forall u, v € V. Show that for A € End(V), the adjoints 
A*i of A with respect to g; are related by b(A*!)b~! = (bAb7!)*2, 


7.6 The Lagrange-Grafmann Manifold 


Let (V, w) be a symplectic vector space of dimension 2n. In this section, we study 
the structure of the space “(V, w) of Lagrangian subspaces of (V, w). For that pur- 
pose, we choose an w-compatible complex structure J on (V, w). For simplicity we 
write U(V) = U(V, h;) and O(V) = O(V, g;). For L € &(V, w), let O(L) denote 
the isometry group of L with respect to the scalar product on L induced by gy. 
Proposition 7.5.3/3 implies that O(L) may be naturally viewed as a Lie subgroup of 
U(V) via the identification 


O(L) = {a €U(V) :aL=L}. (7.6.1) 


Let G(k, V) denote the GrafSmann manifold of k-dimensional (real) linear subspaces 
of V, cf. Example 5.7.6. 


Proposition 7.6.1 2(V, @) is an embedded submanifold of G(n, V) of dimension 
mr) Every L € £(V, a) defines a diffeomorphism 


U(V)/O(L) > Z(V,@), [a]ReaL. (7.6.2) 
Proof Let L€ @(V, @) be given. It defines a diffeomorphism 
y : O(V)/(O(L) x O(L*)) + Gv, V), [a] aL, 


where L+ denotes the gy-orthogonal complement of L, which is Lagrange by 
Proposition 7.5.3/2. For every a € U(V) C O(V), the subspace aL is Lagrange, be- 
cause, by (7.5.3), a is symplectic. Conversely, for every Lagrangian subspace L’ we 
find a € U(V) such that L’ = aL: choose orthonormal bases in L and L’. Accord- 
ing to Proposition 7.5.3/4, these bases induce orthonormal symplectic bases in V. 
There exists a € O(V) transforming the latter bases into one another. Since these 
bases are symplectic, so is a. Hence, (7.5.3) implies a € U(V). We conclude that 
aL is Lagrange iff ae U(V). 
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On the other hand, by (7.6.1), the stabilizer of L under U(V) is given by the Lie 
subgroup O(L) Cc U(V). Thus, #(V, w) is the image of the homogeneous space 
U(V)/O(L) under gy. According to Theorem 5.7.2, this endows Y(V,@) with a 
natural smooth structure. Applying points | and 2 of Corollary 6.5.3 to the mapping 


U(V)/O(L) > O(V)/O(L) > O(V)/(O(L) x O(L*)), 


we obtain that “(V,q@) is an embedded submanifold of G(n, V). Finally, using 
U(V) = U(n) and O(L) = O(n), we find dim. Y(V, w) = 4. 


Remark 7.6.2 


1. Under the identification (7.6.1), with respect to the decomposition V = L @ L+, 
the subgroup O(L) consists of the block diagonal endomorphisms a @ (—JaJ) 
with a € O(L). 

2. By Proposition 7.6.1 and (7.5.3), Sp(V, @) acts transitively on #(V, w). 


Definition 7.6.3 2(V, @) is called the Lagrange-GraBmann manifold of (V, @). 


Example 7.6.4 Consider V = R2” with the standard symplectic structure defined by 
J,. Take J = —J, and L = R” x {0}. Then, under the identification of R*” with C”, 
cf. Example 7.5.5, O(L) corresponds to O(n) C U(n). 


Later on, the intersection properties of Lagrangian subspaces will be relevant. To 
study them, one exploits the partition of @(V, w) relative to a chosen Lagrangian 
subspace L, given by the subsets 


B= e2 Vo) dm AL)=k). k= 0.5.2.0, 


Definition 7.6.5 The subset 


LL) = L(V, 0)\ A(L) =| J&L) (7.6.3) 


k=1 


of L(V, w) is called the Maslov cycle of L. 


First, we study %(L). Let L’ € Yo(L) and let Py;: V — L be the projection 
relative to the decomposition V = L @ L’. Then, 1 — Py yields a vector space 
isomorphism between the Lagrangian subspaces Lt C V and L’, because the kernel 
of 1 — Py is L. Moreover, by Proposition 7.5.3/2, we have L+ = JL. Thus, we 
obtain 


L’ = (1— Py)(L+) = ((1— Py) J)(L). (7.6.4) 


Next, we extend the endomorphism P;/J of L to a complex linear endomorphism 
Q_ of V, cf. Proposition 7.5.3/3: 


Ov(ut+Jv):= Py Jut+JPy Jv, u,veL. (7.6.5) 
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By construction, 
On (L)=L, (J -Orn)(L)=L, On J=JQz, (7.6.6) 
and for any u, v € L we have 
hy(Qzru, v) —hy(u, Orv) 
=o(Qzu, Jv) — ou, JQpv) = —o((J — Oru, (J — Qr)v), (7.6.7) 


because L and L+ are Lagrange. Since L’ = (J — Qy/)(L) is also Lagrange, Q 7) 
is Hermitian with respect to hy. Using this, together with Q7;/ J = J Q7,, we obtain 
the polar decomposition of J — Q7, with respect to hy;: 


J-Oy= er Os 


Since Q7,/(L) = L, we have ,/1 + OF, (L) = L. To summarize, the polar decompo- 


sition yields a certain element a of U(V) such that 


j= , 
Paul, as J- ou (7.6.8) 


1+ 0%, 
Note that a is a square root of the Cayley transform of Q7, and that Q7, can be 
reconstructed from a via 


1+ a 


5 (7.6.9) 


Ov=J 


i—a 


(Exercise 7.6.1). 


Proposition 7.6.6 Let Le 2(V, a). 


1. Yo(L) is open and dense in L(V, w). 
2. There exists a natural diffeomorphism* Yy(L) > S*L*, L' +> Sy), defined by 


Sy(u,v):=gy7(u, Qyv), u,veL. (7.6.10) 
The kernel of Sy) is given by JL'OL. 
As a consequence of point 2, %(L) is contractible. 


Proof 1. In view of Proposition 7.6.1 and the fact that the natural projection from 
U(V) to U(V)/O(L) is a submersion and hence open, it suffices to show that the 


4Recall that S?L* denotes the vector space of symmetric bilinear forms on L. 
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subset of U(V) of elements a satisfying aL M L = {0} is open and dense. The latter 
follows from the fact that the condition aL N L = {0} is equivalent to the condition 
that, for some basis {e;} in L, the system {e;}U {ae;} is linearly independent. 

2. Since L is Lagrange, the restrictions of g; and h, to L coincide. Thus, symme- 
try of S; follows from hermiticity of 7. We prove bijectivity. To show injectivity, 
assume that there are two Lagrangian subspaces L’ and L” such that S;) = S;. This 
means 


gi(u, (Qu — Qx")v) =0 


for all u,v € L, that is, Q7; and Q;»” coincide on L. Now, the second relation in 
(7.6.6) implies L’ = L’’. To prove surjectivity, let S € S?L*. Via (7.6.10), to S there 
corresponds a gy-symmetric endomorphism Q of L, which according to Propo- 
sition 7.5.3/3 extends uniquely to a Hermitian endomorphism of V. Consider the 
subspace L’ = (J — Q)(L). Since J(L) = L+, we have L’ N L = {0} and since Q 
is Hermitian, (7.6.7) implies that L’ is Lagrange. Hence, L’ € Y%(L). Using that Q 
leaves L invariant and that 


(Py) tt =idz, (PL) tL = 9, 


we find 


(Qr — Q)pr = Pro (J — OQ), = 0. 


This implies S;/ = S. Thus, the assignment L’ +> S,) is bijective, indeed. Dif- 
ferentiability in both directions follows from the facts that the natural projection 
U(V) > &(V,o) is a submersion and that the assignments Q +> a andar Q 
given by (7.6.8) and (7.6.9), respectively, are analytic. Finally, to determine the ker- 
nel of Sz’, let u € L such that O7/(u) = Py J(u) = 0. Then, J(u) € L’ and hence 
ue JL'OL. 


Remark 7.6.7 
1. By definition, we have 
Spy(u,v)=a(tu,JQzv), u,veL. 


2. Let {e;} be an orthonormal basis in L and let A be the matrix of —@Qz) in this 
basis. The relation L’ = (J — Q7/)(L) reads 


n n 
L'= Seat neon =D avai}, (7.6.11) 
i=1 al 


where f; = Je;, and the diffeomorphism %(L) — S*L* yields a diffeomor- 
phism 


gy: L(L) > S?R", L' > 9(L') =A. (7.6.12) 
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With respect to the basis {e;} of the complex vector space V, the element a € 
U(V) transforming L to L’, found in (7.6.8), is represented by the matrix 
i+A 
a= —__., (7.6.13) 
V1+ A? 


A can be reconstructed from a via 


+a? 


A=-i- 5 


l-a 


(7.6.14) 
(Exercise 7.6.1). 


Next, we study the Maslov cycle LP (L) of L. For that purpose, we choose an 
orthonormal basis {e;} in ZL and take the diffeomorphism (7.6.12) as a local chart on 
the open subset %(L) of Y(V, w). To construct local charts covering the remaining 
part of #(V, w), that is, the Maslov cycle S(L), for every subset K C {1,...,n}, 
we define 


(a) Lx to be the real subspace of V spanned by all f; with i €¢ K and all e; with 
JéK, 
(b) ax to be the linear mapping V — V defined by 


- TEK -e ice€K 
ake; = fi a , aK fi = “i a , (7.6.15) 
e i¢K, fi i€kK. 


Obviously, Ly = L. One has Lx =axL and ax € U(V), because it is both orthog- 
onal and symplectic. It follows that Lx is Lagrange and that %o(Lx) =ax Zo(L). 
Hence, for every K we can define a local chart gx on %o(Lx) by transporting g by 
the help of ax: 


ox: A(LK) > S’R", gx (L') :=9(aZ'L’). (7.6.16) 


Proposition 7.6.8 For every L’ € &%(L), there exists a subset K of {1,...,n} with 
k elements such that L' € Lo(L x). In particular, the family 


{(Zo(Lx), ox): K C{l,...,n}} 


yields an atlas on Z(V,@). 


Proof Let W, := L'N L. If dim W,; =k <n, there exists i; € {1,...,} such that 
ei, € W,. Now, let W2 be the subspace of L spanned by W and ¢;,. Iterating the 
argument, after n — k steps we arrive at a sequence ij,..., in, With the property 
that the subspace of L spanned by ¢;,,..., é;,_, is complementary to L’N L. Let 


R211 3h) ligne 
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Then, LM Lx is spanned by ¢;,,..., ¢;,_,, so that L = (LN L’) @ (LN Lx). Since 
both L’ and Lx are Lagrange, this implies w(L’ 1 Lx, L) = 0 and hence, since L 
is Lagrange, L'N Lx C L. It follows that 


EL nlg=LOLEAL=(LNL)n(lolg)={(0) 


and hence L’ € %(Lx). It is obvious that the family {%o(Lx): K C {l,..., n}} 
defines a covering of “(V, w). The proof of the compatibility of the chart mappings 
with the smooth structure provided by Proposition 7.6.1 is left to the reader. 


Remark 7.6.9 Let L’ € YLxK) and denote A = yx (L’). Then, g~!(A) = aie Lt 
and thus L’ = axaL, where the matrix of a in the basis {e;} is given by (7.6.13). 
Moreover, since A is the symmetric matrix assigned to det! by 9, (7.6.11) yields 


n n 
ax'L' =} Gi fit pie): pi =~ suai}. (7.6.17) 
i=1 j=1 


Using (7.6.15), we conclude that L’ consists of the vectors )~_, (qi fi + piei) which 
satisfy the n equations 


gk = >) Angi — Yo Aur, KEK, (7.6.18) 
j¢K leK 

pi= > Aijai- Aur, i¢K. (7.6.19) 
j¢K leK 


Proposition 7.6.10 For every k= 1,...,n, the local charts (Zo(LK), 9K) with K 
consisting of k elements cover &(L). For every such K, the subset 


Li (L)1 Lo(Lk) 
is mapped under ~x onto the subspace of S*R" defined by the ker) relations 
Ajj =9, ijekK. 


Proof The first assertion is a direct consequence of Proposition 7.6.8. To prove the 
second assertion, let K be given, let L’ ¢ &(L) 1 Zo(Lx) and let A = x (L’). 
Then, eae bl is given by (7.6.17). By definition of ax, we have ag hy =axL’ and 
thus 


ay L'N Lx =axLl'/ NLx =ax(L'NL). 
Hence the subspace ae Lx has dimension k. Writing 


n 
Le =} > G@ifit pie): qi =0 for alli ¢ K, pj =orwan jen} 
i=l 
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we see that in the intersection a, i LK, p; is determined by g; foralli =1,..., 
and q; can only be nonzero for i € K. Since ig DB Lx has dimension k, the 
equations 


n 
O= p=) Aiujgqi= >. Aig; i€K, 
j=l jek 


must, therefore, be satisfied for all g; ¢ R. Hence, Ajj = 0 for alli, j € K. 


Combining Proposition 7.6.10 with Proposition 1.7.3, we obtain 


Corollary 7.6.11 For every k =0,...,n, the subset 4 (L) of L(V, w) is an em- 
bedded submanifold of codimension Mer) 


Remark 7.6.12 Corollary 7.6.11 implies that the Maslov cycle L (L) is a stratified 
subset of #(V, w), with the stratum % (L) having codimension | in @(V, q@) and 
the other strata having codimension at least 3. This implies that (L) is open and 
dense in SAL). 


Exercises 
7.6.1 Prove the reconstruction formulae (7.6.9) and (7.6.14). 


7.7 The Universal Maslov Class 


In this section, we use some elementary facts from topology, see e.g. [55], [124] or 
part III of [76]. Our presentation follows the line of reasoning of a classical paper 
by Arnold [13], see also [79]. As in Sect. 7.6, let there be chosen an w-compatible 
complex structure J. Let us choose L € £(V, w) and let us consider the mapping 


det: U(V) > S'CC, at deta. 


Since on the subgroup O(Z) it takes the values +1, via the isomorphism (7.6.2) 
defined by L, it induces a well-defined mapping 


det? : L(V,@) > S' CC. (7.7.1) 
For a € U(V), the mappings det? and det? , differ by a constant phase factor: 
det”, = det~*(a)det}. (7.7.2) 


Definition 7.7.1 (Maslov index) The Maslov index j(y) of a closed curve y : S! = 
R/2xZ + L(V, @) is defined to be the degree of the mapping? det? oy: Si>s!. 


5Cf. Remark 4.3.6/4. Both S! = R/27Z and S! C C are endowed with the natural orientations. 
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Proposition 7.7.2 The Maslov index is homotopy invariant, that is, it is constant on 
homotopy classes of closed curves. It does not depend on the choice of L or J. 


Proof Homotopy invariance follows from the continuity of the mapping (7.7.1) and 
from the fact that the mapping degree is homotopy invariant. By (7.7.2), 4 does 
not depend on L. Using this, as well as homotopy invariance of jz and point 2 of 
Remark 7.5.4, one obtains independence of J. 


According to Proposition 7.7.2, the Maslov index defines a homomorphism 
uw: (L(V, @)) > Z, (7.7.3) 
denoted by the same symbol. 
Proposition 7.7.3. The mapping (7.7.3) is an isomorphism. 


Accordingly, 21(2(V, @)) is isomorphic to Z and yu is a generator of the Abelian 
group Hom(1 (L(V, @)), Z) = Z. 


Proof Let L € &(V,q@). We choose an orthonormal basis in L and use the cor- 
responding matrix representation to identify U(V) with U(n). Then, O(L) = {a € 
U(V) : aL = L} is given by the subgroup O(n) C U(n). We have to show that the 
homomorphism zr; (U(n)/O(n)) > 21 (S!) = Z induced by y & det? oy, where y 
is a closed curve in U(n), is bijective. Thus, for every k € Z, define a curve 


ve .[0, 1] > UM), v(t) = diag(e"?, 1,..., 1). 


Since y,(0) and y,(1) lie in O(m), the curves 7% project to closed curves 7% in 
U(m)/O(n), starting and ending at [1]. Note that det? oy, winds k times around 
S!. This proves surjectivity. To prove that it is injective, we show that every closed 
curve y in U(n)/O(n), starting and ending at [1], is homotopic to one of the curves 
x. Since the projection U(n) + U(n)/O(n) admits local sections, y has a lift y : 
[0, 1] — U() starting at 1 and ending at some a € O(n). Depending on whether 
det(y(1)) equals +1 or —1, by composing y with a curve in O(n), we can make 
it end at 1 or at (—1,1,..., 1). In the first case, y is a closed curve in U(n). It is 
therefore homotopic to y% for some even k, see Exercise 7.7.1. Then, y is homotopic 
to 7. In the second case, we use that y - y_; (pointwise multiplication in U(n)) is 
closed in U(n) and hence is homotopic to yz, for some even k. Then, y is homotopic 
to Ye V1 = Ye+1 and hence y is homotopic to 7x41. 


Now, let ¢@ denote the standard angle coordinate on S! CC and consider the 
1-form 


1 x 
w= 5, (det) do (7.7.4) 
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on &(V,@). By (7.7.2), for a € U(V), we have 
(det? )“d@ = d(@ 0 det?) = d(¢ 0 det~*(a) + ¢ o detz) = (det7)*d¢, 


because ¢ 0 det~*(a) is a constant. Hence, the 1-form jz does not depend on the 
choice of L. The following observation justifies the notation 2. 


Proposition 7.7.4 For every closed curve y :S' > Y(V,«), 


Wy) =I ML. 
Y 


Proof By definition of the integral over a submanifold, 


1 
= *u=— | (det? “dd. 
fu [ive =~ [ (eet. er) ) 


By definition of the mapping degree, cf. Remark 4.3.6/4, this equals 


EZ deg(det; o y) [ dp = u(y) 
Qn fy si : 


Definition 7.7.5 (Universal Maslov class) The 1-form (7.7.4) is called the universal 
Maslov class of #(V, w). 


Remark 7.7.6 


1. Integration of 1-forms over closed curves yields a homomorphism from the de- 
Rham cohomology group H! (L(V, w)) to Hom(z; (L(V, w)), R). Using meth- 
ods of algebraic topology, one can show that this homomorphism is in fact an 
isomorphism.° Since 2\(Z(V, @)) = Z, we have Hom(2 (L(V, )), R) = R. 
Hence, Proposition 7.7.4 states that under the above isomorphism, the 1-form 
weH '(L(V,w)) defined by (7.7.4) corresponds to the homomorphism jz € 
Hom(71(7(V, @)), Z) C Hom(71 (L(V, w)), R) defined by (7.7.3). In particu- 
lar, the first one spans H '( L(V, w)). This explains the name universal class. 

2. We express the mapping (7.7.1) in terms of the local charts yx on “#(V, @), in- 
duced by an orthonormal basis {e;} in L, cf. Proposition 7.6.8. Let L’ € Yo(Lx) 
and let A = yx (L’). Due to Remark 7.6.9, L’ = axaL, where the matrix of a 


®Tt decomposes into the isomorphisms 
H'(L(V,@)) > H' (L(V, @),R) > Hom(Ai (-Z(V, w)), R) > Hom(m1((V, @)), R) 


provided by, respectively, the de Rham Theorem, the Universal Coefficient Theorem and the 
Hurewicz Theorem. Here, H! (., R) and Hj(-) denote the first integer-valued singular cohomol- 
ogy group and the first singular homology group, respectively. 
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i+A 
J1+A2- 


with respect to the basis {e;} in the complex vector space V is given by 


Using that detax = 1, cf. Proposition 7.4.1/4, we obtain 


At+i 
det? (LZ) = det(a*) = aer( 4 = 7 


Next, we show that the Maslov index can be viewed as an intersection index. It 
was this way this index occurred first in the work of Maslov [197]. For that purpose, 
we consider the action of the center U(1) C U(V) on #(V, w), given by 


U1) x ZV,a)> L(V,0), — (e",L) he LE L(V,0). 


This action is generated by the vector field’ 
d : 
iin — leh), 
(Js) df ty (e ) 


Proposition 7.7.7 The vector field J, is transversal to Z\(L) and thus defines a 
coorientation® on LY (L) for all L € Y(V, a). This coorientation does not depend 
on J. 


Proof Let L € Y(V, w) and consider the local charts (%o(Lx), gx) defined by an 
orthonormal basis in L. According to Proposition 7.6.10, 21 (L) is covered by the 
charts (Yo(Lx), 9x) with K ={m},m=1,...,n, and the subset 4 (L)NZ(Lr) 
is mapped under yx to the subspace of S7IR” defined by the single equation Amm = 
0. Let L’ € “(L) 1 LYo(Lx) and let A = yx (L’). According to Remark 7.6.9, we 
have L’ =axaL, where the matrix of a with respect to the basis {e;} is given by 
(7.6.13). Hence, the local representative of (J,.)7 in the chart gx is given by 


d ity/\ _ d it _ d it 
a ee (e L ) = ie (e axaL) =i ve aL) 
2it 


see (7.6.16). For sufficiently small t, the matrix of e”“a? does not have an eigenvalue 


1, hence we can use (7.6.14) to calculate 


d # gy. 8 PEPE a. 59 2\-2 2 
rae aL) = rae i = 4a*(1—a*) “ =—-(1+ A’). 


Thus, in the chart yx, (J,)z is represented by —(1 + A’). Since A is symmetric, 
A” has nonnegative diagonal entries. This proves that J, is transversal to Y(L). 
That the orientation of the normal bundle so defined does not depend on J follows 


7This is the Killing vector field generated by J under the action of Sp(V, w) on Z(V, a). 
8That is, an orientation of the normal bundle of the submanifold Y, (L), cf. Remark 2.7.18/2. 
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from the fact that, according to Proposition 7.5.7, the subset Y (V, @) of sp(V, a) 
is arcwise connected. Indeed, since the mapping 


sp(V, a) > Ty L(V, o) > Ny A(L) EZR, 


defined by the Killing vector fields of the action of Sp(V, w) on #(V, w), is smooth 
and since the image of an element of _Y (V, w) under this mapping is nonzero, the 
orientation cannot change along a curve in Y (V,w). 


Remark 7.7.8 As a corollary of the proof, we note that the local representative of 
the Killing vector field J, with respect to the local chart gj) points from the side 
where Am iS positive to the side where Aj» is negative. 


Now let L € #(V,o), leta,b € Rand let y : [a,b] > &@(V, w) be a curve with 
endpoints in %(L). A number t € (a, b) such that y(t) € Y(L) is called a cross- 
ing of y with SL). A crossing ¢ is said to be simple if y(t) € “(L). A simple 
crossing is said to be transversal if y(t) ¢ Ty (7-4 (L). Depending on whether y(t) 
is positively or negatively oriented with respect to the canonical coorientation of 
-£\(L), a simple transversal crossing is said to be positive or negative. That is, a 
simple transversal crossing is positive iff y(t) and (J), point to the same side of 
-£\(L). In the following, when we say that two curves with the same end points are 
homotopic with fixed end points we always mean that there is a homotopy preserv- 
ing the endpoints. 


Proposition 7.7.9 


1. Every curve in 2(V, @) with end points in Zo(L) is homotopic with fixed end 
points to a curve which has only simple transversal crossings with @(L). 

2. If two curves in L(V, @) which have the same end points in Yo(L) and which 
have only simple and transversal crossings with @(L) are homotopic with fixed 
end points, their differences between the numbers of positive and negative cross- 
ings coincide. 


Proof pews by a homotopy we mean a homotopy with fixed end points. Denote 
n(n+ 

1. Let y : [a,b] > L(V, ow) with y(a), y(b) € Zo(L) be given. Since the com- 
plement oP (L) \ “{(L) is the closure of the embedded submanifold %(L) which 
has codimension 3, y is obviously homotopic to a curve, denoted by the same sym- 
bol, which has the same endpoints but only simple crossings, that is, it is contained 
in Yo(L) UY (L). Since “(L) is an embedded submanifold and since [a, b] is 
compact, there exist real numbers a = fp < t) <--- <t-—-1 <t, =b and local charts 
(U;,«;),i=1,...,r,0n Z(V,w) mapping % (L) to open subsets of hyperplanes 
in R” such that 


y((#-1,t]) CU; y ((j — 2¢, | + 2e)) CU;1NU;, i=l,...,r. 
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Using «;, we can construct a smooth homotopy H; from y to a curve y, with 
1 (t1) € Yo(L), which has only transversal simple crossings between a and t; and 
which coincides with y for t > tf) + ¢. Next, using «2, in a similar way we can con- 
struct a smooth homotopy H2 from y; to a curve 72 with y2(t1), y2(t2) € Zo(L), 
which has only transversal simple crossings between f} — € and f2 and which coin- 
cides with y; outside (t; — &, f2 + €). We iterate this procedure until we arrive at a 
smooth homotopy H, from y;—; to a curve y, with y,(t--1) € Yo(L), which has 
only transversal simple crossings between t,_1 — ¢ and b, and which coincides with 
y-—1 for t < t-_1 — ¢. Then, y, is homotopic to y and has only transversal simple 
crossings. 

2. Denote the homotopy by H : [a,b] x [0,1] ~ @(V,q@). For s € [0,1], we 
define tt y,;(t) := H(t, s). By the codimension argument of point 1, H can be 
chosen so that it stays in %(L) UY (L). Choose an orthonormal basis in L and 
consider the corresponding local charts (-4o(Lx), 9x). Due to Proposition 7.6.10, 
Lo(L) U YL) is covered by the charts yx with K being empty or having a sin- 
gle element. Using this fact and compactness of [a,b] x [0, 1], we find a = tg < 
th <-++<t, =b and 0= 59 < 51 < +--+ <5» =1 such that each of the squares 
[ti, ti+1] x [sj,5j+1] is mapped under H to the domain of a single chart PKi;- 
We may furthermore assume that the crossings of yo and y are distinct from 
ti,...,t--1. For each j = 1,..., p — 1, by applying the procedure of point 1, the 
family of charts {PKo;> 65 QK,_ ijt can be used to construct a homotopy from y; ; to 
some curve, still covered by these charts, whose crossings are simple and transver- 
sal and are distinct from t,..., ¢-—1. Thus, by plugging in such a homotopy forth 
and back at s = s; if necessary, we may assume that the crossings of ys, are sim- 
ple transversal and distinct from t),...,t--1. This shows that it suffices to prove 
the assertion under the assumption that the homotopy H stays in the domain of a 
single chart gx with K having at most one element. Now, if H stays in %(L), the 
assertion obviously holds. If not, then K has one element and gx maps the relevant 
subset of 7 (L) onto a whole subspace of R” of codimension 1. Here, the assertion 
is obvious as well. 


As a consequence of Proposition 7.7.9, we can define 


Definition 7.7.10 (Maslov intersection index) The Maslov intersection index rela- 
tive to L € L(V, q@) of acurve y in Y(V, @) with end points in Y%(L) is defined 
as 


Indz(y) := v4 —v-, 


where v, is the number of positive crossings and v_ is the number of negative 
crossings with &(L) of a curve which is homotopic with fixed end points to y and 
whose crossings are all simple and transversal. 


According to Proposition 7.7.9, the Maslov intersection index is invariant under 
homotopies with fixed end points. Moreover, by construction, it is additive with re- 
spect to the composition of curves. We now show that, for closed curves, it coincides 
with the Maslov index defined before. 
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Theorem 7.7.11 Let y : [0,1] > @(V, w) be aclosed curve and let Lée L(V, w) 
be transversal to the Lagrangian subspace y (0) = y (1). Then, Indy (y) = u(y). 


Proof Let us take L’ := y(0) = y (1) as the base point of 2 (2(V, w)). By additiv- 
ity, Indy defines a homomorphism 7 (2(V, w)) > Z. Since, by Proposition 7.7.2, 
w1( L(V, w)) = Z and since two homomorphisms Z — Z coincide if there exists 
k € Z where they take the same nonzero value, it suffices to find a closed curve 
y, : [0,1] > Y(V,@) with Ind, (1) = “(“1) 4 0. Such a curve will be con- 
structed now. Since L and L’ are complementary, according to Remark 7.5.4/1, 
there exists J € Y(V,w) such that L’= JL. Choose a g;-orthonormal basis {e;} 
in L and let f; := Je;. Define the curve y; by requiring y)(t) to be the real sub- 
space of V spanned by emit fi =cos(at)e; — sin(zt) f; and fo,..., f,. Obviously, 
yvi(1) = 71 (0) = L’. Under the identification of Y(V, w) with U(n)/O(n) induced 
by L’, J and the orthonormal basis { fj} in L’, this curve corresponds to the curve 
ve with k = | used in the proof of Proposition 7.7.3. Either by this observation or 
by direct inspection, we find 


det” (i (t)) = det? diag(e™", 1,..., 1) =e?7", 


hence j4(y1) = 1. To determine Indz,(y1), we observe that 7; (t) 1 L = {0} unless 
s= 5 and that yj G)  L is spanned over R by e;. Hence, y; has a single crossing 
with “(L) and this crossing is simple and transversal. To find out whether it is 
positive or negative, we use the local charts (%(Lx«), yx) defined by the basis {e;} 
in L. For t € (0,1), y(t) € Zo(Lx) with K = {1}, because Ly1; is spanned by 
Fi, €2,-+-5n- Since any (0) is spanned by fo,..., f, and the vector 


ai} (e™" fi) = ee, = cos(rt)e; + sin(xt) fi, 


from (7.6.17) we read off 911)(71(t)) = diag(cot(zt),0,...,0). Since cot(zt) is 
positive for t < 5 and negative for t > 5 according to Remark 7.7.8, the crossing is 
positive. This proves the assertion. 


Corollary 7.7.12 If two curves yo, 1 : [0,1] > @(V, w) with the same end points 
satisfy Indy (y,) = Indz (y2) for some L € L(V, w) which is transversal to both end 
points, they are homotopic with fixed end points. 


Proof By additivity, the closed curve “i - Yo obtained by composing yo with 
running backwards has Maslov intersection index zero. By Proposition 7.7.3, it is, 
therefore, homotopic to the constant curve t +> yo(0) = 7 (0). We can certainly 
find a homotopy 4H : [0,1] x [0,1] ~ @(V,q@) such that H(1,t) = yo(2t) for 
t< 5 and H(1,t) = y1,(2 — 2t) for t > 5. It is enough to show that yo and y; 
are homotopic with fixed endpoints to the curve y 1 : [0,1] ~ &C(V, w) defined by 
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yi(t):= A(t, 5t). For yo, a homotopy is given by 


H(s, 1r) t<s 


A:(0,1x[0.>L(V,o), As.) = Hel): ‘tsp 


For y;, there is an analogous formula. 


Remark 7.7.13 In [252], Robbin and Salamon have shown that the definition of the 
Maslov index relative to a chosen Lagrangian subspace for non-closed curves can 
be extended in a natural way to the case of arbitrary positions of the end points. We 
add a few comments on that important paper. In the next section, we will present 
a systematic treatment of this situation in terms of the Kashiwara index. The basic 
tool of the analysis of Robbin and Salamon is the following quadratic form associ- 
ated with a tangent vector of Y(V,w). Let L’e L(V, @) and let X € Ty) L(V, w) 
be represented by a curve y. Without loss of generality, we may assume that y stays 
in “(L'), where L’/' = J(L’) for some chosen J € J (V,w). Then, Proposi- 
tion 7.6.6/2 assigns to y acurve S, in S?L. Define Qy € S*L’* by 


Qx = —S,(). 


The assignment X +> Qy is a vertical vector bundle morphism from TY(V, @) to 
the vector bundle over “(V, w) with the fiber over L’ being S?L’*. The argument 
of the proof of Theorem 1.1(1) in [252] shows that Qy does not depend on J, which 
entered the definition presented here through L’ +. Now, let y :[a,b] > L(V.) 
be a curve and let L € Y(V,q@) be given. For every crossing t of y with SL), 
define a crossing form by 


Ty. = (Qywp) fywnL- (7.7.5) 


A crossing is said to be regular if [,,7,, is non-degenerate. This generalizes the 
notion of transversality for crossings with (L) to arbitrary crossings. Regular 
crossings are isolated. For curves y which have only regular crossings, the Maslov 
index relative to L is defined by 


U4 I 
Ind, (7) = 5 sign Ty,t.a + Y= sign ly.24+ 5 sign Ty,.1.b. (7.7.6) 


a<t<b 


where the sum runs over all inner crossings and sign denotes the signature.” One 
can show that every curve is homotopic with fixed end points to a curve with only 
regular crossings and that Ind; is invariant under homotopies with fixed end points. 
Thus, Ind; naturally extends to arbitrary curves. The Maslov index so defined has 


°The number of positive minus the number of negative eigenvalues. 
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a number of natural properties, listed in Theorem 2.3 of [252]. In particular, it is 
additive with respect to the composition of curves and satisfies 


Indy (vy) =Indgt (ay), aeSp(V,o). (7.7.7) 


The latter is a consequence of the invariance of the crossing form (7.7.5) under 
symplectomorphisms (Exercise 7.7.3). Finally, to express I), for a crossing t 
with y(t) € &(L) in local charts, choose an orthonormal basis {e;} in L. For an 
appropriately chosen index set K consisting of k elements, let A(s) := gx (y(t+s)). 
Then, 


n 
Fy ir10,w)= >) vAjOw!, vweyOOL, (7.7.8) 
i,j=l 


where v! and w/ denote the coefficients with respect to the basis {e;} (Exer- 
cise 7.7.4). 


Exercises 
7.7.1 Prove that the mapping 


Ud) x SU(m) > Um), (a,a)+> diag(a, 1,..., la, 


is a diffeomorphism. Use this and the fact that SU(n) is simply connected to 
show that every closed curve in U(7) is homotopic to one of the closed curves 


ve 20, 1] > UM), veG) = diag(e?" 1 4<51): 


This completes the proof of Proposition 7.7.3. 

7.7.2 Determine “(V,w) for V = R* and analyze the Maslov indices in this ex- 
ample. 

7.7.3, Prove that the crossing form (7.7.5) is invariant under symplectic transforma- 
tions, that is, 


T(aA,aL,thoa=I(A,L,t), aeéSp(,R). 


7.7.4 Verify Formula (7.7.8). Analyze this formula by bringing A(s) for small s to 
an appropriate block form and showing that the corresponding (k x k)-block 
of A spans the normal bundle of &%(L). 


7.8 The Kashiwara Index 


In this section, we discuss another very useful index, which in the literature is usu- 
ally called the Kashiwara index.!° 


10Some authors call it Wall-Kashiwara index, others Hérmander-Kashiwara index. We refer to 
the textbooks by de Gosson [72], Libermann and Marle [181], as well as to the Lecture Notes of 
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By Remark 7.6.2/2, the symplectic group acts transitively on the space of La- 
grangian subspaces of the symplectic vector space (V,q), that is, any two La- 
grangian subspaces can be transformed into one another via a symplectomorphism. 
A pair (L1, L2) of Lagrangian subspaces can be transformed via a symplecto- 
morphism into another pair (L/,, L,) of Lagrangian subspaces iff dim(L, 9 L2) = 
dim(L) 1 L5) =k, because in this case there exists a symplectic basis {e;, fj}, such 
that L; and L> are spanned by {e;,...,e,} and {e1,..., ex, frri,---, fn}, respec- 
tively (Exercise 7.8.1). That is, the action of Sp(V, w) on Z(V, w) x L(V, w) has 
n+ 1 orbits, labeled by dim(L; 9 L2). It turns out that the symplectic group does 
not act transitively on triples (L;, L2, L3) of Lagrangian subspaces with prescribed 
dimensions of 


L100 Lp, LiNL3, Loa L3, Li,NL2NL3. 
This is related to the fact that for triples there is one further symplectic invariant. 


Definition 7.8.1 (Kashiwara index) Let (V,w) be a symplectic vector space. The 
Kashiwara index s(L1, L2, L3) of a triple (L;, L2, L3) of Lagrangian subspaces of 
(V, @) is defined as the signature of the quadratic form 


Q(L1, Lo, L3)(v1, v2, v3) = w(v}, v2) + w(v2, v3) + @(V3, V1) (7.8.1) 


on the vector space L; @ L2 @ L3. 


Example 7.8.2 Let us consider the canonical symplectic vector space (R*, — J), cf. 
Example 7.1.4. We denote v; = (q;, pi), i = 1, 2, 3, and choose 


L, =R x {0}, Ly := {(q2, p2) : p2 =aq2}, L3 = {0} x R, 
where a € R is fixed. This yields 
Q(L1, L2, L3)(v1, V2, V3) = —V} J1V2 — V5 J1¥3 — V3 J1V1 
= —44142 — P3492 + P34. 


Diagonalizing this quadratic form in the variables qi, g2 and p3, one gets (Exer- 
cise 7.8.2) 


Q(L1, Lz, L3) = x? — y? — sign(a)z”. (7.8.2) 
From this formula we read off 


—l1 fora>0 
s(L1, Lp, L3) =10 fora=0 (7.8.3) 
1 fora <0. 


Meinrenken [208], where the reader can find an exhaustive treatment including historical remarks. 
An axiomatic approach can be found in [58]. 
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We note that the index s measures the relative position of the three subspaces with 
respect to a chosen orientation: if R? is oriented clockwise, s vanishes if two of the 
three subspaces coincide, its value is +1 if Lz is located between L, and L3 (rel- 
ative to the orientation) and it is equal to —1 otherwise. For the counter-clockwise 
orientation, the signs are reverted. In the sequel, we will see that an analogous result 
holds for any n. 


Proposition 7.8.3. The Kashiwara index s has the following properties. 


1. Symplectic invariance: s(aL,,aL2,aL3) = s(Lj,L2,L3) for every aé 
Sp(V, @). 

2. Antisymmetry: s(Lz(), Lr), La3)) = (—1)8!8"™) s(L1, L2, L3) for every per- 
mutation 1. 

3. Additivity: for all triples (L',, L, L3) and (L), L5, L3) of Lagrangian subspaces 
in (V', a") and (V", w"), respectively, (Li ® Li, L, @ Ly, L3 ® L4) is a triple 
of Lagrangian subspaces in (V', w') ® (V", w") and one has 


s(L, @L1,.L, @ Ly, L5 @ L4) =s' (£4, L4, 14) +5" (£4, 14, 15). 


4. Normalization: the triple (IR, (1 +i)R, iR) in R? has Kashiwara index 1. 
5. Cocycle property: 


s(L1, Lo, L3) =s(Lo, L3, La) — s(L1, £3, La) + 5(L1, L2, La) 
for arbitrary Lagrangian subspaces L,, L2, L3 and L4 in (V,@). 


Proof Points 1-3 are obvious. 

4. This is Example 7.8.2 with a = 1 and —J, replaced by J,. Hence, sign(Q) = 
+1. 

5. According to Remark 7.2.7, there exists a Lagrangian subspace L which is 
transversal to each L;. Choose an orthonormal basis {e;} in L, let g : Z(L) > 
SR” be the diffeomorphism (7.6.12) and let A; := g(L;). Then, for each i, 
the vectors fj + ye (Ai) ver form a basis in L;. With respect to these bases, 
Q(L;, Lj, Lx) is given by the matrix 


0 Aij—Aj  Ak—Ai 
Q=| Ai - A; 0 A; = Ax |); (7.8.4) 
Ag—Aj; Aj — Ak 0 


The reader easily checks that Q is block-diagonalized by the help of 
0 1 


M= 0 1 
0 
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One obtains 
—M'QM = 0 Ag — A; 0 (7.8.5) 


Thus, 


s(L;, L;, Lx) = sign(A; — A;) + sign(A ; — Ag) + sign(Ag — Aj). 


This yields the assertion. 


Remark 7.8.4 


1. One can show that properties 1-4 define the Kashiwara index uniquely, see [58]. 
Thus, these properties can be used for an axiomatic definition of s. If a system 
5(L1, L2, L3) has only properties 1—3, it is proportional to the Kashiwara index s. 

2. For the special case L; 1 L3 = 0 we obtain 


s(L1, Lz, L3) =sign(q), (7.8.6) 


where q is a quadratic form on L2, given by g(v) := w(v1, v3), where vj and v3 
denote the components of v € Lz with respect to the decomposition V = L; @L3 
(Exercise 7.8.2). This is the situation of Example 7.8.2. Thus, in this case, (7.8.6) 
can be taken as the definition of s. If, in addition, L; N L2 = 0 and L3 = ia; 
Formula (7.8.6) yields 


s(L1, L2, Ly) = sign(A), (7.8.7) 


where A = ¢(L2) and ¢ is the diffeomorphism (7.6.12) (Exercise 7.8.2). This 
generalizes (7.8.3) to the case n > 1. 

3. Point 5 of Proposition 7.8.3 means that the Kashiwara index defines a 2-cocycle 
on .£(V, q@) with respect to the natural coboundary operator 


: Map(.2(V, @)?, R) > Map(“(V,@)*,R), 
given by 
Of (L1, Lo, L3, La) = f(L1, Lo, £3) — f (L2, £3, La) + f(L1, L3, £4) 
~ fi, Lo, L4), 


see [72, §1.4.2]. 
The Kashiwara index has a number of topological properties. 


Proposition 7.8.5 Let L; : [a,b] > @(V,@) be curves, i = 1, 2,3. 


1. If the dimensions of L,(t) Lot), Lo(t) N £3(t) and L3(t) NL; (t) are constant 
in t, so is the Kashiwara index s(L(t), L2(t), L3(t)). 
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2. Let L be a Lagrangian subspace transversal to both L,(t) and L(t) for every t. 
Then, the difference 


s(L (a), L2(a), L) — s(L1(b), L2(b), L) (7.8.8) 
does not depend on the choice of L. 


Proof 1. It suffices to prove that s(L1(t), Lo(t), £3(t)) is constant in a neigh- 
bourhood of fo for all a < to < b. Choose L € Y(V,q@) transversal to L; (to). 
There exists ¢ > 0 such that L remains transversal to L;(t) for |t — to] < e. As 
in the proof of Proposition 7.8.3/5, we use an orthonormal basis in L to represent 
Q(L1(t), Lo(t), L3(t)) for every such t by a matrix Qa), given by (7.8.4). Since 
Qa) depends continuously on ¢, if the rank is constant, the signature must be con- 
stant, too. Diagonalizing Qq), we obtain (7.8.5) and hence 


dimker Q = dim(L (¢) M Lo(t)) + dim(L2(t) N L3(t)) + dim(L1 (4) 1. L3(0)). 
Since rank Q(t) = 3n — dimker Qa), the assertion follows. 
2. Let L and L’ be two different Lagrangian subspaces with the required property. 
Then, the cocycle property implies 
s(Li(a), L2(a), L) — s(L1(@), Lo(a), L') = s(L2(a), L, L’) — s(Li(a), L, L’) 
and analogously 


s(L1(b), Lo(b), L) — s(L1(b), L2(b), L') = s(L2(b), L, L’) — s(L1@), L, L’). 


By point 1, s(L,(t), L, L’) and s(L2(t), L, L') are constant in t. This yields the 
assertion. 


Remark 7.8.6 


1. Point 1 of Proposition 7.8.5 implies the following. Let L; and Lz be Lagrangian 
subspaces and let t > L(t) be a curve of Lagrangian subspaces, such that for ev- 
ery t, the subspace L(t) is transversal both to L; and to Lz. Then, the Kashiwara 
index s(L1, L2, L(t)) is constant in f. 

2. One can show that, up to a symplectomorphism, every triple of Lagrangian sub- 
spaces is determined by the following five numbers [208]: 


dim(L;N L2), dim(Z1 9 £3), dim(L2 L3), 


dim(L1 N L2NL3), S(L1, Lo, L3). 


Now we can define the Maslov intersection index for an arbitrary pair of curves 
of Lagrangian subspaces. 
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Definition 7.8.7 (Maslov intersection index for pairs) For a given pair of curves 
L1, Lz: [a,b] > L(V, @), choose a = tg < t) <--- <t, =b such that for every 
j=1,...,r there exists a Lagrangian subspace L/ which is transversal to both 
L,(t) and L2(t) for all t € [t;-1, t;]. Then, the Maslov intersection index of L; and 
L2 is defined by 


r 


1 F . 
[L1: Lo] =5) (s(Li(¢j-1), Lo(j-1), L’) — s(Li (ty), Lo(tj), L’)). (7.8.9) 
j=l 


By Remark 7.2.7, a partition of the interval [a, b] having the desired properties 
exists. By Proposition 7.8.5/2, [L; : L2] neither depends on the chosen partition of 
[a,b] nor on the choice of the Lagrangian subspaces L/. Note that the curves Lj 
and L2 need not be transversal at the endpoints, cf. Remark 7.7.13. The intersection 
index of a curve, introduced in Definition 7.7.10, is a special case of the Maslov 
intersection index for pairs of curves. 


Proposition 7.8.8 Let L € @(V,q@) and let y be a curve in L(V, @) with end 
points in Lo(L). Then, [y : L] = Ind, (vy), where L stands for the constant curve 
th L. 


Proof Without loss of generality, we may assume that y has only simple transversal 
crossings with oP (L). Then, the partition of the domain of y in (7.8.9) can be cho- 
sen in such a way that LM y(¢;) = {0} for all 7 and every segment contains at most 
one crossing. By Lemma 7.8.5/1, segments without crossings do not contribute to 
[y : L]. If the j-th segment contains a crossing, we write the corresponding sum- 
mand in (7.8.9) in the form 


(s(v @j-1), L, L/) — s(v (ty), L, L’)) = —5(9(L, vj), L/) 


Nile 


—s(L, y(t;), L’)) 


and apply Formula (7.8.6). To determine the quadratic form qg, we choose an or- 
thonormal basis {e;} in LZ and an appropriate chart gx, with K = {m}. Denote 


A(t) := ox (y(t) = (goag')(y@), 


cf. (7.6.16). Hence, according to Remark 7.6.7/2, y(t) is spanned by the vectors 
ak(fi+ pee Ajjej),i=1,...,n, where fj = Je;. A straightforward calculation 
(Exercise 7.8.6) yields that g is given by 


q(v) = > Ajuiv! — Ammv™v™, (7.8.10) 
i,jAm 


where v = )0y_, Vax (fi + Vij=1 Aijej) € v7). Thus, if the crossing is positive, 
the signature of A(t) jumps by +2. Therefore, in this case, the j-th summand in 
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1 : ; 

5 (s(v @j-D, L, L!) — s(v(@), L, L/)) = 1. 

This yields the assertion. 


Remark 7.8.9 


1. The Hérmander index for a 4-tuple (L1, L2; M1, M2) of Lagrangian subspaces 
with the property that each L; is transversal to each M; is defined by 


1 
o(L1, La; My, M2) := 5 (s(L1, Ly, Mz) — s(L1, L2, M1), 
cf. [141]. By means of this index, the sum in (7.8.9) can be written in the form 


1 n 1 ; 
[L1: La] = 58(L1(a), Loa), L )- 59(L1(6), Lo(b), L ) 


r-l 
+> 0 o(LiGy), Loy); LY, L!*'). 
j=l 


The Hormander index has the following properties: 


o (Lj, L2; M,, M2) = —o(L2, L1; M), M2) = —o (Mj, M2; L), L2), (7.8.11) 


o(L1, Lo; M1, Mz) + o(L1, Lo; M2, M3) =o (11, Lo; Mi, M3). (7.8.12) 


2. There is a number of generalizations of the concept of Maslov index. In partic- 
ular, the Arnold-Leray-Maslov index is of importance. This index is defined on 
the universal covering manifold!! of the Lagrange-GraBmann manifold, see [72]. 
There also exists a generalization to the case of arbitrary n-tuples, see [289]. 


Exercises 

7.8.1 Determine the stabilizers and the orbits of the action of the symplectic group 
Sp(V, @) on L(V, @) x L(V, o). 

7.8.2 Prove Formulae (7.8.2), (7.8.6) and (7.8.7). 

7.8.3 Let (R*,—J) be the canonical symplectic vector space in two dimensions 
and let {e, f} be the canonical basis. Calculate [L; : L2] for L1(t) =Rf and 
Lo(t) = Rf +tfe) for alla <t <b. 

7.8.4 Calculate the sum [L; : £2] + [L2: £3] + [L3: L1] for arbitrary curves L; : 
[a,b] > L(V,o). 

7.8.5 Prove the following. If 2) = (£19 L2) + (£1 Z3), then s(L1, L2, £3) = 0. 

7.8.6 Prove Formula (7.8.10). 


11 This manifold is also called the universal Maslov bundle, see [72]. 


Chapter 8 
Symplectic Geometry 


Symplectic geometry plays a tremendous role both in pure mathematics and in 
physics. As we will see, it provides the natural mathematical language for the study 
of Hamiltonian systems. In this chapter, we present the basic notions of symplectic 
geometry. The starting point will be the Theorem of Darboux, which states that, lo- 
cally, all symplectic structures of a given dimension are equivalent to the standard 
symplectic vector space structure on R2” defined in the previous chapter. Thus, in 
sharp contrast to the situation in Riemannian geometry, there are no local invariants. 
Symplectic manifolds of the same dimension can at most differ globally.! Accord- 
ing to the above equivalence, many structures of local symplectic geometry have 
their origin in symplectic algebra. 

The second elementary, but very important observation is that on symplectic 
manifolds the symplectic form provides a duality between smooth functions and cer- 
tain vector fields, called Hamiltonian vector fields. As an immediate consequence of 
this duality, we obtain the notion of Poisson structure. Given the great importance of 
Poisson structures both in mathematics and in physics, we go beyond the symplectic 
case and give a brief introduction to general Poisson manifolds. 

There are two classes of symplectic manifolds which are especially important in 
physical applications: cotangent bundles and orbits of the coadjoint representation 
of a Lie group. We will see that the cotangent bundle serves as a mathematical 
model of phase space and coadjoint orbits are relevant in the study of systems with 
symmetries. Both classes of examples are discussed in detail. 

Moreover, in this chapter we discuss elementary properties of coisotropic sub- 
manifolds, present a number of natural generalizations of the Darboux Theorem and 
give an introduction to general symplectic reduction. The important special case of 
symplectic reduction for systems with symmetries will be dealt with in Chap. 10. 
The more advanced theory of Lagrangian submanifolds (including topological as- 


'There is a huge field of research called symplectic topology, which deals with the study of global 
invariants of symplectic manifolds. For a nice intuitive introduction to this field we refer the reader 
to an article of Arnold [22]. There is a number of detailed expositions of this subject, see e.g. [206]. 
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pects and the study of singularities) is contained in Chap. 12, where it finds one of 
its natural applications. 

Finally, the last section of this chapter is devoted to an elementary introduction 
to Morse theory. As we will see, the basic notions of this theory can be naturally 
formulated in the language of symplectic geometry. Moreover, concepts of Morse 
theory are of special importance in the study of symplectic manifolds and Hamilto- 
nian systems. This is related to the above-mentioned duality between Hamiltonian 
vector fields and functions. This duality yields a relation between the critical points 
of vector fields and the critical points of functions. Thus, qualitative dynamics of a 
Hamiltonian vector field can be investigated using methods of Morse theory. 


8.1 Basic Notions. The Darboux Theorem 


Let M be a manifold. The notions of linear symplectic algebra have the following 
counterparts on the level of M. Every differential 2-form w € 27(M) induces a 
vertical vector bundle morphism w?: TM —> T*M by 


(w?(X), Y):=@m(X,Y), X,Y €TnM. (8.1.1) 


The kernel and the rank of w are defined to be the kernel and the rank of w”. The 
2-form w is called non-degenerate if w is an isomorphism. This is equivalent to 
the requirement that w,, € A?(1*M ) be non-degenerate for all m € M. If w? is an 
isomorphism, the inverse mapping is denoted by w*. We often write X’ = w”(X) 
and a? = w*(a). 


Definition 8.1.1 (Symplectic manifold and symplectic mapping) 


1. A symplectic manifold is a pair (M, w) consisting of a manifold M and a closed 
non-degenerate 2-form @, called the symplectic form or the symplectic structure. 

2. A smooth mapping ® : M > N of symplectic manifolds (M, w) and (N, p) is 
called symplectic, or canonical, if ®*o = w. If @ is in addition a (local) diffeo- 
morphism, it is called a (local) symplectomorphism. 


If (M, w) is asymplectic manifold, then (T,, M, @m) is a symplectic vector space 
for all m € M. Therefore, a symplectic manifold must have even dimension. More- 
over, the tangent bundle TM is a symplectic vector bundle.” A smooth mapping 
@®: M — N is symplectic iff the linear mapping ®, : TnM — TounyN is sym- 
plectic for all m € M. This implies, in particular, that every symplectic mapping 
is an immersion, see Proposition 7.4.1/1. As in the case of symplectic mappings of 
symplectic vector spaces, the inverse of a symplectomorphism is symplectic. Hence, 
if one takes the symplectic mappings as the morphisms of symplectic manifolds, 


A symplectic vector bundle is a real vector bundle E endowed with a section w in N E* which 
is fibrewise symplectic. 
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the symplectomorphisms are the corresponding isomorphisms. Finally, we note that 
Definition 8.1.1 extends in an obvious way to manifolds with boundary, thus yield- 
ing the notions of symplectic manifold with boundary and symplectomorphism of 
symplectic manifolds with boundary. 


Example 8.1.2 


1. Every symplectic vector space (V,q@) is a symplectic manifold, where @ is 
viewed as a differential 2-form by means of the identification of the tangent 
spaces of V with V itself. This applies in particular to the canonical symplectic 
vector space (R2”, wo) discussed in Example 7.1.4. 

2. Let M =C", viewed as a (real) manifold with the coordinates x; given by 
Zk =Xe+IXnrg, K=1,...,n. The form 


i 7 
ea encs 


is symplectic. The coordinates x; define a symplectomorphism from (C”, @) onto 
(R2”, wo). 

. Every volume form on a 2-dimensional manifold is symplectic. 

4. The cotangent bundle T*Q of a manifold Q carries a natural symplectic struc- 
ture, see Sect. 8.3. This fact is one of the cornerstones of the theory of Hamil- 
tonian systems. In this context, Q plays the role of the configuration space and 
T* Q is the phase space of the system. 

5. Every orbit of the coadjoint representation of a Lie group carries a natural sym- 
plectic structure, see Sect. 8.4. This fact is of special importance for the theory 
of Hamiltonian systems with symmetries, discussed in Chap. 10. 

6. The product of two symplectic manifolds (M, wy) and (N, wy) is symplectic, 
where the symplectic form may be taken as the sum or the difference of the 
pullbacks of the symplectic forms of the factors under the natural projections, 


OW 


Ouxn = Ply Om + pry On- (8.1.2) 


Let (M, w) be a symplectic manifold of dimension 2n. By Proposition 7.1.7, the 
2n-form w” = w A---Aq@ isa volume form. In particular, symplectic manifolds are 
orientable. The form 


n(n—1) 


6,248 Zw" (8.1.3) 
n! 


is called the canonical volume form, or the Liouville form, of (M, w). 


Proposition 8.1.3. Every symplectic mapping between symplectic manifolds of the 
same dimension is a volume preserving local diffeomorphism. 


Proof Obviously, a symplectic mapping ® preserves the canonical volume. More- 
over, as already observed, Proposition 7.4.1 implies that &’ is fibrewise injective. 
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For dimensional reasons, it is fibrewise bijective then and the Inverse Mapping The- 
orem implies that @ is a local diffeomorphism. 


Remark 8.1.4 


1. By Proposition 8.1.3, the total volume of a symplectic manifold is a global sym- 
plectic invariant. 

2. Since dw = 0, in a neighbourhood U of every point there exists a local 1-form £ 
such that wy = df. Such a form is called a local symplectic potential. A sym- 
plectic manifold is said to be exact if there exists a global symplectic potential. 

Geometrically, the requirement dw = 0 means that the symplectic surface area 
of any 3-dimensional ball is equal to zero. This yields an obstruction for (even- 
dimensional) manifolds to admit a symplectic structure. In more detail, let us 
consider the case of an orientable compact manifold. Being closed, w defines an 
element [w] € H?(M, R), which must be nonzero, because its n-th power is the 
class of a volume form on an orientable compact manifold and hence nonzero. 
Thus, orientable compact manifolds fulfilling H?(M, IR) = 0 do not admit a sym- 
plectic structure. This applies, in particular, to any sphere S*” with n > 1. The 
study of the existence and uniqueness problem of symplectic structures for com- 
pact manifolds is a field of active research in symplectic topology, see Chaps. 7 
and 13 of [206]. In this context, a whole tool kit of operations for constructing 
symplectic manifolds has been developed. On the other hand, for noncompact 
manifolds (without boundary), Gromov has shown that a symplectic form exists 
iff the manifold admits a non-degenerate 2-form [206, Thm. 7.34]. Concerning 
the uniqueness aspect, it is worthwile to note that even on R2” there exist exotic 
symplectic structures, not symplectomorphic to the canonical one. 

3. The symplectomorphisms of (M, w) form a group with respect to composition, 
denoted by Symp(M, w). This group is at the heart of symplectic topology, see 
[206], Chap. 10. We will briefly discuss it in Sect. 8.8. 


The following theorem is fundamental for the theory of symplectic manifolds. 
It tells us that, locally, all symplectic manifolds of the same dimension are isomor- 
phic. This in accordance with the corresponding statement in symplectic algebra, 
see Corollary 7.1.5. 


Theorem 8.1.5 (Darboux) Let (M,w) be a symplectic manifold of dimension 2n. 
For every m € M, there exists a symplectomorphism from an open neighbourhood 
of m in M onto an open subset of the canonical symplectic vector space (R*", 0). 


Proof The proof is based on a deformation method developed by Moser [218]. Since 
the statement is local, without loss of generality we may assume that M = R?” and 
m = 0. According to Corollary 7.1.5, there exists a basis in R*” such that w(0) 
coincides with the canonical symplectic form wo. For t € [0, 1], let 


QO; =O +t- 0, 
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where @ := w— wo. Then, dw, = 0 and w,(0) = w(0) = wo is non-degenerate for all 
t €[0, 1]. Since the function [0, 1] x R2” 5 (t, x) > det@;(x) € R is continuous, a, 
remains non-degenerate on an open neighbourhood W of [0, 1] x {0} in R x R”. 
Since the interval [0, 1] is compact and R2” is locally compact, the Tube Lemma 
of elementary topology yields an open ball K about the origin of R” and an open 
interval J containing [0, 1] such that J x K C W. On the other hand, since dw = 0, 
the Poincaré Lemma yields a 1-form @ on K such that @ = da. By adding an exact 
1-form if necessary, we can achieve that a(0) = 0. Since @; is non-degenerate on K 
for all t € J, we can define a time-dependent vector field X on K by 


X10; = —aQ, tel. 


Let ® be the flow of X and let J C I x I x K be the domain’ of ®. Since a(0) = 0, 
we have X;(0) = 0 for all t € J, that is, 0 is a fixed point of ®. Hence, [0, 1] x 
[0, 1] x {0} C @ and by applying the Tube Lemma once again, we find an open 
neighbourhood U C K of 0 such that [0, 1] x [0,1] x U C J. Writing &; 9 = &, 
and using (4.1.28), as well as (4.1.30) with d replaced by ®*, we compute 


d d ~ 7 


Taken pointwise on U, this implies that fw = Of a; = Oj wo = wo. Thus, ® = 


dD, | is the desired local symplectomorphism. 


Remark 8.1.6 


1. By the Darboux Theorem and by Corollary 7.1.5, for every m € M, there exists 
a local chart (U, «) at m such that 


n 
Ory = Side! Adk"*, 
i=1 


Moreover, by Proposition 7.2.9, the complementary Lagrangian subspaces R” x 
{0} and {0} x R” of R?” define a symplectomorphism between the canonical 
symplectic vector space (R7”, wo) and the vector space R” @ (IR”)* endowed 
with the natural symplectic form (7.1.5). By composing « with this symplecto- 
morphism, we obtain a chart taking values in R” @ (R")*. Motivated by applica- 
tions in physics, we denote the coordinates corresponding to the factors R” and 
(R")* by q and p;, respectively. Then, 


n 


MTU — So dpi A dq' =dp; AN dq' 


i=1 


3See Sect. 3.4 for the notation. 
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(if there is no danger of confusion, we will use the summation convention). The 
elements of the corresponding local frame in TM will be denoted by 0,: and dp,. 
Both types of charts will be referred to as Darboux charts and the corresponding 
coordinates as Darboux coordinates. 

2. Let (Mj, @1) and (M2, w2) be symplectic manifolds of the same dimension and 
let ©: M> — M, be a smooth mapping. In Darboux coordinates g', pj on My 
and q', p; on Mp, the condition on @ to be symplectic reads 


®* (dp; A dq‘) = dp; A dq’. (8.1.4) 
Using the simplified notation 
(qi oPox')=q', (pioPox')= pi, 


where « denotes the chart defined by g', p;, a straightforward calculation (Ex- 
ercise 8.1.1) yields that ® is symplectic, and hence a local symplectomorphism, 
iff 

dpi 9q' Api Bq’ _ 9 _ Api Aq' Api Aq! — Avi Aq’ AP: AG" _ 

Op] IPk OPK AB; ag! ag* = agk agi’ apj ag = agk pj 


3. Let q' and p; be Darboux coordinates on U and let j; be functions on U such 
that g' and pj; are Darboux coordinates, too. Consider the functions fj := pj — Pi 
on U. Taking q’ and p; as coordinates on U, we find 


Of; 
0=df; Adq' = aa 


Of; ; 
——dgq! Adq' + ies A dq'. 
q! Opj 
fi OF; 
dq) agi 
of equations is given by fj = aj og with smooth functions a; on g(U) Cc R” 
which combine to a closed 1-form aw = ajdx! on q(U), where x' denote the 
standard coordinates on R”. 


Hence, ie = 0 and 0 for alli, 7. The general solution of this system 
J 


The types of subspaces of a symplectic vector space given in Definition 7.2.2 
carry over to submanifolds of a symplectic manifold: 


Definition 8.1.7 Let (M, w) be a symplectic manifold. An immersion g: N > M 
is called isotropic (coisotropic, Lagrangian or symplectic) at p € N iff g’T,N is an 
isotropic (coisotropic, Lagrangian or symplectic) subspace of (Ty(p)M, @g(p)). If 
this is true for all points p € N, the pair (N, ¢@) is called an isotropic (coisotropic, 
Lagrangian or symplectic) immersion. 


These notions apply, in particular, to the case when (N, g) is a submanifold of M. 
Since the pull-back g*@ of the symplectic form to the submanifold (N, g) is the 
counterpart of the restriction ww to a subspace W of the symplectic vector space 
(V, @), Proposition 7.2.3 yields: the submanifold (NV, @) is 
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(a) isotropic iff y*w = 0, 
(b) Lagrange iff y*w@ = 0 and 2dim N = dim M, 
(c) symplectic iff g*w is non-degenerate. 


According to Remark 1.6.2/1, we may assume that N is given by a subset of M 
and that ¢ is the natural inclusion mapping. In this case, from (7.2.2) and (7.2.5) we 
obtain 


ker(g*w) , =TpN A (TpN)®, (8.1.5) 


max(0, 2dim N — dim M) < rank(p*a) , <dimJN, (8.1.6) 


and the statements of Proposition 7.2.4 carry over pointwise. In particular, a sub- 
manifold of dimension | (codimension 1) is always isotropic (coisotropic) and every 
submanifold contained in an isotropic submanifold is isotropic. A submanifold con- 
taining a coisotropic submanifold need not be coisotropic though, because Proposi- 
tion 7.2.4/4 does not apply to points outside the latter. 

Finally, consider a vector subbundle E C TM over a submanifold N Cc M. 
The symplectic orthogonal of E is defined by E® := UJ peN BS. This is a sub- 
bundle of TM over the submanifold N, because according to Proposition 7.2.1/1, 
it is mapped to the annihilator E° under the vertical vector bundle isomorphism 
w : TM — T*M. The vector subbundle E is called isotropic (coisotropic, La- 
grangian or symplectic), if every fibre is isotropic (coisotropic, Lagrangian or sym- 
plectic), that is, if, respectively, EC E®, E? CE, EY =E or EN E® =s0(N), 
where so denotes the zero section of TM. Thus, a submanifold N C M is isotropic, 
coisotropic, Lagrangian or symplectic iff so is the vector subbundle TN Cc TM. In 
this case, the symplectic orthogonal (TN)® of TN in TM is referred to as the sym- 
plectic normal bundle of the submanifold NV. 


Lemma 8.1.8 Let (M,w) be a symplectic manifold. Every Lagrangian subbundle 
E CTM over a submanifold N C M admits a Lagrangian complement, that is, a 
Lagrangian subbundle E C TM over N such that E® E=TMyy. 


Proof The proof follows [305], Lecture 2. Choose an auxiliary Riemannian metric g 
on M. For every m € M, the mapping F',, : S34 (T*,M) > J (TmM, wm) of Propo- 
sition 7.5.6 assigns to gj) an @,-compatible complex structure J, on T,,M. Since 
the endomorphism associated with w,, via gm is bijective, its absolute square with 
respect to g, is strictly positive. Therefore, the square root and hence J, depend 
smoothly on m. As a consequence, the J,,, combine to a vertical vector bundle mor- 
phism J of TM satisfying J ? — —idyy.* It follows that J(E) is a vector subbundle 


of TM over N. By Proposition 7.5.3/2, it is Lagrange and complementary to E. 


4A vertical vector bundle morphism of TM with this property is called an almost complex structure 
on M. 
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Exercises 

8.1.1 Prove point 2 of Remark 8.1.6. 

8.1.2 Do T?, S? x S! or CP? admit a symplectic structure? 

8.1.3 Show that on a symplectic manifold (M,q@) of dimension greater than 2, 
a function multiple f@ of w is symplectic iff f is nonzero and locally con- 
stant. 


8.2 Hamiltonian Vector Fields and Poisson Structures 


Via the exterior derivative and the natural isomorphism w* = (w’)-!:T*M > TM, 
one can assign a vector field to every smooth function on M. This assignment is 
basic for the theory of Hamiltonian systems, see Chap. 9. 


Definition 8.2.1 (Hamiltonian vector field) Let (M,q@) be a symplectic manifold 
and let f € C°(M). The vector field 


Xp :=—(df)* 
is called the Hamiltonian vector field generated by /f. 


The flow of X ¢ is referred to as the Hamiltonian flow generated by f. The defi- 
nition of X ¢ is equivalent to 
Xfio=—df. (8.2.1) 
In particular, the 1-form (X fy is exact. 
Remark 8.2.2 Let us write down X ¢ in local Darboux coordinates qi and p;. For 
that purpose, we decompose X ¢ = Aj0p;, + B'd,i with respect to the corresponding 


local frame (Ogi; dp;) and determine the coefficients A;, B’ from (8.2.1), which in 
coordinates reads 


(Aj dp, + B',1) s(dpj A dq!) = Ajdq' — B'dp; = —0,i fdq' — dp, fdp;. 
Thus, 
Xp = (8p; f)dgi — (gi fp;- (8.2.2) 


Besides the Hamiltonian vector fields, the infinitesimal symmetries of the sym- 
plectic structure are important: 


Definition 8.2.3 (Symplectic vector field) A vector field X on (M, w) is called sym- 
plectic if is invariant under the flow of X. 


Proposition 8.2.4 Let (M, w) be a symplectic manifold and let X be a vector field 
on M. The following statements are equivalent: 
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. X is symplectic. 

. LYw = 0. 

. X? is closed. 

. X is locally Hamiltonian, that is, for every m € M there exists a function f ona 
neighbourhood U of m such that X}y = Xf. 


BRWN re 


Proof The equivalence of points 2 and 3 follows from 
Lxyw = X sda + d(X1@) = dX?. 


To see that point 2 implies point 1, we apply (4.1.28) to obtain 


d 

© (®%0),, = (85(Lx0), 

for all s between 0 and ¢ and all m in the domain of @;. Due to D5 = id, integration 
from 0 to t yields the assertion. The remaining statements are obvious. 


Remark 8.2.5 


1. According to (8.2.1) and Proposition 8.2.4/3, every Hamiltonian vector field is 
symplectic. 

2. Definition 8.2.3 carries over to time-dependent vector fields by requiring that 
®;, © =o for all pairs (1), f2). Then, Proposition 8.2.4 holds with X replaced 
by X;. 


We denote the set of Hamiltonian vector fields on (M, w) by Xq(M, w) and the 
set of symplectic vector fields by X,y(M, w).° 


Proposition 8.2.6 Let (M, w) be a symplectic manifold. 


1. X~H(M, @) is a Lie subalgebra of X(M). 
2. For Y,Z € Xiy(M, w), we have 


LY, Z] = Xavy,z)- (8.2.3) 
In particular, Xy(M, @) is an ideal in XL y(M, o). 


Proof 1. According to Proposition 3.3.3, we have Yy yjw = (Lx, Ly]. 
2. Using Yyw = 0 and d(Z iw) = 0, we find 


[Y, Z]sw = Ly (Zw) — Zs Lyw = Y sd(Zsw) + d(¥ s(Z1)) = —d((Y, Z)). 


5 With the index LH standing for locally Hamiltonian. 
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Remark 8.2.7 Viewing w as a mapping from X(M) to 2!(M), we obtain vector 
space isomorphisms 


Xy(M,o)=B'(M), — X_H(M,@) = Z'(M) 
and hence 
H'(M) = Xpy(M, o)/Xy(M, @). 
Thus, Xq(M, w) = XLH(M, o) iff the first de Rham cohomology of M is trivial. 


Example 8.2.8 Consider the torus T? with the angle coordinates x and y and sym- 
plectic form w = dx A dy. Let 


X = ad, + bdy 


be a vector field with constant coefficients a,b 4 0. Since X° = ady — bdx, we 
have dX? = 0 and, therefore, X € XLy (T?, @). However, as a nowhere vanishing 
vector field on a compact manifold, X cannot be Hamiltonian, because every smooth 
function on M has a maximum and a minimum and hence critical points. 


Symplectic mappings can be characterized in terms of Hamiltonian vector fields: 


Proposition 8.2.9 A smooth mapping ® : M — N of symplectic manifolds (M, w) 
and (N, p) is symplectic iff it is an immersion and the Hamiltonian vector fields 
Xo» on M and Xf on N are ®-related for all f € C®(N). In particular, a dif- 
feomorphism ® : M — N is a symplectomorphism iff 


@,Xn = Xjo0-1 (8.2.4) 


forallheC®(M). 


Proof Since every symplectic mapping is an immersion, we have to show that for 
an immersion ®, * p = w is equivalent to the condition that 


®'o Xoxf=Xfo@ 


for all f € C°(N). By taking the interior product of this equation with p and eval- 
uating both sides on ®’(Y), where Y € T,, M, we find 


(O*p),, Xr p,¥) = (X pp) om (PY). 
Hence, we have to show that * = w is equivalent to 
Xp» ¢s(®*p) = —d(* f) (8.2.5) 


for all f € C*(N). If &* p =a, then (8.2.5) holds by definition of X@*¢. Con- 
versely, if (8.2.5) holds, then Xg* ¢i(®*p) = X@* sw for all f € C*(N). Since 
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@ is an immersion, the mapping (p/ yt a an > T*,M is surjective for every 
m € M, hence every covector at m is of the form d(@* f) for some f € C®(N). 
Accordingly, every tangent vector at m is of the form Xg«¢ for some f € C*(N). 
Hence, (8.2.5) implies ®*o = w. Finally, if ® is a diffeomorphism, by setting 


h= f o@7' in the defining equation of ®-relation we obtain (8.2.4). 


Since for a symplectic vector field X with flow ®, the mappings 
(D,)i,, : TnM > To,(my)M 
are linear symplectomorphisms, the Symplectic Eigenvalue Theorem 7.4.3 yields 


Proposition 8.2.10 Let (M, w) be a symplectic manifold and let X € XLy(M). 


1. If m is an equilibrium of X and if is a characteristic exponent of m with 
multiplicity k, then jt and — are also characteristic exponents of m with multi- 
plicity k. The multiplicity of the characteristic exponent 0 is even. 

2. If y is a periodic integral curve of X and if x is a characteristic multiplier of 
y with multiplicity k, then 2 and 27! are also characteristic multipliers of y 
with multiplicity k. The multiplicity of the characteristic multiplier | is odd and 
nonzero. 


Proof 1. The characteristic exponents of m are given by the eigenvalues of the Hes- 
sian endomorphism Hess,, (X) = 77 to (P;)),,- Since Hess,, (X) is an element of the 
symplectic Lie algebra sp(T;,M, @m), the assertion follows from point 2 of the 
Symplectic Eigenvalue Theorem 7.4.3 and from Corollary 7.4.4/2. 

2. Let T be the period of y and let m € y. By Remark 3.6.11/1, the character- 
istic multipliers of y are given by the eigenvalues of (®7)/,,, with the eigenvalue 
1 corresponding to the eigenspace T,, y omitted. Since (@7)j,, is an element of the 
symplectic group Sp(T,,M, @m), the assertion follows from point | of the Symplec- 
tic Eigenvalue Theorem 7.4.3 and from Corollary 7.4.4/1. 


Remark 8.2.11 


1. The characteristic multipliers of y, with the multiplicity of the characteristic 
multiplier 1 reduced by 1, are called the Floquet multipliers of y. It will be 
shown in Sect. 9.5 that the Floquet multipliers coincide with the eigenvalues of 
the tangent mapping of an isoenergetic Poincaré mapping, see Remark 9.5.2/2. 

2. A subset of the set of characteristic exponents of an equilibrium m (of the set 
of Floquet multipliers of a periodic integral curve y), counted with multiplici- 
ties, is called a basis set for m (for y) if it contains one member of each of the 
pairs (j2, /£) (the pairs (A, n)). By construction, a basis set has n elements (n — 1 
elements), where 2n = dim M. 


Next, let us study the surjective mapping 


C°(M) 3 fie Xp € Xy(M, a). 
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Since Xyq(M,q@) carries a natural Lie algebra structure, it is reasonable to ask 
whether there is a Lie algebra structure on C°(M) which makes this mapping into 
a homomorphism. Such a structure exists, indeed: 


Definition 8.2.12 (Poisson bracket) Let (M,w) be a symplectic manifold and let 
f,heC® (mM). The function 


{f, h} = @(X¢, Xn) 


is called the Poisson bracket of f and h. 


Using w(X ¢, Xg) = —X ¢a(Xgiw) = X fi(dg) = X ¢(g), the Poisson bracket 
can be rewritten as 


{f, gs} =X ¢(g) =—Xe(f). (8.2.6) 


Proposition 8.2.13 Let (M, w) be a symplectic manifold. 


1. The Poisson bracket defines on C®(M) the structure of a Lie algebra. With re- 
spect to this structure, the mapping f +> X ¢ is a Lie algebra homomorphism. 
2. The Poisson bracket satisfies the Leibniz rule, that is, for all f, g,héC~°(M), 


{fsht=thsih+ thas. 


Proof 1. Obviously, the Poisson bracket is bilinear and antisymmetric and the as- 
signment f +> Xf is linear. Thus, it is enough to show that the Poisson bracket 
fulfils the Jacobi identity and that [X ¢, X,] = X{f,2} holds for all f, g ¢ C°(M). 
The latter follows from (8.2.3). Using this and (8.2.6), we verify the Jacobi identity: 


{f.{g, a}} + {gh fh} + {h, {Ff 3h} 
=X ¢(Xg(h)) — Xg(X¢) — Xip.9)() =0. 


2. This follows from the fact that vector fields are derivations of C°(M). 


Remark 8.2.14 


1. Due to Proposition 3.2.15 and the fact that the mapping f +> Xf is a Lie alge- 
bra homomorphism, two functions on M Poisson-commute iff the flows of their 
Hamiltonian vector fields commute. 

2. Let us write down the Poisson bracket in a Darboux chart « with coordinates q! 
and p;. Using (8.2.6) and (8.2.2), we find 


(f, 8} = X ¢(8) = (9p; A) (Ogi 8) — (Ogi f) (9p; 8)- 


Using the simplified notation f = f o«~!, we arrive at the standard formula 


af dg Of dg 


a eae (8.2.7) 
dpi Og dq’ Opi 


{fgh= 
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In particular, the Poisson brackets of the coordinate functions are 
{q',.g' }=0, {pi. pj} =9, lpg?) = 37. 


Symplectic mappings can be characterized in terms of the Poisson bracket as 
well: 


Proposition 8.2.15 A smooth mapping ® : M — N of symplectic manifolds (M, w) 
and (N, p) is symplectic iff it is an immersion and for all f, g € C~(N) one has 


{o* f, O*g} = O*{ f, g}. (8.2.8) 
Proof Due to 
{P* f, B*g}(m) = (P'(Xoxp)m)(g), — (®*{F, gh) n) = (Xp) oom), 
the condition that {®* f, *g} = &*{ f, g} forall f, g ¢ C°(N) is equivalent to the 


condition that the vector fields Xg«r and X ¢ are @-related for all f ¢C %(N), 
Hence, the assertion follows from Proposition 8.2.9. 


In the remaining part of this section we digress from symplectic manifolds to 
give a brief introduction to the more general notion of Poisson structure. For an ex- 
haustive treatment of Poisson manifolds, we refer to the books of Vaisman [295] and 
Waldmann [301], where the reader can find a lot of further references. Pioneering 
work on this subject goes back to Sophus Lie [182-184]. 


Definition 8.2.16 (Poisson manifold) 


1. A Poisson structure on a manifold M is a Lie algebra structure {, } on C°(M) 
fulfilling the Leibniz rule {f, gh} = g{ f,h} + {f, g}A for all f,g,h EeC™(M). 
The pair (M, {, }) is called a Poisson manifold. 

2. A smooth mapping ®: M — N of Poisson manifolds is called Poisson, or a 
Poisson morphism, if &*{ f, g} = {* f, &* g} forall f,g eC™°(N). 


According to Propositions 8.2.13 and 8.2.15, every symplectic manifold is Pois- 
son and a smooth mapping of symplectic manifolds is symplectic iff it is Poisson 
and an immersion. The Leibniz rule implies that for every f € C°(M), the map- 
ping {f, -}: C°°(M) > C™(M) is a derivation and hence defines a vector field X y, 
called the Hamiltonian vector field of f. Thus, 


X (sg) ={f, 8} =—-Xe(f) (8.2.9) 
for all f, g € C~(M). The Jacobi identity implies 


[X¢, Xel= Xi pg), (8.2.10) 
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hence the mapping f +> X ¢ is a Lie algebra homomorphism. Moreover, from 


{f, g}(m) = (Xp)ms (dg) m) = —((Xe)m; (df) m) 


we read off that the value of {f, g} at m depends on (df), and (dg), only and 
that this dependence is linear. Since every element of T7,M can be written as 
df(m) for some f € C~(M), it follows that there exists a unique bivector field 
ITeé ese T M) such that 


{f, gt =f, dg) (8.2.11) 


for all f, g € C~(M). IT is called the Poisson tensor of (M, {, }). The Jacobi identity 
implies 


Le 1 =0 (8.2.12) 


for all f € C°(M). The Poisson tensor J7 defines a vertical vector bundle mor- 
phism 


TT": T*M > TM, (B, 117 (a)) := IT (a, B), (8.2.13) 
fulfilling 
TT odf =X; (8.2.14) 


for all f € C°(M). The rank of (M, {, }) at m is defined to be the rank of ae. Since 
TT is antisymmetric, this is an even number. For a symplectic manifold, 77 = w” and 
TT is the image of @ under the isomorphism w* A w!: 22(M) > re TM). 

Poisson morphisms can be characterized in terms of the Poisson tensor as fol- 
lows. From (8.2.11) we read off that a diffeomorphism @: M — M is Poisson iff 
@,11 = IT. More generally, for a smooth mapping ®: M — N of Poisson mani- 
folds one finds that it is Poisson iff the corresponding Poisson tensors are ®-related® 
(Exercise 8.2.1). 


Remark 8.2.17 


1. Ina local chart (U,«) on M we have 
1. i 
T= Bit; A Oj, {f, g} = 11 0; fajg. (8.2.15) 


In particular, 


Ti ={xie},  (fpgh= fli la; faje. 


The notion of ®-relation extends in an obvious way from vector fields to multivector fields, cf. 
Definition 2.3.6/1. 
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2. 


By Formula (8.2.11), every bivector field JT on M defines a bilinear antisymmet- 
ric mapping {,}: C°(M) x C®(M) > C®(M) which fulfils the Leibniz rule. 
It satisfies the Jacobi identity, and hence yields a Poisson structure, iff 


IT(df, d(T (dg, dh))) + IT(dg, d(I7(dh, df))) + (dh, d(I7(df, dg))) =0 
(8.2.16) 
for all f, g,h € C~(M). In local coordinates, this condition reads 


TT", T17* + 174" 0,11 + 1a, 174 =0 


(Exercise 8.2.2). Thus, a Poisson manifold may as well be viewed as a pair 
(M, IT), where /T is a bivector field fulfilling (8.2.16). 


Example 8.2.18 


ile 


2. 


Every manifold M can be equipped with the trivial Poisson structure { f, g} = 0 
for all f, g € C~(M). In this case, 7 = 0. 

Let V be areal vector space. Every element IT € A7(V) defines a constant bivec- 
tor field on V. This bivector field satisfies (8.2.16) and thus defines a Poisson 
structure (Exercise 8.2.3). 


. Consider the dual space V* of areal vector space V. Recall that under the natural 


identification of the tangent spaces of V* with V* itself, every bivector field [7 
on V* corresponds to a smooth mapping V* > Ke V*. JT is called linear if this 
mapping is linear. In this case, for every v, w € V, the function € +> ITg(v, w) 
is linear and hence corresponds to an element of V. By assigning this element to 
the pair (v, w) we obtain an antisymmetric bilinear mapping [,]: V x V > V. 
Thus, [v, w] is defined by 


(€, [v, w]) = Te(v, w) (8.2.17) 


for all € € V*. Conversely, if an antisymmetric bilinear mapping [, ]: V x V > 
V is given, (8.2.17) defines a linear bivector field JT on V*. Let us analyze the 
condition (8.2.16) on JT in terms of [, ]. First, we observe that (8.2.16) holds 
iff it holds for all linear functions f, g,4 on V*. For linear functions, we have 
df = f, and the linearity of 17 implies dU7(df, dg)) = T(f, g), where f and g 
are viewed as elements of V. It follows that /T satisfies (8.2.16) iff [, ] satisfies the 
Jacobi identity. In this case, the corresponding Poisson bracket of f, g € C~(V*) 
is given by 


{f, shu) =(u, [df (u), dg(u)]), (8.2.18) 


where df () and dg(z) are interpreted as elements of (V*)* = V. 

Thus, the Poisson structures on V* whose Poisson tensor is linear are in bi- 
jective correspondence with Lie algebra structures on V. In particular, (8.2.17) 
defines a natural Poisson structure on the dual space g* of a Lie algebra g. This 
structure is usually referred to as the Lie-Poisson structure on g*. In the coordi- 
nates &; defined by a basis in g*, the Poisson tensor and the Poisson bracket of 
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this structure are given by 


1 0 0 of dg 
TT = —&, onan eae F = ko nY ar A 8.2.19 
(§) = 5 8kCij ag DE, (f, 8G) = Eke; 36 Gag. (),  ( ) 
where ee are the structure constants of g with respect to the dual basis in g. 


4. We know that every volume form w on a two-dimensional manifold M is sym- 
plectic. Let IT, be the corresponding Poisson tensor. Then, every bivector field 
is of the form fT, for some f € C~(M) and f JT, defines a Poisson structure 
(Exercise 8.2.4). In contrast, f@ is symplectic only if f is nowhere vanishing. 


Definition 8.2.19 (Poisson vector field) A vector field X on a Poisson manifold is 
called Poisson iff the Poisson structure is invariant under the flow of X. 


Equivalently, X is Poisson iff “x JT = 0. This follows by the same argument as 
the corresponding assertion about symplectic vector fields in Proposition 8.2.4. As a 
consequence, (8.2.12) implies that every Hamiltonian vector field is Poisson. In the 
special case of a symplectic manifold, a vector field is Poisson iff it is symplectic. 
In contrast to Proposition 8.2.4/4, however, on a Poisson manifold a Poisson vector 
field need not be locally Hamiltonian. For example, for the trivial Poisson structure, 
every vector field is Poisson, but only the zero vector field is Hamiltonian. 

We conclude this section by showing that every Poisson manifold is foliated by 
symplectic manifolds. For that purpose, consider the characteristic distribution 


D" := I1*(T*M) CTM 


of (M, {,}). By (8.2.14), this distribution is spanned by the Hamiltonian vector 
fields. For every m € M, the Poisson tensor [7 induces an antisymmetric 2-form 
@m On pt by 


On(X, Y) := In (@, B), (8.2.20) 


where a, 8 € T*M such that X = TT*(a) and Y = IT*(B). The reader easily con- 
vinces himself that wm is well-defined and non-degenerate (Exercise 8.2.5). Thus, at 
each point m we obtain a symplectic vector space (DE ,@m). 


Theorem 8.2.20 (Symplectic Foliation Theorem) The characteristic distribution of 
a Poisson manifold (M, IT) is integrable. On every integral manifold N, there exists 
a unique symplectic form wy such that the natural inclusion mapping is Poisson. 
This form is given by 


(On)m(Xf,Xe)={f.gh(m), meN, f,geC™(M). (8.2.21) 


The resulting foliation by maximal integral manifolds is called the symplectic 
foliation of (M, {, }) and the maximal integral manifolds are called the symplectic 
leaves of (M, {, }). By construction, the Hamiltonian vector fields are tangent to the 
symplectic leaves. 
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Proof According to (8.2.10), the set of Hamiltonian vector fields is involutive. Since 
every Hamiltonian vector field X ¢ is Poisson, its flow preserves JT. It follows that 
the rank of I7 and hence the dimension of D” is constant along the integral curves 
of X ¢. Thus, Theorem 3.5.10 implies integrability. Let N be an integral manifold. 
For every m € N, the form (@y)m coincides with the form defined in (8.2.20). 
Hence, it is well-defined and non-degenerate. It remains to prove that w is closed. 
Since X ¢ is tangent to N, it induces a vector field x f on N. Using Proposition 4.1.6, 
together with Proposition 3.1.5/1 and Corollary 3.1.6, and the Jacobi identity for 
{, }, one finds dwy (X ¢, Xz, X,) =0 for all f, g,h € C©(M) (Exercise 8.2.6). This 
yields the assertion. 


Example 8.2.21 


1. For the trivial Poisson structure { f, g} = 0 on M, the symplectic foliation is given 
by the points of M. 

2. In Sect. 8.4 it will be shown that the symplectic leaves N of the Poisson structure 
on the dual space g* of a Lie algebra g, discussed in Example 8.2.18/3, coincide 
with the connected components of the coadjoint orbits and that the symplectic 
form wy is given by the Kirillov form. 


Theorem 8.2.20 can be used to prove the following generalization of the Darboux 
Theorem to Poisson manifolds. 


Theorem 8.2.22 (Splitting Theorem) Let (M, IT) be a Poisson manifold of dimen- 
sion k, let m € M and assume that IT has rank 2r at m. Then, there exists a local 
chart (U, K) at m with coordinates yi. ees Gy Ply ees Pro zis... ck-2" such that 


Thy = Ogi A Op, + 5u (z)0,1 A 0,;, x4 (0) =0. (8.2.22) 


In these coordinates, the symplectic leaf through m is described by the equations 


glaz?=--- =z?" =0 and q', p; provide Darboux coordinates on this leaf. 


Proof See [181], Theorem II.11.5, or [310]. 


Remark 8.2.23 


1. The Splitting Theorem contains the following local decomposition property: let 
(M, IT) be a Poisson manifold of dimension k and rank 2r at the point m. Then, 
there exists a neighbourhood U of m which can be identified with the Cartesian 
product of a symplectic manifold V of dimension 2r and a Poisson manifold W 
of dimension k — 2r whose rank vanishes at m. It was shown by Weinstein [310] 
that up to an isomorphism the Poisson structure on W does not depend on the 
chosen local coordinates. 

2. A smooth function f on M is called a Casimir function for {, } if X = 0. The 
set of Casimir functions is called the Poisson centre of (M, {, }). Indeed, this is 
the centre of the Lie algebra (C™(M), {, }). One can show that in the case of 
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a symplectic manifold, every Casimir function is constant (Exercise 8.2.7) and 
that, in the general case, the restriction of a Casimir function to a symplectic leaf 
is constant. 


Exercises 

8.2.1 Show that a smooth mapping ®: M — N is Poisson iff the Poisson tensors 
of M and N are @-related. 

8.2.2 Prove the statements of Remark 8.2.17/2. 

8.2.3 Prove that every constant bivector field on a real vector space defines a Pois- 
son structure. 

8.2.4 Prove the statement of Example 8.2.18/4. 

8.2.5 Show that the 2-form defined by (8.2.20) is well-defined and non-degenerate. 

8.2.6 Complete the proof of Theorem 8.2.20 by showing that the 2-form wy is 
closed. 

8.2.7 Prove that on a symplectic manifold, every Casimir function is constant. 


8.3 The Cotangent Bundle 


The cotangent bundle T* Q of a manifold Q provides the basic model of a symplec- 
tic manifold. It has the following special properties. 


(a) It carries a canonical symplectic potential. 

(b) Every diffeomorphism of Q lifts to a symplectomorphism of T* Q. 

(c) It contains a variety of Lagrangian submanifolds related to the bundle structure. 

(d) Since its fibres carry a natural affine structure, every 1-form on Q induces a 
vertical vector field on T* Q. 


In what follows, let 2: T*Q — Q be the canonical projection. As already men- 
tioned, in Hamiltonian mechanics, T* Q is the phase space of a system with configu- 
ration space Q. Consequently, for the local description, we use the standard notation 
taken from physics: the coordinates of a local chart (U,«) on Q are denoted by q’. 
For the local frame {0*} in TQ we write {0} and for the local coframe {d«‘} in 
T* Q we write {dq‘}. The chart « induces a local chart «rt» on T*U by 


xt (&) = (q' (a (&)),-..,9"(7&)), pi), ---. Pn), 


where the coordinate functions p; are defined as the components of € € T*U with 
respect to the coframe {dg’}: 


E = pi(é)dq'. 


Points of ««(T*U) will be written as pairs (q, p). 
First, we show that T* Q carries a natural exact symplectic structure. 
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Definition 8.3.1 (Canonical 1-form) The canonical 1-form 9, or Liouville form, on 
T* Q is defined by 


(0g, X) :=(E, 14 (X)), 
where € € T*Q and X €T;(T*Q). 


We show that the 2-form w := dé is a symplectic form on T*Q. Obviously, 
dw = 0. To show that w is non-degenerate, we calculate 0 and w in bundle coor- 
dinates g' and p;. The ansatz 6 = a;dq' + B'dp; and X = A‘d,: + B;dp, yields the 
defining equation 


aj A! + B'B; = pjA' 
for all A’, B; € R, which is solved by a; = p; and Bi = 0. Thus, we obtain 
6=pidg' and w=dp; dq’. (8.3.1) 
In particular, w is non-degenerate, indeed. 


Definition 8.3.2 The form w = dé is called the canonical symplectic form of T* Q. 


According to (8.3.1), for every chart on Q, the induced chart on T*Q is a Dar- 
boux chart for the canonical symplectic structure. 


Remark 8.3.3 
1. The canonical 1-form @ is the unique 1-form on T* Q with the property that 
a“od=a (8.3.2) 
for all 1-forms w on Q (Exercise 8.3.1). As a consequence, 
a*w= da. (8.3.3) 


2. Since the covectors dg’ form a basis in T* Q and the vectors d,i form a basis in 
T, Q, the assignment 0, +> ji, Ip, dq’ defines a vector space isomorphism 
Tz(T*Q) > T, O ®TY Q. Since w” maps d,i to —dp;, this isomorphism is sym- 
plectic with respect to the canonical symplectic structure (7.1.5) on T,; O@® TF Q. 

3. The isomorphism w* assigns to @ a vector field 0° on T* Q, called the canonical 
vertical vector field or Liouville vector field. By definition, 


Oa =0. (8.3.4) 


In bundle coordinates, 6% reads 64 = Pi9p;, Which shows that it is vertical, in- 
deed. Thus, 6°.@ = 0, so that (4.1.24) implies 


Lo:0 =0, LyiW=w. (8.3.5) 
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Example 8.3.4 The special case of the cotangent bundle T*G of a Lie group G is 
of particular interest. According to Proposition 5.1.6, both TG and T*G are trivial. 
We use the (inverse) trivialization 


xL:Gxg oT*G, vi asp) i= po ya), (8.3.6) 


and write tangent vectors of G x g* at (a, w) in the form (L/,A, p) with A € g and 
p€g*. Then, 


(X09) (ap) (Las P) = Pn ca.u) (XL (LaA: p)) = (Ho Lae) (7! © xL(Li A, #))- 
Since z o xz is the projection to the first component, we obtain 
(X09) ay) (La: p) = (ul, A). (8.3.7) 


Then, using Proposition 4.1.6 with left-invariant vector fields A, B on G and con- 
stant vector fields p, 0 on g*, we calculate (Exercise 8.3.2) 


oa,u)((L,4, p); (LB, o)) = (p, B) — (a, A) — (uw, [A, B]}. (8.3.8) 


Next, we show that the canonical 1-form is invariant under the lift of diffeo- 
morphisms from Q to T*Q. This yields a special class of canonical transforma- 
tions, called point transformations. Recall from Sect. 2.4 that every diffeomorphism 
gy: OQ > Q induces a vector bundle isomorphism gy’? : T*Q > T*Q projecting to 
gy '. Let ® be the inverse of this isomorphism. ® is determined by the conditions 
mzo®=gon and 


(®(&), y'X) =(E, X) (8.3.9) 


for all € ¢ T¥Q, X € T, Q and x € Q. On the level of 1-forms a on Q, (8.3.9) can 
be rewritten as 


D0 (y*a)=ao09. (8.3.10) 


Definition 8.3.5 (Point transformation) The vector bundle isomorphism © = 
(y’')—! is called the point transformation induced by 9. 


It turns out that point transformations can be characterized by the property that 
they leave the canonical 1-form @ invariant: 


Proposition 8.3.6 A fibre-preserving diffeomorphism ®: T*Q — T*Q is a point 
transformation iff 6*0 = 6. In particular, every point transformation is a symplec- 
tomorphism. 


Proof Since ® preserves the fibres, there is a unique diffeomorphism g: Q > Q 
such that 7 o ® = yom. Foré € T*Q and X € Tz (I* Q) we compute 


(*6),(X) =(©), (' 0 8')(X)) 
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= (G(E), (g’ om’) (X)) 
=(p'"(®()), 2(X)). 


Since 6¢(X) = (&, 2/(X)) and since ’ is surjective, this implies that ®*6 = @ iff 
glo@= idy+g, that is, iff @ is the point transformation induced by g. 


Remark 8.3.7 Let g be a diffeomorphism of a manifold Q and let @ be the induced 
point transformation. We compute the local representative of ® with respect to local 
charts (U,«) and (V, ¢) on Q with induced bundle coordinates q'; pi and q's Dis 
respectively. The diffeomorphism ¢ is locally given by the functions 
qi ogo er. 

Moreover, for € € T* Q, the natural fibre coordinates p;(&) and Di (®(E)) are deter- 
mined by the equations € = p;(&)dq' and ®(€) = p;(®(é))dq', respectively. Using 
(8.3.9), we obtain 


a(q' og !op7') - 
®(E) = pilE) aq] ((o. 0 y)(x(E)))dg/ 
Using the simplified notation 
qgopow'=q, qog lop '=q', 


we obtain the following local formula for ®: 


agi 
(q, p) (a. Ping “(a)), 


This formula can be interpreted both actively (a point transformation induced by 
a diffeomorphism) and passively (the change of bundle coordinates induced by a 
change of coordinates on Q). 


Now, we turn to the discussion of Lagrangian submanifolds of cotangent bundles. 


Example 8.3.8 The cotangent bundle T*Q contains the following classes of La- 
grangian submanifolds: 


1. The fibres of T*Q: let x € Q and let i: T}Q — T*@Q be the natural inclu- 
sion mapping. Since T* Q has half the dimension of T*Q, it is enough to show 
isotropy. For € ¢ TQ and X € Tz (T% Q), we find 


('*0).(X) = Gig) (i'X) = (7), 2! 0! (X)) = 


Thus, i*@ = 0 and hence i*w = 0. In bundle coordinates the proof is even sim- 
pler: since q' is constant along T¥ Q, we obtain i* (dp; A dq') = 0. 
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2. The image of a closed 1-form: let a € 2 : (Q). Since a(Q) has half the dimension 
of T* Q, formula (8.3.3) implies that a(Q) is Lagrange iff da = 0. In particular, 
the zero section and the image of an exact 1-form a = dS is Lagrange. In the lat- 
ter case, S is called a generating function for the Lagrangian submanifold a(Q). 
In bundle coordinates g' and p;, we have dS = 3g . Hence, in coordinates, 
a(Q) consists of the points (q, p= a (q)). 

3. The conormal bundle of a submanifold: let N be a submanifold of Q and let 
(TN)? cT* Q be the conormal bundle, cf. Remark 2.7.18/1. Obviously, (TN ? 
has half the dimension of T* Q. Moreover, the canonical 1-form 6 vanishes on 
(TNY°, because 77’ maps vectors tangent to (TN )° to vectors tangent to N. Thus, 
(TN)° is Lagrange. Following Tulczyjew, we call it the canonical lift of N to 
T*Q and denote it by N. 

4. The canonical lift of the pair (V,S), where N C Q is a submanifold and S : 
N — Risa smooth function: define 


(NS) := {€ € (T*Q) y : (&, X) = (dS, X) for all X € Tyg) N}. 


One can check that j*6 = 2y,dS, with j being the natural inclusion mapping of 
(N, Ss S) and zy : (N, Ss S) — N denoting the restriction of the canonical projection 
(Exercise 8.3.3). This implies that (N, S) is Lagrange. Note that an obvious gen- 
eralization is obtained by replacing dS by an arbitrary closed 1-form. Also note 
that the special cases N = {x}, N = Q and S = const exhaust the Lagrangian 
submanifolds of points 1-3. 


From point | of Example 8.3.8 we get the following 


Proposition 8.3.9 The restriction (T*Q),n of T*Q to a submanifold N of Q is a 
coisotropic submanifold of T* Q. 


Proof For every & € (T*Q);y, the tangent space T;((T* Q);y) contains the sub- 
space T; (The) 2), which is Lagrange by Example 8.3.8/1. Hence, the assertion 
follows from Proposition 7.2.4/4. 


The following Proposition shows that point 2 of Example 8.3.8 (locally) exhausts 
the set of Lagrangian submanifolds in T* Q which are transversal to the fibres. 


Proposition 8.3.10 Let L c T*Q be a Lagrangian submanifold which is transver- 
sal to the fibres and which intersects every fibre at most once. Then, U := 1(L) is 
open in Q and there exists a closed \-forma on U such that L = a(U). If L is in ad- 
dition contractible, there exists a smooth function S: U + R such that L=dS(U). 


In accordance with the terminology of Example 8.3.8/2, S is called a generating 
function for L. For Lagrangian submanifolds which are not transversal one has to 
use the more general concept of generating Morse family, see Sect. 12.4. 
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Proof By transversality, the restriction 2+, : L — M is an immersion. Since L in- 
tersects every fibre at most once, zz, is injective. Hence, U is a submanifold of Q. 
Since it has the same dimension as Q, it is open and zy, is a diffeomorphism 
onto U. Thus, a := ip o (7 v)-', with i, denoting the natural inclusion mapping 
of L, is a local 1-form over U satisfying L = a(U). Since L is Lagrange, by Exam- 
ple 8.3.8/2, a is closed. If L is contractible, so is U. Thus, in this case, there exists 
a function §: U — R such that a = dS and hence L = dS(U). 


Finally, let us discuss the consequences of the fact that the fibres of T*Q carry 
a natural affine structure. For given x € Q, under the natural identification of the 
tangent spaces of the fibre T¥ Q with T* Q itself, every 7 € T¥ Q defines a constant 
vector field on T* Q, denoted by 7. Accordingly, every a € 2 '(Q) defines a vertical 
vector field & on T*Q by 


dito = &. (8.3.11) 


This vector field is complete and its flow is given by 

@*:Rx TO > T*O, P(t, €):=E + ta(w(E)). 
The flow induces a vertical affine transformation of T* Q (fibre translation) by 

Dy = OY :TO>T*Q, Pa(&) =E +a(m(E)). 
Since fibrewise addition is commutative, for arbitrary 1-forms a and f one has 

[@,B1=0, GyoOgp=PpoGy=Pgiy. 
One says that T* Q acts on itself fibrewise transitively. 
Proposition 8.3.11 For every a € 2'(M), 
@r0=047"*a, L50 =n a, daw = 1a. (8.3.12) 
Proof We have 
(230) .(X) = 60,(¢)(P,X) = (E +. a(x(E)), 2X) = (0+ m*a),(X) 


and hence 
d d 


LO = — ox o=— 0 *(t =m"*a. 
edt t=0 ‘dt al ick) aa 


Finally, since & is vertical, 7*a = %0 = @id6 + d(@0) = G10. 


Now, let a be closed. Then, the first equation in (8.3.12) implies that ®y is a 
symplectomorphism. Moreover, by Example 8.3.8/2, the image of a is a Lagrangian 
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submanifold of T*Q. Along this submanifold, the tangent spaces of T*Q decom- 
pose as 


Tacx) (T* Q) = Tax) (Te Q) @ a (T, Q) (8.3.13) 


into complementary Lagrangian subspaces. In applications, the special case of the 
zero 1-form is important. Its image is the zero section in T* Q, that is, in this case 
(8.3.13) is a decomposition of the tangent bundle into complementary Lagrangian 
subspaces along the zero section. The decomposition (8.3.13) induces an identifi- 
cation of the canonical symplectic structure on T* Q with the canonical symplectic 
structure on T; Q @ T* Q, given by (7.1.5). 


Proposition 8.3.12 For every x € Q, the mapping 

TrQ@TyO>Tas)(T*Q), (Kn) a'(X) + fax, (8.3.14) 
is a symplectomorphism. 
Proof Clearly, the mapping (8.3.14) is a vector space isomorphism. Thus, it is 


enough to show that it is symplectic: by the last relation in (8.3.12), for o € 2'(Q) 
we find 


Wax) (0"(X), Fa(x)) = —(F 40) a(x) (&'(X)) = —(7*0) (2! (X)) = 0x (X). 
Thus, since the subspaces in the decomposition (8.3.13) are Lagrangian, we obtain 


(a! (X) + tax), &/ (Y) + Ga(xy) = Tx(Y) — oy (X) = wows ((X, tx), (Y, x)), 


with W =T,Q. 


Remark 8.3.13 To see how Proposition 8.3.12 is related to Proposition 7.2.9, we 
identify Ty.) (T¥ Q) with T¥Q and set W = TiQ and W’ =a'(T,Q). Then, the 
isomorphism x : W’ — W* defined by (7.2.7) is given by 


x (a’(X)) =-X. 


This can be read off immediately from the above proof, using Gy.) = 6,. Note that 
the roles of W and W* have been interchanged here. 


Exercises 

8.3.1 Prove Remark 8.3.3/1. 

8.3.2 Verify Formulae (8.3.7) and (8.3.8). 

8.3.3 In Example 8.3.8/4, verify the formula j*@ = 1,58. 
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8.4 Coadjoint Orbits 


Let G be a Lie group, let g be its Lie algebra and let Ad* denote the coadjoint repre- 
sentation of G on the dual space g*. By a coadjoint orbit of G one means an orbit of 
this representation, viewed as an action of G on g*. According to Corollary 6.2.9, 
coadjoint orbits are initial submanifolds of g*. If G is compact, Corollary 6.3.5 im- 
plies that they are also closed and embedded. 

In this section, we will show that every coadjoint orbit @ of G carries a natural 
symplectic structure and that this structure makes it into a symplectic leaf of the 
natural Lie-Poisson structure of g*. Since G acts transitively on @, it is enough to 
define the desired symplectic form on the Killing vector fields of Ad*. According to 
(6.2.3), the value at 4 € @ of the Killing vector field generated by A € g is given by 


d 
Ax (ML) = = Ad*(exp(tA)) u = ad*(A)u. (8.4.1) 
0 


We read off that A,(u) = 0 iff (u,[A, B]) = 0 for all B € g. Thus, the following 
2-forms on @ are well-defined: 


wf (Ax, Be) = £(u,[A, B]), A,Beg, wed. (8.4.2) 


These forms will be called the positive and the negative Kirillov form, respectively. 


Theorem 8.4.1 (Kirillov) The 2-forms w~ are symplectic and G-invariant. 


This means that (@, w~, Ad*) are symplectic G-manifolds, cf. Remark 6.1.3 and 
Definition 8.6.2. 


Proof We give the proof for @ = w. First, we prove G-invariance. Using Proposi- 
tion 6.2.2/1, for uw € G@ and A, B € g we compute 


((Ad*(a))"«) (Ax, Bs) = @aar(ayu((Ad*(@)), Ax. (Ad* (a), Bs) 
= waa*(ayy((Ad(a)A),, (Ad(@)B),) 
= (Ad* (a), [Ad(a) A, Ad(a) B]) 
= (u, [A, B]) 
= Oy (Ax, Bx). 


Next, we show that w is closed: According to Proposition 4.1.6, 


dw(Ax, By, Cx) =A,(o(Bx, C.)) — By (w(Ax, c)) + C,(@ (As, B.)) 
_ o([Ax, By, Cy) + w([Ax, Cxl, Bx) _ o([By, Cxl, Ax). 
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By G-invariance, 24,0 = Lp, w = Lc,,w = 0. Hence, Propositions 3.3.2 and 3.3.3/3 
yield 


Ax (w(Bx, C,)) = LA, (w(Bx, C,)) = o([As, By), C.) i w(Bz, [Ax, Cul), 


and analogous formulae for B,.(@(Ax, C,)) and Cy (@(Ax, B,)). Using this, Propo- 
sition 6.2.2/2 and the Jacobi identity, we obtain 


(dw), (Ax, By, Cx) = oy (Bs, [Ax, C,]) — Ou (Ax, [Bx, C,]) + Ou (Cy, [Bx, Ax]) 
= (x, [B,[C, Al]) + (1, [A,[B, Cl]) + (z, [C, TA, BI) 
=0. 


Thus, w is closed, indeed. It remains to show that it is non-degenerate. Let u € C 
and A € g such that w,,(A,x, B,) =0 for all B € g. Since 


Wu (Ax, By) = (4, LA, B]) = (ad*(A)q, B), 


this implies ad* (A) = 0 and hence, by (8.4.1), Ax() = 0. 


Now, recall from Example 8.2.18/3 that g* carries a natural linear Poisson struc- 
ture, namely the Lie-Poisson structure, whose Poisson tensor and Poisson bracket 
are given by 


7,(A, B)=(u,[A, Bl), — {f, g}(w) =(u, [df a), dg(@)]), (8.4.3) 


HE pk OY pk DY 


respectively. Here, via the identifications T;,g" = g™ = g, the differential df (j.) is 
viewed as an element of g. We will show that the symplectic leaves of this Pois- 
son structure coincide with the connected components of the coadjoint orbits of 
G and that the symplectic structure induced on a leaf coincides with the Kirillov 
symplectic structure (8.4.2). For that purpose, for every A € g we define a function 


fa € C*(g*) by 
fa(H) = —(H, A). 
Lemma 8.4.2 Let A € g. 


1. The differential of f4 is given by dfa = —A. 
2. The Hamiltonian vector field generated by fa is given by X ¢, = Ax. 


Proof To determine d f4(j) for uw € g*, we choose a tangent vector o € T,,g* = g*, 
represent it by the curve tt» + to and compute 


d d 
(o,dfa(w)=< faetto)=—-< (u+to, A) =—(o, A). 
ro dt to 


dt 


This yields point 1. Using this, for h € C®(g*) we obtain 


(X fy ulh) = {fa hy) =(u, [dfa(w), dh(u)]) = —(u, [A, da(u))). 
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By (8.4.1), the right hand side equals (A,,),, (2). This yields point 2. 


Proposition 8.4.3 The symplectic leaves of the Poisson structure (8.4.3) coincide 
with the connected components of the coadjoint orbits and the symplectic form in- 
duced on each leaf coincides with the positive Kirillov form (8.4.2). 


Proof Since the value of the Hamiltonian vector field Xf at  € g* depends on 
df(t) only, the characteristic distribution of the Poisson structure [7 is spanned by 
the Hamiltonian vector fields X ¢,, A € g. Thus, by Lemma 8.4.2, this distribution 
coincides with the distribution spanned by the Killing vector fields of Ad*, so that 
Theorems 6.2.8 and 8.2.20 yield that the symplectic leaves coincide with the con- 
nected components of the coadjoint orbits. To prove the second statement, let C be 
a connected component of a coadjoint orbit and let wg denote the symplectic form 
induced on @ by the Poisson structure, cf. (8.2.20). Using this, Lemma 8.4.2/2 and 
(8.4.3), for uw € @ and A, B € g, we obtain 


wo(As, Bs)(U) = 06 (X fy, X fp) (WH) = (fa, fa}(w) = (u, [d fa), dfa())]). 


By Lemma 8.4.2/1, the right hand side equals (jz, [A, B]) = of (A,,, By). 


Remark 8.4.4 Recall from Remark 5.4.11/2 that in the special case when G is 
semisimple, the Killing form k defines an equivariant isomorphism F : g > g* by 


(F(A), B)=k(A, B), A, Beg. (8.4.4) 


By equivariance, F maps adjoint orbits a onto coadjoint orbits @. Since 6 and 0 
are initial submanifolds, the restriction F : @ + @, denoted by the same letter, is a 
diffeomorphism. Then, 


O- := F*@* 


are Ad-invariant symplectic forms on C.To compute them, let A, B, C € g and let 
B,, Cx denote the Killing vector fields generated by B and C under the adjoint 
representation. By (6.2.2), we have F,, B, = B,.. Using this, we find 


4 (Bs, Cx) = (F*0*) 4 (Bas Ca) = OF (4) (Bas Cx) = +(F (A), [B, C]) 


and hence 


@ (By, Cx) = +k(A, [B, C]). (8.4.5) 


If G is in addition linear and simple, then 


@ (By, Cx) = £ctr(ALB, C]), (8.4.6) 
where the factor c is given in Example 5.4.12. 


We conclude this section with a bunch of examples. In all of them, G is semisim- 
ple, so that it is enough to consider the orbits of the adjoint action. 
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Example 8.4.5 


1. G=SU(2): Since every element of su(2) can be diagonalized by means of an 
SU(2)-matrix, every orbit of the adjoint action of SU(2) contains a diagonal 
element. Since tr(A) = 0, the eigenvalues are of the form 1A and —iA with A € R. 
Since the set of eigenvalues is invariant under Ad, the orbits are labelled by A > 0 
and the orbit corresponding to 4 is given by 


{adiag(ia, —id)a* : a € SU(2)}. 


According to the Orbit Theorem 6.2.8, it is diffeomorphic to the homogeneous 
space of the right cosets in SU(2) of the stabilizer of diagGA, —iA) under Ad. 
There are two distinct types of orbits. If 7 > 0, the stabilizer consists of the 
diagonal matrices a = diag(a, a) with a € U(1). Hence, in this case, the orbit is 
diffeomorphic to the homogeneous space SU(2)/U(1) = U(2)/(UC) x UC)), 
which according to Example 5.7.6 is diffeomorphic to CP! = S*. In case 4 = 0, 
the stabilizer is SU(2) and the orbit consists of the origin alone. 

That the nonzero adjoint orbits are 2-spheres can be seen alternatively by 
the following more explicit argument. Recall from Example 5.4.7 that the basis 
{If, In i} in su(2), given in Example 5.2.8, defines a vector space isomorphism 
onto R* which is equivariant with respect to the representation of SU(2) on R? 
induced by the covering homomorphism SU(2) — SO(3) of Example 5.1.11. 
Thus, this isomorphism maps the nonzero adjoint orbits to the spheres S? of 
radius r > 0 in R3, indeed. We calculate the symplectic form for the nonzero ad- 
joint orbits: since [I°, IF] = 6,j*If, for A, B, C € su(2), Formula (8.4.5) yields 


04 (Bs, Cx) = £A' BIC" (IP, [IF Ty ]) = £4" BI C"e jn’ k(IP IP) 


and with k(IP, IP) = 4 tr(-I-) = —28;) we obtain 


4 (Bs, Cx) = 2A! BI C* ei jx. 


Since the tangent vector y of Ss? atx € = is the value at x of the Killing vector 
field generated by the Lie algebra element , we read off that via the isomor- 


phism s0(3) = R?, the Kirillov form @* gets identified with the scaled natural 
volume (or area) form on See 


x: (y x Z) 
r2 


2. G = SU(3): SU(3) is dealt with analogously to SU(2). The eigenvalues are 
1A1, 1A2, 143 with Ay € R and Aj +A2 +A3 = 0. The orbits are given by 


, xeS?, y,z1x. (8.4.7) 


x(y, Z) = —2 


{adiag(id1, id2, iA3)a" a € SU(3)}. 


There are three types of orbits. If the 4, are pairwise distinct, the stabilizer con- 
sists of the diagonal matrices a = diag(a, 6B, a6) with a, 6 € U(1). Hence, the 
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orbit is diffeomorphic to the homogeneous space 
SU(3)/(U() x UC) = U(B)/(UC) x UC) x UD). 


This is a flag manifold of dimension 6, cf. Example 5.7.7. If A; = A2 4 A3, the 
stabilizer consists of matrices of the form 


0 
b 

0 |, beU(), 
00 | det b 


and the orbit is thus diffeomorphic to the GraBmann manifold 
SU(3)/U(2) = U(3)/(U() x U)), 


which, in turn, is diffeomorphic to the complex projective space CP”, cf. Exam- 
ple 5.7.6. Finally, if the eigenvalues A; are all equal, they must vanish. Then, the 
stabilizer is SU(3) and the orbit consists of just the origin. The computation of 
the Kirillov symplectic form is left to the reader (Exercise 8.4.1). 

3. G =SO(3): As for SU(2), according to Example 5.4.7, the basis {I®, 5, I} in 
s0(3) given there defines an isomorphism between the adjoint representation and 
the identical representation of SO(3). This isomorphism identifies the nonzero 
adjoint orbits of SO(3) with the spheres S? of radius r > 0 in R?. A calculation 
similar to that of point 1 shows that the Kirillov form @ on an adjoint orbit 
coincides via this isomorphism with the 2-form (8.4.7). 

4. G=SO(4): This Lie group is semisimple but not simple. Recall from Exam- 
ple 5.4.7 that the Lie algebra isomorphism d@ : su(2) @ su(2) — so0(4) induced 
by the covering homomorphism @ : SU(2) x SU(2) — SO(4) of Example 5.1.11 
is an isomorphism of representations of SU(2) x SU(2), where the representa- 
tion on so(4) is induced via ¢ by the adjoint representation of SO(4). Thus, d@ 
identifies the adjoint orbits of SO(4) with those of SU(2) x SU(2). According to 
point 1, the following types of orbits occur: 


O.a.py) =S? x S?, O.A.0) = Gw,a) =S?, 6,0 ={0,0)} (8.4.8) 


with A, B esu(2), A#0, B £0. As explained there, under the Lie algebra iso- 
morphism su(2) = R? of Example 5.2.8, orbits of the second type are identified 
with spheres of fixed radius in R? with the symplectic form given by (8.4.7) and 
orbits of the first type are identified with products thereof. 


Exercises 

8.4.1 Determine the Kirillov symplectic form for the coadjoint orbits of SU(3), cf. 
Example 8.4.5/2. 

8.4.2 The Euclidean group E(3) in three dimensions is defined as the semidirect 
product of SO(3) with the group of translations, that is, E(3) := SO(3) « R3 
with multiplication 


(a1, b1) (az, bz) := (a1a2, by + ayb2). 
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Determine the Lie algebra of E(3) and show that under the isomorphism 
50(3) = R? of Example 5.2.8, the coadjoint orbits are given by 


Ged) ={(% y) € R? x RP :y’=c?,x-y=cd}, c>0, deER. 


8.4.3 For the Lie group G of real upper triangular (7 x n)-matrices with unit deter- 
minant, prove the following. 
(a) The Lie algebra g of G consists of the real upper triangular matrices with 
trace 0. 
(b) By means of the Ad-invariant scalar product (A, B) = tr(AB), g* can be 
identified with the space of real lower triangular matrices with trace 0. 
(c) Under this identification, the coadjoint action takes the form 


Ad*(g)A =(gAg~')_, 


with the minus sign meaning that all entries above the main diagonal are 
set to zero. Find the explicit matrix representation for elements of the 


orbit through 
0 O -- O O 
1 0 
A= 10 0 
0 O 
O -- O 1 O 


8.5 Coisotropic Submanifolds and Contact Structures 


Coisotropic submanifolds play an important role in symplectic reduction, in the 
theory of integrable systems and especially in Hamilton-Jacobi theory. The results 
to be derived in this section apply in particular to Lagrangian submanifolds. 

Let (M, w) be a symplectic manifold and let (NV, g) be a coisotropic submani- 
fold of M. For simplicity, we assume that N is given by a subset of M and that ¢ is 
the natural inclusion mapping, cf. Remark 1.6.2/1. By definition of coisotropy, we 
have (TN)® C TN. This implies that (T,N)° coincides with the characteristic sub- 
space Fa” of the 2-form wy = y*a, see Definition 4.2.18: indeed, since dwy = 0, 
Formula (8.1.5) yields 


ker(wy)p =TpNO(TpN)® = (TpN)®. (8.5.1) 
Moreover, by point 2 of Proposition 7.2.1, the dimension of (T,N)® is equal to the 


codimension of N and hence independent of p. This shows that the characteristic 
distribution D®” of wy is regular and that (TN)® coincides with this distribution. 
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Definition 8.5.1 The distribution D®” = (TN)® is called the characteristic distri- 
bution of NV. Sections of D®” are called characteristic vector fields. 


Note that D®" is isotropic as a vector subbundle of TM and that (D®")® = TN. 
Moreover, Proposition 4.2.20 yields 


Corollary 8.5.2 The characteristic distribution D® of a coisotropic submanifold 
is integrable. 


Integral submanifolds of D®" are called characteristics of N. 


Proposition 8.5.3 Let N be an initial coisotropic submanifold of the symplectic 
manifold (M,w) and let L be a Lagrangian submanifold of M contained in N. 
Then, L is a union of characteristics of N.! 


Proof By Proposition 1.6.14, L is a submanifold of N. Hence, TL C (TN), and 
thus 
(D°”) = (TN) w) COL =TL. 


It follows that, through every point of L, there exists a characteristic of N which is 
contained in L. 


Lemma 8.5.4 Let (M, w) be a symplectic manifold and let N C M be a coisotropic 
submanifold. A Hamiltonian vector field X ¢ on M restricts to 


1. avector field on N iff f is constant on the characteristics of N, 
2. acharacteristic vector field on N iff f is locally constant on N. 


Proof For allm € N and Y € T;,N, we have 


®mn (pms Y) =—-Y(f). 


First, this implies that (X ¢);y takes values in TN = (D®”)® iff Y(f) =0 for all 
Y € D®, that is, iff f is constant along the characteristics of N. Second, this im- 
plies that (Xf); takes values in D®°’ = (TN)® iff Y(f) =0 for all Y € TN, that 
is, iff f is locally constant on NV. 


We recall that in Sect. 2.7 we had characterized the tangent bundle of an em- 
bedded submanifold N Cc M and its annihilator by means of the ideal Cy (M) of 
smooth functions on M, vanishing on N: 


T,N ={Xe€T,M: X(f) =0 forall f eC?(M)}, (8.5.2) 


Note that the statement of the proposition is not about the maximal integral manifolds of D®” . 
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(T,Ny° = {a € TUM: a=df(p) for some f € Cr (M)}. (8.5.3) 


According to Proposition 7.2.1/1, the isomorphism w’: TM — T*M yields an iden- 
tification of the symplectic normal bundle (TNV)® C TM;w with the annihilator 
(TN)° of TN in T*M tv. Since this isomorphism identifies X ¢ with —df, we get 
the following characterization of the normal bundle: 


(TpN)° ={X €T,)M: X= X;(p) for some f e Cy (M)}. (8.5.4) 
These observations yield the following criteria for coisotropy [311]: 


Proposition 8.5.5 For an embedded submanifold N of a symplectic manifold 
(M, w), the following statements are equivalent: 


1. N is coisotropic. 
2. For every f € C?(M), the Hamiltonian vector field X ¢ is tangent to N. 
3. CX (M) is a Poisson subalgebra of C~(M). 


Proof The submanifold N is coisotropic iff (TN)® C TN, that is, according to 
(8.5.4), iff Xy is tangent to N for all f ¢ C}(M). Thus, points 1 and 2 are 
equivalent. By (8.5.2), point 2 is equivalent to the requirement that X ¢(g);w = 
{f. g}tw =0 for all f, g € CY (M). This is equivalent to point 3. 


Now, let F: M — R’ be asmooth mapping for which 0 is a regular value and let 
N = F~!(0). In this context, one has the following criterion for coisotropy. Let Fy 
denote the components of F and recall from the Level Set Theorem that for every 
m € N, one has 


TN =ker F,,. (8.5.5) 


Proposition 8.5.6 N = F—!(0) is coisotropic iff the functions F, are in involution 
on N, that is, 


(Fa, Fo}tn =0. 


Proof By Proposition 8.5.5/3, if N is coisotropic, then { Fy, Fp}; = 0. Conversely, 
if {Fa, Fo}}w = 0, then (dFy, Xp,)(m) = 0 for all m € N. In view of (8.5.5), this 
yields that the vector fields X ~, are tangent to NV. On the other hand, (8.5.5) implies 
that the differentials dF, span the annihilator (TN)° and, consequently, that the 
Hamiltonian vector fields X-, span (TN)®. Thus, (TN)? Cc TN. 


Remark 8.5.7 


1. If we assume that F: M — R" is a submersion and that {F,, Fy} = 0 holds 
on the whole of M, we obtain a foliation of M by coisotropic submanifolds 
(level sets) of codimension r. The special case r = 5 dim M yields a foliation by 
Lagrangian submanifolds. This is the situation of an integrable system. Chap. 11 
is devoted to the study of such systems. 
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2. It is easy to show that all coisotropic submanifolds can be described locally as 
level sets in the sense of Proposition 8.5.6. More precisely, let N C M be a 
coisotropic submanifold. Then, for every point p € N, there exists a neighbour- 
hood V of p in N, aneighbourhood U of p in M anda submersion F: U > R" 
such that V = F~!(0) and (Fa, Fo};v =0 (Exercise 8.5.1). 


Example 8.5.8 Let N C M be an embedded submanifold of M. 


1. The restriction T*M@}y of T*M to N is coisotropic. This has been shown before 
(Proposition 8.3.9). 

2. Let 7: Q > N be a surjective submersion. Then, VQ := kerz’ is a vertical 
subbundle of TQ, cf. Example 2.7.7. Its annihilator V°Q is the union of the 
conormal bundles of the fibres of 2. According to Example 8.3.8/3, these conor- 
mal bundles are Lagrangian submanifolds. Thus, Proposition 7.2.4/4 implies that 
V°Q is coisotropic. 


By Proposition 7.2.4/2, a submanifold of codimension | of a symplectic manifold 
(M, w) is always coisotropic. Such submanifolds play a prominent role in the theory 
of Hamiltonian systems and in Hamilton-Jacobi theory, see Chaps. 9 and 12. Often 
they carry the additional structure of a so-called contact manifold. To explain this, 
let us start by considering the following special case. Assume that (M, w) is a 2n- 
dimensional exact symplectic manifold, that is, there exists a global potential 1-form 
B such that wm = df. Then, there exists a unique vector field Z on M such that 


Zio= Bp. (8.5.6) 


Remark 8.5.9 The vector field Z defined by (8.5.6) fulfils 
L70 =. (8.5.7) 


A vector field with this property is said to be of Liouville type, cf. Remark 8.3.3/3 
for the cotangent bundle case. 


Now, let (P,1) be a hypersurface® in M which is transversal to Z. Then, the 
1-form 


a:=u*B 
on P has the property that a A (da)” is a volume form on P,, because 
1 
BA (dB)""! = (Zw) Aw"! = -Z1(0") (8.5.8) 
n 
and because P is transversal to Z. It follows that @ is nonzero on the characteristic 


distribution D®? = (TP)® = kerda and that da is non-degenerate on the hyper- 
plane distribution? kera. In view of the integrability criterion of Proposition 4.7.6, 


8That is, an embedded submanifold of codimension 1. 


° A hyperplane distribution on P is a regular distribution E C TP of codimension one. 
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the latter means that the hyperplane distribution kera@ is maximally non-integrable. 
Indeed, for any X, Y € I’ (kera), we have 


da(X, Y) =—a([X, Y]). 


One says that aw is a contact form on P and calls the pair (P, w) an exact, or strict, 
contact structure. 

Given the great importance contact geometry has acquired in recent years, espe- 
cially in connection with symplectic topology, we will discuss the basics here. There 
is a huge literature where one can find detailed presentations, e.g. [24], [102], [103], 
[181] and [206], see also [57]. 

The abstract notion of contact manifold is defined without taking recourse to a 
symplectic manifold. Thus, let P be a manifold and let E be a hyperplane distri- 
bution on P. Motivated by the above special case, we take the property of E to be 
maximally non-integrable as a defining condition and we formulate this condition 
in terms of locally defining 1-forms. A 1-form a on U C P is said to be locally 
defining for E if 


kera = Eju. 


It is said to be globally defining for F if in addition U = P. Locally defining 1-forms 
exist in a neighbourhood of every point of P: consider the quotient vector bundle 
fL :=TP/E, called the characteristic line bundle of E. The natural projection pr : 
TP — Y isa vertical vector bundle morphism. Let pr! : @* —> T*P be the dual 
morphism. Every local section & of * over U C P defines a local 1-form a on U 
by a = pr! o@. By construction, a} E =0. Hence, o is locally defining for E iff @ is 
nowhere vanishing. 


Definition 8.5.10 (Contact manifold) Let P be a manifold. 


1. A contact form on P is a 1-form @ for which (dq) pkerg is non-degenerate. 

2. Acontact structure on P is a hyperplane distribution F with the property that any 
locally defining 1-form is a local contact form on P. The pair (P, E) is called a 
contact manifold. 

3. (P, E) is said to be exact or strict if there exists a globally defining contact form. 

4. A smooth mapping ®: P; — P> of contact manifolds (P|, E;) and (Po, E2) is 
called a contact mapping if ®’(E,) = E>. If @ is in addition a diffeomorphism, 
it is called a contactomorphism. 


Remark 8.5.11 


1. Let a be a contact form on P. Since it is a 1-form, the subspace kera, C T»P 
has codimension | if a,» #0 and codimension 0 otherwise. Since da is non- 
degenerate on kera,, Proposition 7.1.2 implies that kera, has even dimension. 
Hence, the codimension must be the same for all p € P. It follows that 
(a) @ is nowhere vanishing, 

(b) kera@ is a hyperplane distribution on P and hence an exact contact structure, 
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(c) P has odd dimension. 

Moreover, non-degeneracy implies ker(da) » M kera» = {0}. Since a 2-form on 
an odd-dimensional manifold must have nontrivial kernel, it follows that ker(da) 
is a vector bundle complement of kera in TP. Thus, a defines a splitting 


TP =kera @ kerda. (8.5.9) 


2. Let (P, E) be a contact structure. Then, point | yields that E has even dimension 
and P has odd dimension. Moreover, every locally defining contact form for E 
on U C P defines a splitting of TP over U given by (8.5.9). This implies that 
over U, the characteristic line bundle Y of E can be identified with ker(da). 

3. Let E be a hyperplane distribution on P. If @ is a locally defining 1-form for 
E over U, then so is fa for every nowhere vanishing function f € C°(U). 
Conversely, for any two locally defining 1-forms a1, a2 over U, there exists 
f €C™(U), necessarily nowhere vanishing, such that v2 = fa. It follows that 
E is acontact structure iff for every p € P there exists a locally defining contact 
form at p. Thus, contact structures on P correspond bijectively to equivalence 
classes of local contact forms on P under the equivalence relation a, ~ a2 iff 
a2 = fa, for some smooth function f on the common domain of a and a2. 

4. By Remark 2.7.11/3, every hyperplane distribution E admits a vector bundle 
complement in TP and by identifying the characteristic line bundle & with such 
a complement one obtains a splitting 


TPZE@Y. (8.5.10) 


For example, if we choose an auxiliary Riemannian metric g on P, we can iden- 
tify Y with the orthogonal complement E+ of E in TP and thus realize the 
splitting (8.5.10) in the form TP = E @ Et. This way, every nowhere vanishing 
local section a of * corresponds via g to a nowhere vanishing local section Y 
of E+ and the 1-form w = pr! o@ is given in terms of Y by a = g(Y,-). 


Proposition 8.5.12 A 1-form a on P is a contact form iff a \ (da)” is a volume 
form on P. 


Proof First, assume that a is a contact form. Since (dq) ;kerq is non-degenerate, 
Proposition 7.1.7 yields that (da) er ie # 0 for all p € P. The decomposition 
(8.5.9) implies O tker(der) » # 0 and hence a,» A (day’, # 0 for all p € P. Thus, 
a A (da)” is a volume form. Conversely, assume that a A (da)” is a volume form 
and let p € P. By Proposition 2.7.5, we find a basis {e1,...,€2n, f} in TpP such 
that ker@, is spanned by e1,..., €2n. Then, 


Ap A (da)'(e1, +5 €2ny ff) = ap(f)(da)', (e1,.--, €2n) #0 


and hence (da), (e1, .++,€2n) #0. Thus, (do) tkerg is non-degenerate. 
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Remark 8.5.13 In analogy to the special case discussed in the beginning, Proposi- 
tion 8.5.12 states that a hyperplane distribution is a contact structure iff it is maxi- 
mally non-integrable. 


Proposition 8.5.12 implies 


Corollary 8.5.14 Let (P, E) be an exact contact manifold with globally defining 
contact form a. Then, a A (da)” is a volume form on P. In particular, every exact 
contact manifold is orientable. 


More generally, one has the following result on the orientability of contact man- 
ifolds. 
Proposition 8.5.15 Let (P, E) be a (2n + 1)-dimensional contact manifold. 


1. [fn is odd, P is orientable. 
2. Ifn is even, P is orientable iff (P, E) is exact. 


Proof See Exercise 8.5.2 or [102]. 


Now, let us derive a criterion for exactness. Recall from Remark 2.7.11/4 that a 
hyperplane distribution E is called coorientable if the characteristic line bundle 2 
is orientable. For dimensional reasons this means that it admits a global nowhere 
vanishing section and hence that it is trivial. Since every such section induces a 
globally defining 1-form for E, we have 


Proposition 8.5.16 A contact manifold (P, E) is exact iff E is coorientable, that is, 
iff the characteristic line bundle 2 = TP/E is trivial. 


Remark 8.5.17 (Reeb vector field) Let (P, E) be an exact contact manifold and let 
a be a defining 1-form. Then, there exists a unique vector field Ry on P such that 


Rysa=1, Ryda =0. (8.5.11) 


It is called the Reeb vector field associated with a. According to (8.5.11), it spans 
ker da and is transversal to FE. By (4.1.28) and (4.1.24), the flow ® of Ry satisfies 


d 
yee = OF Lao =O} ( Ryda + d(Rya)) = 0, 


that is, it leaves w and hence E invariant. Thus, Ry is an example of a contact vector 
field (a vector field whose flow preserves the contact structure E). 
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Example 8.5.18 


1. Let P = R2"*! with the standard coordinates x!,..., x”, y, ..., y” and z. Then, 
the 1-form 
n 
a:=dz+) > x'dy! (8.5.12) 


i=1 


is a contact form, called the standard contact form on R2”*! (Exercise 8.5.3). 


2. Consider R2”*+? with the standard coordinates x!,...,x”*+! and y!,..., y"t! 
and define 
1 n+l 
SS So (x! dy’ — y'dx'). (8.5.13) 


i=1 


Let P = S*”"+! and let 1: S2”+! — R?"*+? be the natural inclusion mapping. 
Then, :*@ is a contact form, called the standard contact form on sent (Exer- 
cise 8.5.4). 

3. Let Q be a manifold of dimension n. The projective cotangent bundle of Q is 
defined by 


P*Q:=(T*O\50)/~, (8.5.14) 


where so denotes the zero section in T* Q and & ~ &’ iff § = aé&’ for some ae R. 
This is a locally trivial fibre bundle over Q with typical fibre RP”~! and projec- 
tion z induced from that of T* Q. For [E] € P* Q, we take the mapping 


a 
(te) : Trqey Q > Tie (P*Q) 
and define 


Ejgy = ker{(r'te))' ©}. (8.5.15) 


with € being an arbitrary representative of [E] € P* Q. This is a hyperplane distri- 
bution on P* Q. We leave it to the reader to prove that the canonical 1-form @ on 
T*Q descends to a globally defining contact form for E (Exercise 8.5.5). Thus, 
(P* Q, E) is an exact contact manifold. 

4. Similarly, the cotangent sphere bundle of Q is defined by 


S*Q:=(T*O\50)/~, (8.5.16) 


where & ~ &’ iff € = ag’ for some positive real number a. This is a locally trivial 
fibre bundle over Q with typical fibre S’~! and projection induced from that 
of T* Q. As in the previous example, Formula (8.5.15), with z interpreted as the 
projection of S* Q, defines an exact contact structure on S*Q with contact form 
induced from the canonical 1-form 6 on T*Q. 

5. The following example plays an important role in applications. Given a 2n- 
dimensional exact symplectic manifold (M, w) with potential 1-form 6, define 
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P=M x Rand!° 
a :=dt — B, (8.5.17) 


where ¢ denotes the standard coordinate on R. Since 
a A (da)” = (—1)"dt A (dB)” 


and since the right hand side is a volume form on P, @ is a contact form on P. 
6. As a consequence of point 2 of Proposition 8.5.15, one can use manifolds which 

are not orientable to construct contact manifolds which are not exact. For exam- 

ple, let P = R"+! x RP” with the standard coordinates x!,...,x”+! on R"+! 


and the homogeneous coordinates [y! :...: y”*+!] on RP” and define 


One can check that F is a contact structure on P. From Example 4.2.5/3 we know 
that RP” is not orientable for even n. Hence, in this case, E is not exact. For 
an exhaustive discussion of this contact structure, we refer the reader to [102], 
Example 2.14 and Proposition 2.15. Let us add that in the case where n is odd, 
P is orientable but E is not coorientable and, therefore, (P, E) is not exact, too. 


Now, let us return to the discussion of hypersurfaces of symplectic manifolds. 
To begin with, we discuss the procedure of symplectization for an exact contact 
manifold (P, E). Choose a globally defining 1-form a, and endow P x R with the 
exact 2-form 


Wy = d(e'o) =e! (dt A a + da), (8.5.18) 


where ¢ denotes the standard coordinate on R. By assumption, we have the decom- 
position (8.5.9). Since daw is non-degenerate on kera = E and a@ is non-degenerate 
on ker(da), and since df is non-degenerate on R, wy is non-degenerate and hence 
symplectic. The symplectic manifold (P x R, w.) is referred to as a symplectization 
of (P, E). Since P embeds into P x R as the hypersurface P x {0}, we obtain 


Proposition 8.5.19 Every exact contact manifold can be embedded as a hypersur- 
face in an exact symplectic manifold. 


Moreover, since every contact manifold is locally exact, by applying the proce- 
dure of symplectization locally, we obtain the following contact counterpart of the 
Darboux Theorem. 


!0We omit the natural projections to the factors of the direct product. 
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Proposition 8.5.20 Let (P, E) be a (2n + 1)-dimensional contact manifold. For 
every p € P, there exist local coordinates x',...,x", y',..., y" and z at p such 
that 


n 
a= So x'dy! +dz 
i=1 


is a locally defining \-form for E. 


Example 8.5.21 For P = S*"~! with the contact structure of Example 8.5.18/2, the 
manifold S*”—! x IR, endowed with the symplectic form (8.5.18), is symplectomor- 
phic to R®” \ {0} with the standard symplectic structure (Exercise 8.5.6). 


Remark 8.5.22 


1. Let (P, E) be an exact contact manifold and let 1: P ~ P x R be the natural 
embedding of P into its symplectization, given by ¢(p) = (p, 0). Since i*w = da, 
the splitting (8.5.9) is a decomposition of TP into the non-integrable contact 
structure E = kere and the integrable characteristic distribution D®? = kerda 
of the coisotropic submanifold P. Moreover, there exists a natural transversal 
Liouville vector field, namely Z = 2. Indeed, Zw = ea. 

2. In the case where (P, E) is not necessarily exact, the symplectization is con- 
structed as follows. Let sg denote the zero section of T*P, let E° denote the 
annihilator of E in T* P and let 6 be the canonical 1-form on T* P. Define 


M :=E°\ 50, B:=00, 


where «: M — T*P denotes the natural inclusion mapping. The action of the 
multiplicative group R, := R \ {0} on M by scalar multiplication turns M into a 
principal R,.-bundle over P with projection z induced from the canonical projec- 
tion of T* P by restriction. Using that every locally defining 1-form a for E over 
U induces a local trivialization P x R, > 2~1(U) by (m, ft) > tam, as well as 
Proposition 8.5.12, one can show that (df)” is a volume form. Hence, d§ is a 
symplectic form on M (Exercise 8.5.7). Thus, in the general situation, symplecti- 
zation yields a principal bundle over P whose bundle manifold M is symplectic, 
and P is only locally embedded into M as a hypersurface. Since locally defining 
1-forms for E define local trivializations of M, in the case where (P, E) is exact 
with global contact form a, M is trivial and hence decomposes into the two con- 
nected components M4 containing the images of ta. Both Mz contain P asa 
submanifold, embedded via +a. Moreover, the mappings P x R > Mu4, defined 
by (m, t) > +e'/?a@, are symplectomorphisms making the diagram 


P xR ———___> Mx. 


ee 


commutative. 
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Finally, we consider a generalization of the class of examples we started with. 


Definition 8.5.23 (Hypersurface of contact type) Let (M, w) be a symplectic man- 
ifold. A compact hypersurface (P, 1) of M is said to be of contact type iff P admits 
a contact form a such that da =c*w. 


Thus, every hypersurface of contact type is an exact contact manifold. 


Lemma 8.5.24 Let (M, w) be a symplectic manifold and let (P,1) be a hypersur- 
face of contact type with contact form a. There exists a 1-form i on a neighbourhood 
U of P in M such that 


dA =r, UA=a. 


Proof Since P is compact, it is embedded. Since it admits a global contact form, it 
is orientable. Thus, P admits a transversal vector field, constructed for example by 
means of the following data: 


(a) a covering by oriented charts (U,«) mapping PU to an open subset of 
R2n-1 x {0} Cc R2" 

(b) a subordinate partition of unity, 

(c) the unit vector field on R7” given by the standard basis element e2,,. 


Since P is compact, the flow of a transversal vector field defines a diffeo- 
morphism @: P x (—e,¢) > U with @(m,0) = m for all m € P. Let pr; : P x 
(—e, €) — P denote the natural projection and define a 1-form on U by 


= (pr, of ')*a. 


Consider the 2-form w;y —dt. Since pr 0 7! ot =idp, we have i* (wy —dt) = 0. 
Hence, the generalized Poincaré Lemma 4.3.14 implies the existence of a 1-form B 
on U such that wpy — dt = df and .*f6 =0. Then, A := Tt + 6 has the desired 
properties. 


Proposition 8.5.25 A compact hypersurface (P, t) of a symplectic manifold (M, w) 
is of contact type iff on some neighbourhood of P in M there exists a Liouville vector 
field which is transversal to P. 


Proof First, assume that there exists a transversal Liouville vector field Z on a 
neighbourhood U of P in M. Define a := Zim. Using the Liouville property 
(8.5.7), we find 


da = d(Ziw) = Lzw=o. 


Now, (8.5.8) and transversality imply that e*(@ A (da)”) is a volume form on P. 
Hence, :*a is a contact form on P. Conversely, assume that a € 92 '(P) is a contact 
form with the property da = 1*w. We define a vector field Z on a neighbourhood U 


8.5 Coisotropic Submanifolds and Contact Structures 393 


of P by Zim =A, where dX is the 1-form provided by Lemma 8.5.24. This vector 
field has the desired properties: due to 


L70 =d(Ziw)=drA=a, 


it is Liouville and due to w =1*A, it is transversal. 


Corollary 8.5.26 Let (M,w) be a symplectic manifold and let P C M be a hyper- 
surface of contact type with contact form a. Let U C M be a neighbourhood of P 
on which there exists a Liouville vector field Z transversal to P.. 


1. U contains a neighbourhood V of P in M which is foliated by hypersurfaces of 
contact type modelled on P.. 
2. The flow of Z yields isomorphisms of the characteristic line bundles. 


Proof Lett: P — M denote the natural inclusion mapping. 

1. Since P is compact and since Z is transversal to P, by restriction, the flow ® 
of Z induces a diffeomorphism from (—¢, ¢) x P onto some open neighbourhood 
V of P in U. Fort € (—€, €), we define 


P,:=@,(P), ay = D2 a. 


Since da, = (to ®_,;)*w and since to ®_, : P,; > M is the natural inclusion map- 
ping, P; is a hypersurface of contact type with globally defining 1-form a;. 

2. Let t € (—e, €). According to Remark 8.5.11/2, the characteristic line bundle 
of P, can be identified with kerda,. From %z@ = @ we obtain @*w = e'w. For 
peéP, X €ker(da), and Y € T,P, we find da(X, Y) = t*w(X, Y) =0 and hence 


0 = (e'i*w)(X, Y) = ((® 01)*w)(X, Y) = o((® 0 0)'X, (® 01)'Y). 


It follows that (®; 01)'/X € kerda;. Thus, by restriction, (®; 01)’ induces an isomor- 
phism from ker da onto ker da;. 


For a deeper discussion of hypersurfaces of contact type we refer to [139]. We 
will meet them again in Sects. 9.3 and 9.4. 


Exercises 

8.5.1 Prove Remark 8.5.7/2 by induction on the number of functions in involution. 

8.5.2 Use Proposition 8.5.12 and a partition of unity to prove Proposition 8.5.15. 
Hint. For local contact forms w and functions f, (fa) A (d(fa))”" = f"tla A 
(da)”. 

8.5.3 Prove that (8.5.12) is a contact form on R”*+!. Calculate it in the polar coor- 
dinates r;, g; on the planes (x;, y;). 

8.5.4 Prove that “a, with a defined by (8.5.13), is a contact form on oot, 

8.5.5 Prove that (8.5.15) defines a contact structure on the projective cotangent bun- 
dle P*Q. 

8.5.6 Prove that the symplectization of S?”+! yields R2” \ {0}, cf. Example 8.5.21. 
Hint. Use the logarithm of the radius function. 

8.5.7 Provide proofs for the statements made in Remark 8.5.22/2. 
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8.6 Generalizations of the Darboux Theorem 


In this section we discuss natural generalizations of the classical Darboux Theorem, 
which are important both in the theory of integrable systems and in the theory of 
symmetry reduction of Hamiltonian systems. Moreover, they have proved useful 
in other applications, too, like e.g. in canonical realizations of Lie algebras, see 
[74]. Most of the results below belong to Weinstein, see [305]. As in the proof of 
the classical Darboux Theorem, the deformation method of Moser plays a central 
role. 


Theorem 8.6.1 (Weinstein) Let M be a manifold, endowed with two symplectic 
forms wo and w and let N C M be an embedded submanifold, on which wo and @ 
coincide, that is, 


wo(X, Y) = w (X,Y) 
for all X,Y € TM and m € N. Then, there exists a diffeomorphism ® of open 
neighbourhoods U and ®(U) of N C M with the properties 


@*o, =a, Pry =id. 


Moreover, ® can be chosen so that ®/, = idy,,m forallmeéN. 


m 


Proof By the Tubular Neighbourhood Theorem for embedded submanifolds, cf. Re- 
mark 6.4.7, there exists an open neighbourhood V of N Cc M which can be diffeo- 
morphically identified with a neighbourhood of the zero section sg of a vector bundle 
(E, N, 7). In what follows, we view V in this way. Let jz; : E — E be the fibrewise 
multiplication by s € R and let V be chosen so that s5(V) C V for all s € [0, 1]. 
Let A := {(s, t) € [0, 1] x [0, 1): t < 5}. There exists an open neighbourhood Y of 
A x V in R? x V such that the mapping 


ge 2+, p(s, t,m) = $s,1(M) = Ma—sya—n-1™M), 


is well defined. We have $0.9 = idy and ¢1,1 = so 0 7, and ¢y ; is a diffeomorphism 
onto its image for all s < 1. Thus, for s < 1, @ defines a time-dependent flow (Exer- 
cise 8.6.1) on V. Let Y be the corresponding time-dependent vector field, 


d 
Ys(m) = ae ort (m) = —Hq—s)-! (m). (8.6.1) 


Obviously, Y, vanishes on NV. Using (4.1.28) and denoting ¢, = ¢5.9, form € V we 
compute: 


(wo — @1)(m) = (6F (@1 — wo) — (@1 — @0)) (m) 


1 d : 
= [ |, (9 (@1 — e0)) (mas 
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ik 
7 / (6% (Ly, (or — o0))) (nas 


0 
1 
= [ d(p* (Ys -s(1 — «o)))(m)ds. (8.6.2) 
Thus, by (4.1.30), 
1 
po —@|=da, where a := / 5 (Ys-(@1 — wo))ds. (8.6.3) 
0 


Although Y, is not defined for s = 1, the integral exists, because ¢{ amounts to 
multiplication by (1 — s) and hence the factor (1 — s)~! in (8.6.1) cancels. Note that 
a vanishes on N. Now, as in the proof of the Darboux Theorem 8.1.5, we consider 
the family of 2-forms 


@, = ao+t(@1—a@o), +t €[0, 1]. (8.6.4) 


By analogous arguments, one can show that w; is non-degenerate on an open neigh- 
bourhood W of [0,1] x N in [0,1] x V. Thus, one can define a time-dependent 
vector field X on W by 


X10; =a. (8.6.5) 


There exists an open neighbourhood Uo of N in V such that [0, 1] x {0} x Up is 
contained in the domain of the flow @ of X.!! Writing ®, = @; 0, one finds 


< Oe a 2; ( Lion + =) = @*(da + @| — 0) =0 
on [0, 1] x Uo. Since ®p = id, this implies Pio = wo. Since X; vanishes on N 
for all t, we have ©} ;y =idy. Thus, 6 = ;: Up > U; = G1 (Up) is the desired 
diffeomorphism. 

To see that (,)),, = idr,,u for all m € N, we choose a local chart with co- 
ordinates x/. Since wp — @; and Y, vanish on N, the coefficients of the 1-form 
Y;4(@1 — @o) are sums of products of two functions vanishing on NV. Then, the first 
derivatives of these coefficients vanish on JN, too, and this is also true for the coef- 
ficients of a. Using this, we conclude that the partial derivatives of the coefficients 
of X; vanish on WN as well. Differentiating the defining equation 


d 

rae m) = X;(®(t,m)) 
with respect to x’ we obtain a differential equation for the matrix of (@,)/,,. Since 
the partial derivatives of the coefficients of X; vanish on N and since ®o = id, the 
solution is (D)}, =idy,, mu for all t, hence in particular for t = 1. 


m 


'l Up is the union of the subsets of V obtained by applying the Tube Lemma to every point of N. 
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Let us consider the special case of an orbit of a symplectic Lie group action. !” 


Definition 8.6.2 (Symplectic Lie group action) Let (M,@) be a symplectic mani- 
fold. An action W of a Lie group G on M is called symplectic, or canonical, if 


VFo=o (8.6.6) 
for all a € G. In this case, the tuple (M, w, W) is called a symplectic G-manifold. 


In what follows, we assume that W is a left action. The following corollary is a 
direct consequence of Theorem 8.6.1, with the submanifold N being an orbit of W. 


Corollary 8.6.3 (Equivariant Darboux Theorem) Let a manifold M be endowed 
with two symplectic forms wo and @ and a proper action W of a Lie group G which 
is symplectic with respect to both wo and @,. If wo and @ coincide on an orbit O 
of ©, that is, if 


ao(X, Y) = a1 (X, Y) 


for allm € O and X,Y € TM, there exist YW -invariant open neighbourhoods Uo 
and U, of O and a W -equivariant diffeomorphism ® : Uy — U, with the property 


P*w, = a0, Pio =id. 


The diffeomorphism ® can be chosen so that ®), = idy,, um for allm € O. 
Proof The proof is completely analogous to that of Theorem 8.6.1, with the general 
Tubular Neighbourhood Theorem replaced by Theorem 6.4.3. Obviously, @; is in- 
variant under W. Since the fibrewise multiplication 45 commutes with W, the flow 
¢ and thus the vector field Y are invariant, too. This implies G-invariance of a and 
X, and thus of the symplectomorphism @. 


Another special case is that of a Lagrangian submanifold L. This is important in 
the theory of integrable systems. By Lemma 8.1.8, the tangent bundle TL admits a 
Lagrangian complement EF in TM}_,, so that 


TM}, =TLOE. (8.6.7) 


Pointwise application of Proposition 7.2.9 yields the vertical isomorphism of vector 
bundles 


X:EoT L, x(Z) =a(Z,-). (8.6.8) 


For all m € L, this isomorphism induces symplectomorphisms 


to 1M = Tbe TL, W(X) := X_ @x(Xe), (8.6.9) 


!2This notion has been already announced in Remark 6.1.3. 
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with X; and Xz being the components of X with respect to the decomposition 
(8.6.7). The reader may compare this with Proposition 8.3.12 by putting M =T*Q 
and taking for L the image of a closed 1-form on Q. 


Theorem 8.6.4 (Weinstein) Let L be an embedded Lagrangian submanifold of a 
symplectic manifold (M, w). Then, there exists a symplectomorphism ® of an open 
neighbourhood of L in M onto an open neighbourhood of the zero section sy of T* L 
such that ®}, = so. For every Lagrangian complement E of TL inTM}j,, ® can be 
chosen so that for allm € L, 


®), (Em) = Tom) (T3,L). (8.6.10) 


Proof Let E be a Lagrangian complement of TL in TM}, and let x : E > T*L 
be the vector bundle isomorphism defined by (8.6.8). The Tubular Neighbourhood 
Theorem for embedded submanifolds implies the existence of a diffeomorphism @ 
of an open neighbourhood U of L Cc M onto an open neighbourhood g(U) of the 
zero section é€g in E such that 


PIL =e; PtE,, = 1D T gm) Em (8.6.1 1) 


for all m € L. Thus, w = x og is a diffeomorphism of U onto an open neighbour- 
hood y(U) of the zero section so in T*L with the property y+, = so. Let @ denote 
the canonical 1-form on T*L. We show that w*d6 and @ coincide on L and apply 
Theorem 8.6.1. For that purpose, we show that w/, : TnM — Tsyam)(T*L) is sym- 
plectic for all m € L. A brief computation using (8.6.11) and x o eg = so shows that 
for X € T,,M, 


Win (X) = Xeocm) © Pin(X) = So(XL) + X(XE)soamys (8.6.12) 


where X(Xp) is the constant vector field on T;, L defined by x (Xg) € Ty, L (Exer- 


m 
cise 8.6.2). Hence, y/, coincides with the composition 


TmM > TmL ® Ty, L > Tsy(my(T*L), (8.6.13) 


where the first mapping is the symplectomorphism (8.6.9) and the second one is the 
symplectomorphism provided by Proposition 8.3.12. Now, we apply Theorem 8.6.1. 
Composing y with the local symplectomorphism provided by this theorem, we ob- 
tain the desired symplectomorphism @. 


Finally, let us assume that the symplectic manifold (M, w) is endowed with the 
additional structure of an integrable Lagrangian foliation , that is, a foliation 
by Lagrangian submanifolds. Let D be the corresponding Lagrangian distribution, 
given by the tangent spaces of the leaves of .F. The standard example of a La- 
grangian foliation is given by the fibres of the cotangent bundle of a manifold. We 
say that a Lagrangian submanifold L of M and a Lagrangian foliation ¥ of M 
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are transversal if L is transversal to every leaf of ¥. In this case, for dimensional 
reasons, one has 


TnM = TnL ® Din 


for every m € L. In the presence of a transversal Lagrangian foliation, the symplec- 
tomorphism ® of Theorem 8.6.4 can be chosen in such a way that it respects the 
fibre structure: 


Corollary 8.6.5 Let (M,w) be a symplectic manifold, let L be an embedded La- 
grangian submanifold and let # be a Lagrangian foliation of M transversal to L. 
Then, the symplectomorphism ® of Theorem 8.6.4 can be chosen in such a way that 
the intersection of any leaf of F with the domain of ® is mapped to a fibre of T*L. 


Proof Let D be the distribution defined by .¥. In the notation of the proof of The- 
orem 8.6.4, we choose E = Dy_. Since x is fibre-preserving, in a first step one 
must show that g can be chosen in such a way that y = x og maps leaves of F 
to fibres of T*L. This follows from the general Tubular Neighbourhood Theorem, 
see Remark 6.4.7. In a second step, one must show that the mapping ® of Theo- 
rem 8.6.1 maps every leaf of ¥ to itself. For that purpose, it is enough to show 
that, in the notation of the proof of that theorem, the time-dependent vector field X 
defined by (8.6.5) lies in D: by construction, the vector field Y;, defined by (8.6.1), 
lies in D. Since D is Lagrange with respect to both y*d@ and a, the 1-form @ de- 
fined by (8.6.3) vanishes on all vectors of D. Thus, the vector field X; lies in the 
@;-orthogonal complement of D. Since D is also Lagrange with respect to @,, it 
follows that X; lies in D. 


The fibrewise transitive action of the cotangent bundle on itself, defined in 
Sect. 8.3, carries over to the case under consideration. Let us recall that a folia- 
tion ¥ on a manifold M is called simple if there exists a surjective submersion 
p: M — B onto another manifold B such that for every point m € M the leaves of 
£ are the closed submanifolds go! (b), b € B, cf. Example 3.5.20/2. We may view 
B as the space of leaves, B = M/.¥. Every  < T; B, b € B, defines a vector field E 


on the leaf p '(b) by 
En JOm = (0',)"(&). (8.6.14) 


In what follows, we assume that the flow ¢§ of é is complete for every € € T*B. 
For example, this is the case if the leaves of .F are compact. Then, we can define 
ge = ¢:, which yields a diffeomorphism of the leaf p~!(b). Thus, every fibre T;B 
acts on the leaf p~!(b) as a vector group. Now, let 


s:B>s(B)=LCM 


be a Lagrangian section of p, that is, a Lagrangian submanifold such that pos = idg. 
Then, 


®: TBS M, P(E) := dbe(s 0 p(E)), (8.6.15) 
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defines a symplectomorphism of a neighbourhood of the zero section of T* B onto a 
neighbourhood of L = s(B) in M, (Exercise 8.6.3). This way we obtain an alterna- 
tive proof of Theorem 8.6.4, with the Lagrangian complement F being replaced by 
the integrable Lagrangian distribution D. 

Let us add that the assignment & b> é extends to 1-forms on B by assigning to 
a € 2'(B) the vertical vector field @ on M given by 


Bim °= Xp(m) 
for all m € M, cf. (8.3.11). Then, 
daw = p*a (8.6.16) 


and the flow ¢% of & is given leafwise by ¢?(m) = ob,”  (m). It defines a diffeo- 
morphism of M by ¢y := ¢{. This diffeomorphism is vertical with respect to p and 
satisfies dy (m) = Papin) (m) for all m € M. As in the case of the cotangent bundle, 
one can show that 


biw=o+ p*da, Luo = p*da, [&, 8] =0 (8.6.17) 


(Exercise 8.6.4). 


Exercises 

8.6.1 Complete the proof of Theorem 8.6.1 by verifying the properties of the 2- 
parameter family of mappings {¢;,+} and proving formula (8.6.1). 

8.6.2 Prove Formula (8.6.12). 

8.6.3 Show that the mapping @, defined by (8.6.15), is a symplectomorphism of a 
neighbourhood of the zero section of T*Z onto a neighbourhood of L in M. 

8.6.4 Prove the formulae in (8.6.17). 

8.6.5 Let ® be the local symplectomorphism of Corollary 8.6.5 and let B := ®*6. 
Show that 


dB=o, DcCkerB, fr, =0. 


8.7 Symplectic Reduction 


The general theory of symplectic reduction goes back to Benenti and Tulczyjew 
[37, 42]. In this section we present the main aspects of this theory without going 
into all details, for which we refer the reader to the above papers. An exhaustive 
treatment can also be found in [181]. The particularly important case of symplectic 
reduction for systems with symmetries will be dealt with in Chap. 10. 


Definition 8.7.1 (Symplectic reduction) Let (M, w) be a symplectic manifold, let 
(N, tx) be a submanifold and let (P, wp) be another symplectic manifold. A map- 
ping z: N —> P is called a symplectic reduction if it is a surjective submersion 
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fulfilling 
T* wp = yo. (8.7.1) 


In this case, the pair (P, wp) is called a reduced symplectic manifold for VN. A sym- 
plectic reduction is called strict if N is coisotropic. 


Comparing this definition with that of linear symplectic reduction in Sect. 7.3, 
we see that P is the geometric counterpart of the quotient vector space W given by 
(7.3.1) and wp corresponds to the symplectic form wy defined by (7.3.2). In partic- 
ular, the case of strict reduction corresponds to the case where W is coisotropic, cf. 
(7.3.3). For what follows, we denote 


ON = lyo. 


Let F®" be the family of characteristic subspaces and let D®” be the characteristic 
distribution of wy, cf. Definition 4.2.18. Recall from Proposition 4.2.20 that D°” 
is integrable and that the foliation consisting of the maximal integral manifolds is 
referred to as the characteristic foliation F°” of wy. Moreover, recall from Exam- 
ple 3.5.20/2 that if #°” is simple, the corresponding space of leaves is unique up 
to diffeomorphisms. We say that a surjective submersion z : N — P has connected 
fibres if x~!(p) is connected for every p € P. 


Proposition 8.7.2 Let (M,w) be a symplectic manifold and (N,tn) a submani- 
fold. 


1. Ifa: N — P is asymplectic reduction onto a symplectic manifold (P, wp), then 
rank wy = dim P, D®N =kera@y =kerz’. (8.7.2) 


In particular, D®’ is regular. If m has connected fibres, FN is simple. 

2. If the rank of wn is constant and if #°% is simple, the space of leaves P admits 
a unique symplectic form wp such that the canonical projection x : N — P isa 
symplectic reduction. If «: N > P is another symplectic reduction onto a sym- 
plectic manifold (P, @), then there exists a surjective local symplectomorphism 
x: P > P such that % = x on. If has connected fibres, x is a symplectomor- 
phism. 


Proof Due to dwy = 0, we have F°Y = keray. 
1. Since wp is non-degenerate, (8.7.1) implies kerwy = kerz’. First, this yields 


rankwy = dim N — dimkerwy = dimkerz’ + dim P — dimkerwy = dim P. 


Second, this implies that ker a, = F®” has constant rank and hence, by the remarks 
after Definition 4.2.18, that it coincides with D®" . In particular, D®” is regular and 
coincides with kerz’. Since z is a submersion, the maximal integral manifolds of 
the distribution ker z’ are the connected components of the fibres of 7. Therefore, if 
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the fibres of z are connected, they coincide with the maximal integral manifolds of 
ker 2’ = D®" and hence with the leaves of ¥°® . Thus, in this case, .¥°®% is simple. 

2. Using an atlas on N adapted to the foliation #°%, one can check that for any 
two pairs of tangent vectors X;, ¥; € Tn, N, i = 1, 2, satisfying 2’X1 = m/X>2 and 
w/Y, =2'Y, one has 


wn (X1, Y1) =n (X2, ¥2), (8.7.3) 


see Exercise 8.7.1. Since in addition z is a submersion, Eq. (8.7.1), taken pointwise, 
defines a unique 2-form (wp), on T,P for every p € P. Since m admits local 
sections, these 2-forms combine to a unique differential 2-form wp on P. Since 


m*dwp = d(x*wp) = dwn =0 


and, again, since wz is a submersion, wp is closed. Due to rankwp = rankwy = 
dim P, it is non-degenerate and hence symplectic. Now, let 7: N — P be another 
symplectic reduction. Then, by point 1, 


ker’ = kerwy =kerz’. 


Since P is the space of leaves of #°%, the fibres of a are the maximal integral 
manifolds of the distribution kerz’ and hence the connected components of the 
fibres of 7. In particular, the fibres of 2 are contained in those of 7. Hence, there 
exists a unique mapping x: P > P such that 7 = x oz. Since x admits local 
sections, this mapping is smooth. Using 


wp =ON = Op =m*(x*op) 


and the fact that z is a submersion, we conclude that y*@p = wp. Since P and P 
have the same dimension, according to Proposition 8.1.3, x is a local symplectomor- 
phism. It is surjective, because 7 is surjective. Obviously, if 7 has connected fibres, 
the same argument applied backwards yields that x is a symplectomorphism. 


The next Proposition characterizes symplectic reductions in terms of their graphs. 


Proposition 8.7.3 Let (M,@) be a symplectic manifold, let (N,tn) be a subman- 
ifold and let m: N — P be a surjective submersion onto a symplectic manifold 
(P, wp). The mapping 1 is a symplectic reduction iff its graph I, is an isotropic 
submanifold of M x P endowed with the symplectic product structure wy, p de- 
fined by (8.1.2). The reduction is strict iff I is Lagrange. 


Proof We calculate 


(ty x 1)*(pry © — prp wp) =@y — T* op. 
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Since ty x 7 is a diffeomorphism from N onto I7,, (8.7.1) holds iff I, is isotropic. 


By Proposition 7.2.4/3, N is coisotropic iff rankwy = 2dim N — dim M. By For- 
mula (8.7.2), we have rank wy = dim P. Thus, N is coisotropic iff 


1 
dimN = (dim M + dim P), 


that is, iff I, is Lagrange. 


Finally, in applications in physics, it often happens that there is a submanifold of 
the symplectic manifold (the phase space), which is invariant under the flow of the 
Hamiltonian vector field of a given function (a physical observable). We show that 
if the assumptions of symplectic reduction are fulfilled, one obtains a reduction of 
the flow to a lower dimensional manifold (the reduced phase space). 


Proposition 8.7.4 Let (M, w) be a symplectic manifold, let N C M be a subman- 
ifold and let 1: N — P be a symplectic reduction with connected fibres onto a 
symplectic manifold (P, wp). Let f € C°(M) and assume that N is invariant un- 
der the flow of the Hamiltonian vector field X ¢. Then, there exists a unique function 
7 € C™(P) such that 


fon f. 


The vector field on N induced by X f is 1 -related to Xp 


Proof Since the characteristic distribution of wy is given by kerwy = TNN(TN)® 
and since Proposition 7.2.1/1 implies (TN )? = ((TN)”)°, we have 


(TN)? c (kerwn)°. 


Thus, if (X ¢);y takes values in TN, then (df), takes values in (ker wy)? and 
hence f is constant on each leaf of the characteristic foliation 7°”. Since mz is a 
surjective submersion with connected fibres, there exists a unique smooth function 
f €C™(P) such that 1* f = iy, f = fx. Then, n*df =dfy. Using oy = 2*wp 
and the fact that the vector field X f induced by Xf on N is ty-related to Xf, we 
obtain 


X /a(x* wp) — X pawn =-—dfy = —n*df — m*(X pswp). 


Evaluating both sides on an arbitrary tangent vector Y €¢ TN and using that wp is 
non-degenerate and that z is a submersion, we conclude 


m'oXs=Xpon, 


f 


that is, xX f is w-related to X 
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Remark 8.7.5 


1. For fa to exist, it suffices to require that f be constant on the leaves of the char- 
acteristic foliation ¥°% . In this case, (df) ;y takes values in (ker wy )° and thus 


Proposition 7.2.1/5 implies that X f takes values in 


(kerwy)® = (TN (TN)®)° = TN + (TN). 


2. By Proposition 3.2.13, the flows ®/ of Xf and &S of xX; fulfil 


bd) on=n0os 


on the domain of of . Thus, by symplectic reduction, the problem of finding the 
integral curves of the Hamiltonian vector field X ¢ through points of N has been 
reduced to the corresponding problem on a space of lower dimension. For this 
reason, such reductions are enormously important in physical applications, see 
Chap. 10. 


Exercises 

8.7.1 Complete the proof of Proposition 8.7.2/2 by verifying (8.7.3). 
Hint. Show that in a local chart (U,«) on N adapted to the foliation ¥°”, 
with «', i < k yielding coordinates on the leaves and x’, i > k labelling the 
leaves, one has wy = Pee) ia Wij dx! A dx/, where the functions Wij 
are constant on the leaves. 


8.8 Symplectomorphisms and Generating Functions 


In this section we will show that symplectomorphisms may be viewed as Lagrangian 

submanifolds and that they can be (locally) generated by functions. Using this fact, 

we will gain some insight into the structure of the group of symplectomorphisms. 
The following proposition is a special case of Proposition 8.7.3. 


Proposition 8.8.1 Let (M), 1) and (M2, w2) be symplectic manifolds of the same 
dimension. A diffeomorphism ®: M, — Mp is a symplectomorphism iff its graph 
is a Lagrangian submanifold of M, x M2 endowed with the product symplectic 
structure @y, x M5 defined by (8.1.2). 


Proof Lette : 'g@ — M, x Mz be the natural inclusion mapping and let pr; be the 
canonical projections onto the factors of M, x M2. Note that  o pr, ol@ = pr, ol. 
Then, 


* = * 2 * 
DOM, x My = (pry @2 — prj @1) 
= (pry 0lp)*w2 — (pry olg)* a 


= (pr, 0 Lp)" (P* ar _ 1). 
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Since pr, ol@ is a diffeomorphism, we conclude that (pm, My = O iff @) = O* a. 


In this case, Im is Lagrange, because dim Ig = dim M; = 5 dim(M, x M2). 


Now, let ® : M; — Mp be a symplectomorphism. By the Poincaré Lemma, 
locally there exists a 1-form t on M, x Mp2 such that mu, x | dt. Since 
(Ey, i= 0, 137 is closed. Applying once again the Poincaré Lemma, we find 
an open subset U C Ig and a smooth function S: U > R fulfilling 


(5) 1p = —dS. (8.8.1) 


The function S is called a generating function of the canonical transformation @. 
Note that S depends on the choice of the potential t. Let us discuss the choices 
commonly used in the physics literature. For that purpose, let g', p; and g', p; be 
local Darboux coordinates on M; and Mo, respectively. The induced coordinates 
on M, x M2 will be denoted by g! = q' o pr, pi = pi o pry, g' =! o pr) and 
Pi = Pi © pry. In these coordinates, we have 


w= dp; A dg! — dp; A dq’. 
The following choices for t occur frequently: 
Bidg’ — pidg', —G' dpi — pidg', — ig’ +.q'dp;i, — —G'd pj +. 4'dp%. 


The corresponding generating functions S are said to be of the i-th kind, i = 
1,...,4. Which choice is convenient depends on @. If, for instance, the functions 
qi ole and q ol@ define a local chart A on Ig, one may use t = p;dq' — p;dq', so 
that the corresponding generating function S is of the first kind. It can be determined 
as follows. Using the simplified notation 


Pi =piotpoal, Pi =Piotpoal, (8.8.2) 
and writing S=So 27!, from (8.8.1) we read off the relations 


as _ os 
Pi = agi’ pe ai (8.8.3) 


taught in the standard course in classical mechanics. Since the functions (8.8.2) are 
determined by @, (8.8.3) is a system of first order partial differential equations for 
a function in the 2n variables q' and q'. This system has a unique solution, up to an 
additive constant. 

By analogy, if the combinations of coordinates given by (q, p), (p,q) or (p, P) 
yield local charts on I, one may work with generating functions of the second, 
third or fourth kind. The corresponding systems of differential equations read, re- 
spectively, 
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. as as 
7 = —, ==, 8.8.4 
q 8 Bi Pi aq! ( ) 

. as as 

ar pj =-—, 8.8.5 
ODi Pi aq! ( ) 

. as F as 
q' =>-—, q' = =>" (8.8.6) 

Opi OPi 


An exhaustive discussion of all possible coordinate systems on I, and thus of all 
generating functions, can be found in the book of Arnold [18]. 
Conversely, given a smooth function S$ in the 2n variables qg' and q' fulfilling 


a? 
det( sai) #0 (8.8.7) 
dq' oq! 


the relations (8.8.3) define 2n functions p; and p; in these variables. By (8.8.7), the 
matrix of partial derivatives a ; is invertible. Hence, the Inverse Mapping Theorem 
implies that the mapping qt p(q, q) can be locally inverted for every fixed q, thus 
yielding a mapping (q, p) + q(q,p). By plugging this into the functions p; we 
finally arrive at a mapping 


(q. P) +> (44. P). PCG, P)) 


which via the chosen coordinates g, p; on M; and g', p; on M2 defines a local 
diffeomorphism @ : M, — M). This diffeomorphism is symplectic: 


(dpi A dg’ — dp; A dq’) 


/, Opi; ; dpi. ; . Opi 
= (agi + gil’ n dg! — ( hdgi + Pag!) adg 
oq/ 0g: 
a*s : a2 BERY ; 
= ———dg/ ) a dg! dg! + 9! | Ady’ 
sina + agiag dq/dq' 0q/ dq! 


Analogously, by interpreting S as a generating function of the second, third or fourth 
kind, one obtains local symplectomorphisms defined, respectively, by the relations 
(8.8.4), (8.8.5) or (8.8.6). 


Remark 8.8.2 The definition of generating function given by (8.8.1) generalizes to 
arbitrary Lagrangian submanifolds (L,i) of a symplectic manifold (M, w). Since 
i*@w = 0, locally there exists a potential t of w fulfilling d(i*r) = 0. Thus, the 
Poincaré Lemma yields an open subset U C L and a function S$: U > R fulfill- 
ing (8.8.1). For the special case of a Lagrangian submanifold of a cotangent bundle 
M = T*Q which is transversal to the fibres, the function S can be viewed as a func- 
tion on Q. This case was dealt with in Proposition 8.3.10. 
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Example 8.8.3 The concept of generating functions of Lagrangian submanifolds is 
useful in thermodynamics: for one mol of an ideal gas, the phase space is the open 
subset R4 C R* with global coordinates p, V, T, S and symplectic form 


w=dV Adp+dT Ads. 


The equations of state 


S 
p-V=R.-T, p:-V” =k-exp—, cy = —— 
cy y-1 
define a 2-dimensional Lagrangian submanifold. All the thermodynamical poten- 
tials are generating functions of this Lagrangian submanifold, see [160]. 


In the remaining part of this section we present some elementary facts concern- 
ing the structure of the group of symplectomorphisms Symp(M, w) of a symplectic 
manifold (M, w). An exhaustive discussion can be found in [206, Ch. 10]. First, we 
equip Symp(M, w) with an appropriate topology, called the C!-topology. To define 
it, recall that the compact-open topology on the space C°(N, P) of smooth map- 
pings from a smooth manifold N to a smooth manifold P is generated by finite 
intersections of subsets VK ,U), where K C N is compact, U C P is open and 
V°(K,U) consists of the mappings y : N > P satisfying g(K) C U. For conve- 
nience, below we will refer to this topology as the C°-topology. Since P, being a 
manifold, is metrizable, this topology coincides with the topology of uniform con- 
vergence on compact sets.!? Now, the C!-topology on C®(N, P) is defined to be 
the initial topology induced by the assignment 


C™(N, P) > C®(TN, TP), greg, 


where C°(TN, TP) is endowed with the C°-topology. That means, it is generated 
by finite intersections of subsets VICK ,U), where K C TN is compact, U C TP 
is open and V!(K, U) is the preimage of V°(K, U) under the above mapping, that 
is, it consists of all mappings gy: N > P satisfying y'(K) C U.'* Since the pro- 
jections TN — N and TP — P preserve compactness and openness, respectively, 
the C!-topology is stronger than the C°-topology. In particular, every C°-open sub- 
set is also C!-open and C!-convergence implies C°-convergence. We note that the 
composition mapping 


CPN, PC (PO) CPN, @), (G.W)r> po, 


'34 sequence of mappings y;: N — P converges to a mapping gy: N — P iff for every compact 
K CN and every ¢ > 0 there exists ng such that sup,,<¢« d(@,(m), p(m)) < € for all n > no; here 
d is some metric on P, compatible with the topology. 


'4quivalently, a sequence {y;} converges to y in the C!-topology iff ¢; converges to g’ in the 
C°-topology on C®(TN, TP). 
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is continuous in the respective C°-topologies. The same is true for the inversion 
mapping y +> y~!, defined on the subset of diffeomorphisms from N to P. Due to 
(vWogy/=w'og’ ond (y~')’ = (g')~!, these mappings are also continuous in the 
C!-topology. Thus, Symp(M, @) is a iepolonical group in both the C°- and the C!- 
topologies. If M is compact, it can be equipped with the structure of a Fréchet-Lie 
group, but this requires the use of a more sophisticated topology, which is beyond 
our scope. We will comment on this below. Rather, as an application of Propo- 
sition 8.6.4 we will show that, in the C!-topology, Symp(M, w) for compact M 
is locally homeomorphic to the space Z!(M) of closed 1-forms on M, endowed 
with the C!-topology induced from C°(M,T*M). This result belongs to Wein- 
stein [303, §6]. By continuity of the group multiplication, it suffices to construct a 
local homeomorphism in a neighbourhood of idy. 


Proposition 8.8.4 (Weinstein) Let (M, w) be a compact symplectic manifold. There 
exists a homeomorphism from an arcwise connected open C' -neighbourhood of idy 
in Symp(M, w) onto an arcwise connected open C!-neighbourhood of the zero 1- 
form in the space of closed one-forms Z'(M). 


Proof Let A: M— M x M, A(m) := (m,m), be the diagonal mapping and let 
pr; : M x M — M denote the natural projections. For g € Symp(M, a), let Ig 
denote the graph of g. Iy is the image of the graph mapping gr, := (idy xg)o A. 

By Proposition 8.8.1, I is an embedded Lagrangian submanifold of M x M for 
all g € Symp(M, w). If g = idy, then Ty coincides with the submanifold (M, A). 
Theorem 8.6.4 implies that there exists a symplectomorphism W from an open 
neighbourhood U of Iq,, = A(M) in M x M onto an open neighbourhood V of 
the zero section sg in T*M, satisfying 


WoA=So. 


Then, W(I),) is a Lagrangian submanifold of T*M for all g € Symp(M, w) such 
that Ty C U. By Proposition 8.3.10, W(Iy) is the image of a closed 1-form on 
M iff it intersects each fibre of T*M transversally and exactly once. Thus, to con- 
struct the desired mapping from symplectomorphisms to closed 1-forms, it suffices 
to show that there exists an open neighbourhood WY of idy in Symp(M, @) in the 
C!-topology such that for all g € Y, the graph y is contained in U and the La- 
grangian submanifold W (I(,) of T*M intersects each fibre of T* M transversally and 
exactly once. For that purpose, let U CT(M x M) be obtained from the preimage 
of U under the natural projection T(M@ x M) > M x M by removing the preimage 
under ’ of the vertical distribution on T* M. This is an open subset of T(M x M). 
Since M is compact, TM contains a compact subset K which generates TM un- 
der scalar multiplication by real numbers and does not intersect the zero section 
(e. Bo the unit sphere bundle with respect to some Riemannian metric on M). Then, 
V'(K,U) is an open neighbourhood of A in C°(M, M x M) in the C!-topology. 
Define 


U= {y € Symp(M, w) : gry € V1(K,U)}. 
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Since the assignment g > Eto is the restriction to {A} x {idy} x Symp(M, w) of 
the mapping 


C™’(M,M x M) x C®(M, M) x C®(M, M) > C®(M x M, M x M), 
(W1, ¥2, Wa) > (2 x Wa) own, 


it is C!-continuous. Hence, Y is an open neighbourhood of idy in Symp(M, w) 
in the C!-topology. By construction, for all g € Y, the graph I y is contained in U 
and W(I,) is transversal to the fibres of T*M. Since M is compact, it is clear that 
by possibly shrinking @% we may achieve that Y(I7,) intersects the fibres at most 
once. Then, the mapping z 0 W o gry, with the canonical projection z : T*M— M, 
is an injective immersion, and hence an embedding of M into M. Using the theorem 
on invariance of domain, stated in Footnote 38 on page 159, one can show that the 
image of this embedding must be M (Exercise 8.8.3). Therefore, YW (I(,) intersects 
each fibre of T*M at least once, and hence exactly once. Thus, WY has the desired 
properties. Now, for every g € Y, Proposition 8.3.10 yields a closed 1-form @ on 
M such that YW (I) = a(M). Explicitly, a is given by 


a=Wogroa! with A:= 7 0 W ogry. 


The assignment ¢ +> a is C!-continuous, because it decomposes into a sequence 
of composition and inversion mappings. To prove that it can be made into a homeo- 
morphism, we show that there exists an open neighbourhood ¥ of the zero section 
in Z'(M) in the C!-topology such that, for alla € Y, a(M) C V and W—!(a(M)) 
intersects each of the submanifolds {m} x M and M x {m}, m € M, transversally 
and exactly once. Y can be constructed in the same way as WY, one just replaces 
Symp(M, w) C C®(M, M) by Z'(M) C C®(M,T*M), by a, gry by YW! oa 
and the preimage of the vertical distribution of T*M under W’ by the two distribu- 
tions kerpr,, i = 1,2. Then, for all a € Y, the mappings pr; o YW! oa:M>M 
are bijective immersions and hence diffeomorphisms. Define 


1 with yo := pr, 0 Wo. 


g:=proW |! oaop 
Since the assignment a +> g decomposes into a sequence of composition and inver- 
sion mappings, it is C!-continuous. A straightforward calculation shows that this 
assignment is inverse to the assignment g +> a@ constructed above. Finally, we inter- 
sect Y with the image of VY and Y with the image of Y. Since Z!(M) is locally 
arcwise connected, the neighbourhoods so obtained can be shrunk so that they be- 
come arcwise connected. 


Proposition 8.8.4 yields 


Corollary 8.8.5 The group of symplectomorphisms of a compact symplectic mani- 
fold is locally arcwise connected. 
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Remark 8.8.6 Since the group multiplication in Symp(M, @) is C!-continuous, 
Proposition 8.8.4 provides an atlas modelling Symp(M,@) for compact M on 
the infinite-dimensional vector space Z!(M) = X_y(M, w) endowed with the C!- 
topology.!> However, since X_y(M, w) is not complete in this topology, this does 
not supply a differentiable structure on Symp(M, w). To obtain completeness and 
thus a differentiable structure one can either enlarge the spaces by passing to forms 
and symplectomorphisms of the differentiability class C!. Then, Symp(M, w) is a 
smooth manifold modelled on the Banach space of closed 1-forms on M of class C!. 
In this setup, the multiplication mapping in Symp(M, q) turns out to be continuous 
but not differentiable [263, Thm. 2.1]. Or one can construct appropriate topolo- 
gies, thus turning X_H(M, o) into a Fréchet space!® and Symp(M, ) into a smooth 
manifold modelled on X_y(M, w) [262], see also [82, 213, 230]. In this structure, 
the multiplication and inversion mappings are smooth and hence Symp(M, w) is 
an infinite-dimensional Fréchet Lie group.!’ The corresponding Lie algebra can be 
naturally identified with X_y(M, w) with the ordinary commutator of vector fields 
and the exponential mapping being given by 


exp: XLy(M, w) > Symp(M, w), exp(X) := DX. (8.8.8) 


In this sense, independent of topologies or differentiable structures, one may speak 
of XLH(M, w) as the Lie algebra of Symp(M, w). We note that, in contrast to the 
finite-dimensional case, the exponential mapping (8.8.8) is not a local diffeomor- 
phism between neighbourhoods of the origin in X_y(M, @) and the unit element of 
Symp(M, w), see [99] and [230]. 


Let us denote the arcwise connected component of Symp(M, w) containing idy 
by Sympp(M, w). This is a normal subgroup of Symp(M, w). Note that if M is 
compact, Corollary 8.8.5 implies that this is also the connected component of idy, 
because in a locally arcwise connected space, the arcwise connected components 
coincide with the connected components. By definition, for every g € Sympp(M, w) 
there exists a C!-continuous curve ® : [0, 1] > Sympyp(M, w) such that py = idy 
and ®; = g. It is not hard to see that ® can be chosen so that the induced homotopy 
@® : [0,1] x M — M is smooth. Since the latter runs through the diffeomorphisms 
of M, it is commonly referred to as a smooth isotopy. According to Remark 3.4.5/2, 
@ is the flow of the time-dependent symplectic vector field X defined by 


d 
a @,oP_:(m), meM, teE[0, 1]. (8.8.9) 


'SSince M is compact, the C!-topology on X.y(M, @) allows for a norm, defined by taking the 


maximum over the usual C!-norms of the local representatives of a in some chosen finite atlas, 
see e.g. [233, 234]. 


‘6A complete metrizable locally convex vector space. 


'74 Lie group modelled on a Fréchet space. 
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By Proposition 8.2.4/4 and Remark 8.2.5/2, X is locally Hamiltonian for every f. 
This observation motivates the study of isotopies generated by time-dependent vec- 
tor fields for which X is Hamiltonian for all tf. 


Definition 8.8.7 (Hamiltonian diffeomorphism) Let (M, w) be a symplectic mani- 
fold. 


1. A smooth isotopy @ : [0,1] x M— M in Diff(M) with associated time- 
dependent vector field X is called Hamiltonian if there exists a smooth function 
H :[0,1] x M— R such that X; = Xy, for all t. We say that ® is generated 
by H. 

2. A diffeomorphism g of M is called Hamiltonian if there exists a Hamiltonian 
isotopy from idjy to g. The set of Hamiltonian diffeomorphisms is denoted by 
Ham(M, w). 


Obviously, every Hamiltonian diffeomorphism is a symplectomorphism. In fact, 
we have 


Proposition 8.8.8 Ham(M, w) is a normal subgroup of Sympo(M, @). 


Proof Let ® and W be Hamiltonian isotopies, generated by the time-dependent 
Hamiltonians H and K, respectively. Using (8.2.4), we compute 


d 
2 Pin) = (Xk, + (W)«XH,) (Yo ®;(m)) = X W, 0 D;(m)). 


K,+H,ow,! ( 
Hence, (t,m) +> Wo ®;(m) is a Hamiltonian isotopy, generated by the Hamiltonian 
(t,m) > K;,(m) + H, oY; |(m). Similarly, one shows that (t,m) > ®;!(m) and 
(t,m) +> go ®,o¢g~!(m), for any symplectomorphism g, are Hamiltonian isotopies 
generated by the time-dependent Hamiltonians (¢, m) t+ — H;0 ®,(m) and (tf, m) bh 
Hy o g ! (m), respectively (Exercise 8.8.4). 


As a consequence of Proposition 8.8.8, one has the following sequence of sub- 
group inclusion mappings: 


Ham(M, w) > Sympo(M, w) > Symp(M, w) > Diff" (M) — Diff(M), 
(8.8.10) 
with Diff* (M) denoting the group of diffeomorphisms which preserve the orienta- 
tion defined by the natural volume form of w. This sequence is studied in symplectic 
topology. 
Next, we show that the local homeomorphism of Proposition 8.8.4 maps Hamil- 
tonian isotopies to families of exact 1-forms. We start with a lemma on exact sym- 
plectic manifolds (which by Remark 8.1.4/2 are necessarily noncompact). 


Lemma 8.8.9 Let (M,do) be an exact symplectic manifold. An isotopy ® 
in Diff(M) with ®j) = idy is Hamiltonian iff there exists a smooth function 
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f : [0,1] x M — R such that for every t € [0, 1], 
O*o —o =A fy. (8.8.11) 


Proof Let X be the time-dependent vector field associated with ®, cf. Re- 
mark 3.4.5/2. Using (4.1.28), we compute 


d 
ree = OF Lx,0 = OF (Xpado + d(X;40)). (8.8.12) 


If ® is Hamiltonian with generating time-dependent Hamiltonian H, this implies 


d 
ae = OF (d(Xy,10) — dH;). 


Integration of these two families of 1-forms yields (8.8.11) with 


f= [0 — Mo ®,)dt. 


Conversely, if f is given such that (8.8.11) holds, for every t we define 


H, := X;10 — Caueaa! 


This yields a smooth function H : [0,1] x M — M. A straightforward calculation 
using (4.1.30) and (8.8.12) shows that X 7, = X; for all t. Hence, ® is Hamiltonian 
with generating time-dependent Hamiltonian H. 


Proposition 8.8.10 Let (M, w) be a compact symplectic manifold. The local home- 
omorphism of Proposition 8.8.4 maps Hamiltonian isotopies of M to smooth fami- 
lies of exact 1-forms on M and vice versa. 


Proof We adopt the notation of the proof of Proposition 8.8.4. According to that 
proof, for every smooth isotopy ® in Y there exists a smooth family {A,} of diffeo- 
morphisms of M and a smooth family {a;} of closed 1-forms on M such that 


Wo grg, =a; 0A, 


for all r.'° Denote the canonical 1-form on T*M by 6. Since ¥*6 is a potential for 
@ on U C M x M and since idy x@ is a smooth isotopy of the exact symplectic 
manifold (U, w), Lemma 8.8.9 yields that idjy x ®, and hence @, is Hamiltonian 
iff there exists a smooth function f : [0, 1] x M x M — R such that for all t 


(idy x®,)* 0 W*(6) —W*O =dfy. 


'8With ®, playing the role of in the proof of Proposition 8.8.4. 


412 8 Symplectic Geometry 
Applying A* to this equation and using A* o W*(@) = sj = 0, we obtain 

gr, oW* (0) = A* 0 (id x B;)* 0 W* (0) =d(fr 0 A). 
By the defining property (8.3.2) of 6, the left hand side yields 


gry, oW* (0) = Aj (a7) = AF, 


so that a, = d(f;o Ao aa Thus, ® is Hamiltonian iff a; is exact for all t. 


As a consequence of Proposition 8.8.10, if the first de Rham cohomology group 
H'(M) of M is trivial, Ham(M, @) coincides with Sympo(M, w), that is, in this 
case, the first inclusion mapping in the sequence (8.8.10) is surjective and, by Re- 
mark 8.8.6, the Lie algebra of Ham(M, w) may be identified with Xyq(M, w). For 
an arbitrary compact manifold, the situation is more complicated. The homeomor- 
phism of Proposition 8.8.4 need not always map Hamiltonian diffeomorphisms to 
exact 1-forms, no matter how C!-close to idy they are. Rather, one can show that a 
symplectomorphism ¢ in the domain of this homeomorphism is Hamiltonian iff the 
cohomology class of its image belongs to a certain countable subgroup of H!(M), 
called the flux group [206, Lemma 10.16]. This group is the image of the fundamen- 
tal group of Sympg(M, w) under the so-called flux homomorphism which assigns 
to the homotopy class of a closed curve in Sympp(M, w), represented by a symplec- 
tic isotopy ®, the cohomology class of the closed 1-form [(X;.)dt. Here, X is 
the time-dependent vector field associated with ® via (8.8.9), the integral is defined 
pointwise and the resulting 1-form is indeed closed due to (4.1.30). For details, like 
the proof that this mapping is well-defined and a group homomorphism, see [206, 
Chap. 10] or [242, Chap. 14]. As a consequence of the facts that the homeomor- 
phism of Proposition 8.8.4 maps Hamiltonian diffeomorphisms to the flux group 
and that the latter is countable, one obtains 


Proposition 8.8.11 Jf M is compact, every smooth isotopy in Ham(M, w) is Hamil- 
tonian. 


Proof See (206, Prop. 10.17]. If the first de Rham cohomology group of M is trivial, 
the assertion follows from Proposition 8.8.10 (Exercise 8.8.5). 


Remark 8.8.12 Let (M, @) be a compact symplectic manifold. 


1. On the basis of Proposition 8.8.11, one may give the following intuitive argu- 
ment showing that the tangent space at idjy of Ham(M, ) is given by Xy(M, ow) 
[242, §1.4]: every curve in Ham(M, w) through idy is a Hamiltonian isotopy 
and hence its tangent vector at idjy is a Hamiltonian vector field. Conversely, 
the flow of any Hamiltonian vector field yields a Hamiltonian isotopy and thus a 
curve in Ham(M, w) through idy. Thus, according to Remark 8.8.6 and by anal- 
ogy with the finite-dimensional situation of Proposition 5.6.5, we conclude that 


8.8 Symplectomorphisms and Generating Functions 413 


Ham(M, w) is the Lie subgroup of Symp(M, @) associated with the Lie subal- 
gebra Xq(M, w) of XL_H(M, w). It was proved by Ono [231] that the flux group 
is discrete.!? Thus, one may shrink the C!-neighbourhood of idy of Proposi- 
tion 8.8.4 so that all Hamiltonian diffeomorphisms in that neighbourhood are 
mapped to exact forms. Hence, for compact M, Ham(M, w) is a C!-closed Lie 
subgroup of Sympo(M, w). 

2. If M is connected, the Hamiltonian function corresponding to a Hamiltonian 
vector field is unique up to a constant. One way to obtain uniqueness consists 
in considering the space </(M) of smooth functions on M with zero mean?” 
with respect to the canonical volume form S§2 of w, see [242]. This is a Poisson 
subalgebra of C°°(M) (Exercise 8.8.6). Thus, in this case the Lie algebra of 
Ham(M, w) can be identified with </(M). From the proof of Proposition 8.8.8 
we read off that the adjoint action of Ham(M, w) on &@(M) is given by 


Ham(M,) x &(M)> @(M), — (@, fy fo}, (8.8.13) 


The following fundamental algebraic statements about Ham(M, w) were proved 
by Banyaga [30, 31]. 


Proposition 8.8.13 (Banyaga) 


1. If the symplectic manifold (M, w) is compact, Ham(M, w) is a simple”! group. 

2. If two symplectic manifolds (M,,@,) and (M2, 2) have isomorphic groups of 
Hamiltonian symplectomorphisms, there exists a diffeomorphism ®: M, —> M2 
and a number c #0 such that ®*w2 = ca. 


The second statement says that the algebraic structure of the group of Hamilto- 
nian diffeomorphisms determines the symplectic structure up to a constant factor. 


Remark 8.8.14 In case M is not compact, the above results on Symp(M, w) and 
Ham(M, w) for compact M carry over to the group Symp‘ (M, @) of compactly sup- 
ported symplectomorphisms~* and the subgroup Ham‘(M, w) of elements which 
can be joined to idy by a Hamiltonian isotopy whose generating time-dependent 
Hamiltonian has compact support. Here, the topology is defined to be the induc- 
tive limit, taken over the directed set of compact subsets K of M, of the topologies 
on the subgroups of symplectomorphisms with support in K. Correspondingly, one 
has to use compactly supported forms. Thus, in particular, Symp°(M, @) is locally 
homeomorphic to the space of compactly supported closed 1-forms and its Lie al- 
gebra consists of compactly supported symplectic vector fields. The Lie algebra of 


!°This was the affirmative answer to the so-called flux conjecture, see also [179]. For noncompact 
M, the flux group need not be discrete, see [205]. 


That is, fy f 2 =0. 
214 group G which does not contain normal subgroups besides {11} and G. 


?2That is, symplectomorphisms which outside a compact set coincide with the identical mapping. 
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Ham‘ (M, w) coincides with the space of compactly supported Hamiltonian vector 
fields. It can be identified with the space .</°(M) of compactly supported functions 
on M with zero mean. For details we refer to [206] and [242]. An exhaustive treat- 
ment of the case M = R2” can be found in [139]. 


We conclude this section with a remark on the geometric structure of Ham(M, w). 
This structure will show up again in the next chapter. Assume that M is noncompact 
and consider the group Ham‘ (M, w) of compactly supported Hamiltonian symplec- 
tomorphisms. In 1990, Hofer [134] observed that there exists a norm on the Lie 
algebra </°(M) of Ham‘(M, w). It can be defined by 


|| 7 || := sup H(m) — inf H(m). (8.8.14) 
meM meM 
This norm is invariant under the adjoint action (8.8.13). The corresponding length 
function for a smooth Hamiltonian isotopy ® in Ham‘(M, w) generated by the time- 
dependent Hamiltonian H is given by 


1 
(@)= [Wit 
0 
and the distance between two Hamiltonian diffeomorphisms ¢ and yf is defined by 


p(y, wv) = inf(l()), 


with the infimum taken over all Hamiltonian isotopies ® fulfilling ®) = 
and ©; = y. It is easy to show that p defines a bi-invariant pseudo-metric on 
Ham‘(M, w), but it is hard to prove that p(y, ¥) = 0 implies g = w. See [134] 
for the case M = R?” and [178] for the general case. For a further discussion of 
the Hofer metric we refer to the book of Polterovich [242]. The case M = R2” is 
dealt with in great detail in the book of Hofer and Zehnder, see [139]. The Hofer 
norm plays a fundamental role in symplectic topology and in the study of global 
existence questions in the theory of Hamiltonian systems. In particular, it gives rise 
to a certain symplectic invariant, the so-called Hofer-Zehnder capacity, which turns 
out to be one of the basic tools for studying the Weinstein conjecture, see Sect. 9.4. 


Exercises 
8.8.1 Let (Mj, 1) = (R x Ry, dp A dq) and (M>, w2) = (R? \ {0}, dp A dg) and 
let 6: Mj > M2, P(g, p) = (G, p) be given by 


a ee - | po 
q =,/ —sin(27q), p= ,/— cos (27q). 
ae) 0 


(a) Show that the restrictions of ® to the subsets (x,x +1) x Ry CMj,x€ 
R, are symplectomorphisms onto their images. Determine these images. 

(b) Find the generating function of the first kind for the restriction of @ to 
(0, 1) x Ry and the generating function of the second kind for the restric- 
tion to (-3, 5) x Ry. 
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8.8.2 Prove that the equations of state of an ideal gas define a Lagrangian subman- 
ifold of the symplectic manifold postulated in Example 8.8.3. 

8.8.3 Use the theorem on invariance of domain, stated in Footnote 38 on page 159, 
to prove that every embedding of a compact manifold into another compact 
manifold of the same dimension and the same number of connected compo- 
nents is surjective. 

8.8.4 Complete the proof of Proposition 8.8.8. 

8.8.5 Use Proposition 8.8.10 to prove Proposition 8.8.11 under the assumption that 
the first de Rham cohomology group of M is trivial. 

8.8.6 Let (M, w) be a symplectic manifold. Show that the space .e7(M) of smooth 
functions on M with zero mean with respect to the canonical volume form of 
w is a Poisson subalgebra of C°(M). 


8.9 Elementary Morse Theory 


In this section, we will discuss some elements of Morse theory, which will be used 
later on. Let f : M — R be a smooth function. Recall that m € M is a critical point 
of f iff 


df(m) =0. 
Let M+ be the set of critical points of f. The image df(M) of the 1-form df and 


the image so(M) of the zero section so in T* M are embedded submanifolds of T* M 
which intersect over every critical point. 


Definition 8.9.1 (Morse function) A smooth function f: M — R is called a Morse 
function if df(M) and so(M) are transversal, that is, if 


Tg (T*M) = Te (df (M)) + Te (so(M)) (8.9.1) 
for all € € df (M)N so(M). 


By Corollary 1.8.5, for a Morse function f the intersection df(M) 1M so(M) is 
an embedded submanifold. Since df (M) and so(M) are Lagrange, it has dimension 
zero. Thus, for a Morse function f all critical points are isolated. In particular, if 
M is compact, the number of critical points of a Morse function is finite. Moreover, 
for every critical point m, the tangent spaces of df(M) and so(M) at so(m) are 
complementary Lagrangian subspaces of Ts) ¢m)(T* M). 

Let us derive a criterion for transversality. According to Proposition 8.3.12, for 
every m € My, the zero section so induces a natural symplectomorphism 


P5,(m): Tsym) T° M > TM @T;,M. (8.9.2) 


Let pr.: TM ®T;, M > TM be the projection onto the second component of the 
direct sum. The tangent mapping 


‘cha TnM > Tsy(m)T*M 
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induces a linear mapping 
Hess» (f): Tn M > T3,M, Hessyn (f) := prz oPs_(m) © Gis (8.9.3) 
This mapping or, equivalently, the corresponding bilinear form 
Hessm(f): TmM x TnM—>R 


is called the Hessian of f at the critical point m. Obviously, df(M) and so(M) are 
transversal at a critical point m € My iff (df)/,, has maximal rank, that is, iff the 
Hessian of f is non-degenerate at that point. Such a critical point is called non- 
degenerate. 

Let X, ¥Y € T,,M. Using ®sy(m) 0 (so)),, (Y) = (Y, 0) and that 55m) is symplectic, 
we obtain 


Hessm(f)(X, Y) = soa) (CAA )in (X) (50)in Y))- (8.9.4) 


Since the images of df and so and the fibres of T*M are Lagrange, we have 


som) (AA Vin (X), (S0)in Y)) = spiny (CAL Yn Ys (50) in (X)), 
that is, the Hessian is symmetric. In a local chart (U, «) at m, we obtain 


a°(f ox!) 


Hessm(f)(0;, 9j) = (09; f)(m) = 9 (k(m)). 


xlaxJ 

By the Theorem of Sylvester, for every symmetric bilinear form H on a vector 
space V there exists a maximal subspace of V on which H is negative definite. The 
dimension of this subspace is called the index of H. 


Definition 8.9.2 (Morse index) Let f: M — R be a Morse function and let m € 
My. The index ind ¢(m) of f at m is defined as the index of Hess, (f). 


Theorem 8.9.3 (Morse Lemma) Let f: M — R be a smooth function and let mo € 
Mf be anon-degenerate critical point. Then, there exists a local chart mapping mo 
to 0 such that the local representative of f is given by 


1 
f@m=fO+ 5 Hesso(f) (x, X). (8.9.5) 
The following proof is due to Palais [235]. We follow the presentation in [32]. 
Proof As in the proof of the Darboux-Theorem and its generalizations, we use the 


deformation method of Moser. Since the statement is local, without loss of general- 
ity, we may assume M = R”, mo = 0 and f (mo) = 0. Denote 


1 
Wx) = sHesso( fx), fr =tf + (Oh. 
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We seek a time-dependent vector field X whose flow ©, = @; 9 satisfies OF f, =h. 
Then, ®, : yields the desired local chart. Differentiating this equation with respect 
to t, we obtain 


d * * * d * 
rr fi =; (X(f;)) +, (5) = @, (X (fi) +f- h) = 0. 


Hence, X; must satisfy 


Xi(frHh—f. (8.9.6) 
The left hand side of this equation can be rewritten as follows: expand X; = X! a 
and integrate the equality 
0° f; 
i 


ax! dx/ (sx)x! 


d xi oft _ xi 
ds 1) 27 GX) = 7 (X) 


with respect to s from 0 to | to obtain 


ofr 


Oxi dx 


1 
(Xi(fr)) ®) = Bi OX; (xx, Bisco =f (sx)ds. 


For the right hand side of (8.9.6), Taylor’s Theorem yields 


(h — f)(x) = Fij(x)x'x/ 


on some neighbourhood U of the origin, where Fj; are smooth functions on U. 
Thus, (8.9.6) holds on U if the functions X; satisfy 


Bi, (x) Xi (x) = Fij(x)x' 
for all 7. Since 
a” fi 
axtdaxd 
is invertible for all t, and since [0, 1] is compact, by possibly shrinking U we can 
achieve that Bi (x) can be inverted for all x € U and ¢ € [0, 1] and thus obtain the 
desired vector field X satisfying (8.9.6). Finally, since X;(0) = 0, and again since 


[0, 1] is compact, by possibly further shrinking U we can achieve that U is contained 
in the domain of ©; for all t € [0, 1]. This completes the proof. 


Bj,(0) = (0) = Hesso(f)ij 


By successively diagonalizing the symmetric bilinear form Hesso(f), the local 
representative of f can be brought to the following canonical form (Exercise 8.9.1): 


Corollary 8.9.4 Let f: M — R be a smooth function, let mo € My be a non- 


degenerate critical point and let i be the index of f at mo. Then, there exists a 
local chart mapping mo to 0 such that the local representative of f is given by 


1 i 
FO) = FO — s(x ++ +.47) + sitet). (8.9.7) 


418 8 Symplectic Geometry 


Remark 8.9.5 The set of Morse functions is open and dense in the space of smooth 
functions [212, $6]. In particular, Morse functions exist on every manifold. 


Next, we will see how the critical points of a smooth function f: M — R are 
related to the topology of the subsets 


M4 := f-'(-co,a]={me M: f(m) <a} (8.9.8) 
and 
Mab] -— f-'la, bl={meM:a< fim) <b}. (8.9.9) 


If a is a regular value of f, the Level Set Theorem yields that f—!(a) is an em- 
bedded submanifold of M of codimension 1. Moreover, f~!(—oo, a) is an open 
submanifold of M. Using local charts on M mapping f~!(a) to the subspace 
{0} x IR"! and local charts on f~!(—oo, a) taking values in R”, one can con- 
struct an atlas on M“ with values in R” and thus show that M@ is a manifold with 
boundary, where the boundary is given by f~!(a). Analogously, M!4-?! is a mani- 
fold with boundary, provided neither a nor b are critical values of f, and the bound- 
ary is given by f—!(a) U f~!(b). If M is compact, it is reasonable to imagine f 
as a height function: while the parameter runs through the real numbers (starting at 
—oo), M® grows (starting from the empty set) to the whole manifold. Using meth- 
ods of Morse theory, it is possible to describe this process of growing. The simplest 
statement of this type is 


Proposition 8.9.6 (Morse Isotopy Lemma) Let M be a smooth manifold and let 
f: M— R be a smooth function. Let a < b and assume that M'"! is compact and 
does not contain any critical point of f . Then, M¢ is diffeomorphic to M°. 


Proof Choose a Riemannian metric g on M and consider the gradient vector field 
Vf of f, defined by 


Vf:=¢ 'o(df), 
cf. Formula (4.5.14). Then, 


avi, X)=xX(f) (8.9.10) 


for all X € X(M). Since [a, b] does not contain any critical value, V f is nowhere 
vanishing on M!@-°!, This remains so on some neighbourhood U of M!4-! in M. 
Since M!@-’! is compact, U can be chosen to have compact closure. On U, we can 
define 


Xn = V f(m) 
|| Vf (m) ||? 


(8.9.11) 
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To extend X to a vector field on M, we choose a smooth function x:M->R 
satisfying X}ygta.o) = 1 and X}(m\v) = 0 and define a vector field Y on M by 


x(M)Xm mew, 
Ym i= 
0 mé€U. 


Since U has compact closure and hence Y has compact support, the flow ® of Y is 
complete. Using (8.9.10), for m € M'%?! we compute 


d(f o ®;)(m) 
dt 


Thus, f(®;(m)) = f(m) +t. We conclude that a): M > M is a diffeomor- 
phism mapping M“ onto M?. 


= Yn(f) = Xm(f) = 1. 


Remark 8.9.7 By means of the flow ® constructed in the proof of Proposition 8.9.6, 
one can define a mapping 


me M*, 


h:[0,1]x M’ > m?, h(t,m) := re 
Di (a— f (my) (M) me M°’\ M*. 


This mapping is continuous and satisfies 
A(O, -) =id ys, h(1,M°) c M*, h(t, -)yma = idye. 


One says that h is a strong deformation retraction from M? to M“@ and that M“ isa 
strong deformation retract of M?. 


Next, we discuss the question how the topological structure of M? differs from 
that of M@ if a and b are regular values and M!“-’! contains a single critical point. 
In general, this question can be answered only up to homotopy-equivalence, that is, 
the best one can say is how the homotopy type of M? differs from that of M“: 


Proposition 8.9.8 Let M be a smooth manifold and let f : M — R be a smooth 
function. Let a < b be regular values of f and assume that M'®?! is compact and 
contains a single critical point m of f . Let i := indm(f ). Define subsets M® and M” 
of M as follows: choose a local chart (U, k) at m such that the local representative 
of f has the canonical form given in Corollary 8.9.4, choose ¢ > 0 such that k(U) 
contains the ball of radius 2¢ about the origin and let 


Wye —e* b._ -l(,i 

Mt := MIM" Mf? = M* UK" (e5,), 
where 

eb, = {xe R": [[xll <2e, x41 =---= 1 =O}, 


see Fig. 8.1. Then, M® is homotopy-equivalent to M¢ and M? is homotopy- 
equivalent to M® (in the relative topology induced from M). 
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Fig. 8.1_ Construction of the (X41, ---;%n) 
subset M? in Proposi- 

tion 8.9.8 as the union of the 
subset MM, labelled by A, and 
the i-cell e',,, labelled by C. 
B denotes the ball of radius 
2e about the origin 


45) 


Proof See [212], Theorem 3.2. 


The set ed. is called a cell of dimension i and the process of passing from M¢ 
to M? is commonly referred to as attaching a cell of dimension i. While M¢ is 
a manifold with boundary, M? need not be so. It is however obvious that the cell 
é. may be thickened and M? may be smoothened so as to render a manifold with 
boundary. 

In case M is compact and hence every smooth function is bounded, Propo- 
sitions 8.9.6 and 8.9.8 can be used to construct a topological space which is 
homotopy-equivalent to M as follows. Choose f so that for every singular value 
there is exactly one critical point. Denote the singular values by a; with aj <---< 
a,. Starting at a = —oo with the empty set, successively attach an appropriate cell 
each time a passes a singular value. In more detail, since a; is a minimum, we must 
take a zero-dimensional cell «. For aj <a < ay, each M“ is homotopy equivalent 
to this cell. When passing the singular value a2 we must attach a cell of dimension 
equal to the index of the corresponding critical point by identifying the boundary of 
the closure of this cell with *. Continuing this procedure, we obtain a topological 
space which is called a cell complex, or more precisely a CW-complex, see [55]. In 
the last step, since a, is a maximum, we have to attach a cell of the dimension of M. 
Using this construction, one can show that every compact manifold is homotopy- 
equivalent to a cell complex with one cell in dimension i for each critical point 
of index i of some Morse function on M. This result carries over to noncompact 
manifolds. For a thorough discussion, see [212], $3 and §6. 


Example 8.9.9 We consider an upright 2-torus T with the height function f as 
shown in Fig. 8.2, thereby leaving the necessary computations to the reader (Exer- 
cise 8.9.3). There are four critical values a),...,a4, corresponding to four critical 
points m,,...,m4. The critical point m , is a minimum and hence has index 0, m2 
and m3 are saddle points and hence have index | and my is a maximum and hence 
has index 2. Thus, starting with a single point (0-cell), first we have to attach a 1- 
cell, then once again a |-cell, and finally a 2-cell, see the figure. The way how each 
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* O09CSoOSs 


attaching cells 


[> 


homotopy type C) O 


aj<a<a az <a<a3 a3 <<a<a4 a4<a 


Fig. 8.2. The subsets M,, their homotopy types in terms of cell complexes and the operations of 
attaching cells for T?, cf. Example 8.9.9 


of these cells is attached to the cell complex obtained before follows from the way 
this cell is attached to the corresponding subset M, according to Proposition 8.9.8. 


Remark 8.9.10 


1. 


In some situations, the indices of the critical points of a Morse function contain 
more information than just the homotopy type of M. For example, the Reeb 
Theorem states that if M is a compact manifold and if it admits a Morse function 
with exactly two critical points, then M is homeomorphic to the sphere S!™™, 
see e.g. [212], §4. It need not be diffeomorphic to S?™™, though. 


. By means of a so-called Morse-Smale pair~* ( f, g) one can construct a homology 


theory for M, called Morse homology, see for example [51] or [146]. The chains 
of this homology theory are generated by the critical points of f, ordered by their 
index, and the boundary operator assigns to a critical point mo of index i the sum 
over all critical points m of index i — 1, weighted with the signed number of flow 
lines between mo and m. It can be shown [146] that Morse homology coincides 
with singular homology.”* In particular, it does not depend on the chosen Morse- 
Smale pair (f, g). These facts indicate that Morse theory yields a mighty tool 
for investigating the topology of manifolds. For an axiomatic approach to Morse 
homology in the sense of Eilenberg and Steenrod we refer to [268]. 


. As a consequence of either the construction of a homotopy-equivalent cell com- 


plex for M or of the equivalence of Morse homology and singular homology, 
one obtains the Morse inequalities. Let bj; = dim H'(M) be the Betti numbers 


3That is, f is a Morse function and g is a Riemannian metric on M such that for every pair of 
critical points m,, m2 the stable manifold of m, with respect to the gradient vector field V f is 
transversal to the unstable manifold of m2. 


4 See e.g. [55] for this notion. 
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of M, see Sect. 4.3, and let m; be the number of critical points with index 7. Let 
n= dim M. Then: 


mj = bj, 


mg — my +--+ (—1)hing > by — by +++ + (Do, 


Dein = VOC 'bi. 
i=0 i=0 


. There are interesting applications of Morse theory in the theory of Lie groups 


and symmetric spaces, see [97] and [212]. As an example, consider a classical 
Lie group G C M,,(K), with K = R, C or H. Endow M,,(K) with the standard 
scalar product 


(A, B) = Retr(A‘B). 
Every A € M, (KK) defines a function on G by 
fa: GR, fa(a) := Retr(Aa). (8.9.12) 


One can show that if A is diagonal with pairwise distinct entries, f4 is a Morse 
function. It is not hard to compute the critical points and their indices. It is also 
interesting to compute the critical set for the case A = 1, see Exercise 8.9.4. 


In applications, e.g. in the theory of systems with symmetries, one often faces the 


situation that one has to work with a restricted class of functions which cannot be 
assumed to have isolated critical points. The following definition yields a reasonable 
generalization to this case: 


Definition 8.9.11 (Morse-Bott function) Let M be a smooth manifold. A smooth 
function f: M — R is called a Morse-Bott function if the following conditions are 
fulfilled: 


1. 


The critical set My is a disjoint union of connected embedded submanifolds, 
called critical submanifolds. 


. For every critical submanifold N and every m € N, the bilinear form induced by 


Hess,,(f) on the normal space Ny,N = T,M/TinN is non-degenerate.”> 


As an example, we consider a torus lying on a plane. Its height function is obvi- 


ously a Morse-Bott function with two critical submanifolds: a circle, on which the 
height function is maximal, and a circle in the plane, on which it is minimal. 


As before, the index of a critical point m € N is defined as the index of Hess, (f). 


Since N is connected, the rank of Hess,,(f) is constant along N. Hence, the index 


25 This is equivalent to the condition ker(Hess(f)) = Tn. 
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depends only on the submanifold N. Since single points are embedded submani- 
folds, every Morse function is a Morse-Bott function. The Morse Lemma general- 
izes to Morse-Bott functions as follows: assume that dim M =n, dim N =d and 
that the index of N is i. Then, for every mo € N, there exists a local chart of M at 
mo with coordinates x!,...,x? and y!,..., y"~@ such that N is given locally by 
y = 0 and the local representative of f has the form 


1 1 
FEWN=FN)— sitet) +50 te + ya) 89-13) 
By a diagonalization procedure similar to that used for Corollary 8.9.4, this will 
follow from 


Theorem 8.9.12 (Morse-Bott Lemma) Let M be a manifold of dimension n, let f 
be a Morse-Bott function on M and let N be a critical submanifold of f of dimen- 
sion d. For every mo € N, there exists a local chart of M at mo with coordinates 
x!,...,x4 and y!,..., y"~“, mapping points of N to R@ x {0} C R", and a non- 
degenerate quadratic form Q on R"~4 such that the local representative of f is 


given by 
f(x,y) = f(N) + Q(y). (8.9.14) 


Proof We follow [32]. Denote r =n — d. Since N is embedded, we find a local 
chart (U,«) of M at mo such that k(U NN) C R? x {0} C R”. Thus, without loss 
of generality, we may assume that M = R” = R¢? x R’ and N = R?. We write 
(x, y) for the points of M. By replacing f by f — f(0), we may also assume that 
f(N) = 0. With respect to the decomposition R” = R¢ © R’ of the tangent spaces, 
the Hessian at (x, 0) € N takes the form 


0| 0 
Hessyx,0)(f) = O0JQ(x) | 


where Q(x) is an invertible symmetric matrix of dimension r, depending smoothly 
on x. We seek a diffeomorphism ® defined on some open neighbourhood of the 
origin of R” such that 


(* f)(xy) = 5QOizy'y! 
on that neighbourhood. In the first step, we define 
h(x, y) = SQ@iiy'y, fr=tft+(—oh. 
We leave it to the reader to check that if we carry out the construction of the time- 


dependent vector field X from the proof of Theorem 8.9.3 for every fixed x, we 
obtain an open neighbourhood U of N in M and a smooth vector field X on U 
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which is tangent to the affine subspaces {x} x IR” and whose flow ®, = ®; 9 satisfies 


1 — 
(Pf) y) =h& y) = 5QWijy'y! 


for all (x, y) € U. In the second step, we construct a family of linear transforma- 
tions of R” transforming Q(x) to Q(0). Since Q(O) is symmetric, up to a linear 
transformation of R” we may assume that Q(O) is diagonal, with diagonal entries 
G1>-++54r- Since Q(O) is invertible, all g; are nonzero. Thus, the diagonal entries of 
Q(x) remain nonzero and keep their sign in an open neighbourhood V of the origin 
in R¢. There, we can define 


lal |_ Q@ ..)) 6 QR 
IQ@iulf Q@in Qwu 
0 
T(x) := 
: 1-1 
) 
and compute 
Ty (x) Q(®)Ti (x) = 


Bo(x) 


with a s ymmetric matrix B2(x) of dimension r — 1, depending smoothly on x and 
satisfying B2(0) = diag(q2,...,q,). By possibly shrinking V, we may achieve that 
the diagonal entries of B(x) are nonzero. Thus, we may apply the same procedure 
in one dimension less and with qg; replaced by qo. Iterating this, we obtain invert- 
ible matrices T;(x),..., 7;(x) of dimension, respectively, r,..., 1, which depend 
smoothly on x. Then, 


T(x)" Q(X)T (x) = QO) with T(x) = T1(x)(11 ® HW) --- (1-1 © T(x) 
and hence 
(®*h) (x,y) =QO)ijy'y! with B(x, y) = (x, T(x)y). 


Thus, ® = @; o @ yields the desired local diffeomorphism (defined on some neigh- 
bourhood of the origin in R”, which can be determined from U and V). 


From the proof we can read off the following version of the Morse-Bott Lemma, 
to be used later on. 


Corollary 8.9.13 Let N be a manifold and let f : N x R' > R be a Morse-Bott 
function with critical submanifold N x {0} and f (N x {0}) =0. For everyme N, 
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there exist open neighbourhoods U of m in N and V of the origin in R’, a non- 
degenerate quadratic form Q on R’ and a diffeomorphism ® from U x V into 
N XR’ such that 


foP(m, y) = Q(y), pryoP(m,y)=m 


forallmeU andyeV. 


To conclude, let us add that, analogously to Morse homology, one can develop a 
homology theory based on Morse-Bott functions, see the Lecture Notes by Hutch- 
ings [146]. We will meet a further generalization of the classical Morse Lemma in 
the context of Morse families in Sect. 12.4. 


Exercises 

8.9.1 Prove Corollary 8.9.4 by diagonalization of Hesso(f) and an appropriate scal- 
ing of basis vectors. 

8.9.2 Find the critical sets of the functions 


ios, 

IQeSe, 
faye; 
fa, yaxy’, 


fx, y) =x? ~ Bx 9", 
f(, y) =cos(27x) + cos(2zry), 


Di=0 i IZi ? 
Yio Zi |? 


Which of them are Morse functions? Find the indices for their critical points. 
Hint. The sixth function has to be understood as a function on the 2-dimen- 
sional torus and the last function has to be viewed as a function on the com- 
plex projective space CP”, written down in homogeneous coordinates, cf. Ex- 
ample 1.1.15. 

8.9.3 Show that the height function of an upright 2-torus T’ as shown in Fig. 8.2 isa 
Morse function. Determine the critical points and their indices. Determine the 
homotopy type of the subsets M@ for all values of a. Convince yourself that 
at each critical point the homotopy type changes by attaching a cell whose 
dimension is given by the index. 

8.9.4 Study the function (8.9.12) for the case A = 1. Show that the set of critical 
points of f; coincides with the set of involutions a? = 1. Find a geometric 
interpretation of this set. 

Hint. Recall the definition of the Gra&{mann manifolds Gx(k,n) in Exam- 
ple 5.7.6. 


Chapter 9 
Hamiltonian Systems 


In this chapter we start discussing the theory of Hamiltonian systems. We begin 
with an introduction to the subject, including the Legendre transformation and a 
brief discussion of linear nonholonomic systems. Next, we present three classes 
of examples, which will play a role in the subsequent chapters: the geodesic flow, 
Hamiltonian systems on Lie group manifolds and Hamiltonian systems on coadjoint 
orbits. We close the elementary part of this chapter by presenting the time-dependent 
picture. 

In Sect. 9.4 we investigate the structure of regular energy surfaces and discuss 
the problem of the existence of closed integral curves for autonomous systems. 
This leads us to the famous Weinstein conjecture and to some aspects of symplectic 
topology. We will see that a special type of symplectic invariants, called symplectic 
capacities, constitutes the most important technical tool for studying the Weinstein 
conjecture. In Sect. 9.5 we start to investigate the behaviour of a Hamiltonian system 
near a critical integral curve. We will see that, generically, there is a continuum of 
periodic integral curves nearby, constituting so called orbit cylinders. These cylin- 
ders can undergo bifurcations. We study one of these bifurcations, provided by the 
Lyapunov Centre Theorem, in detail. Next, we derive the so-called Birkhoff normal 
form both for symplectomorphisms in the neighbourhood of elliptic fixed points 
and for the Hamiltonian of a system near an equilibrium. This normal form implies 
a foliation of the phase space into invariant tori and, in the normal form approx- 
imation, the theory becomes integrable. According to the celebrated KAM theory 
many of the invariant tori persist the perturbation caused by taking into account 
the full symplectomorphism or the full Hamiltonian, respectively. Moreover, we use 
the Birkhoff Normal Form Theorem to prove the Birkhoff-Lewis Theorem, which 
yields the existence of infinitely many periodic points near a closed integral curve of 
a certain type on a given energy surface. Finally, we study some aspects of stability, 
with the main emphasis on systems with two degrees of freedom. 

In the final two sections we study time-dependent Hamiltonian systems. In 
Sect. 9.8 we deal with the stability problem of time-periodic systems with emphasis 
on parametric resonance and in the final section we comment on the famous Arnold 
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conjecture about the existence of fixed points of Hamiltonian symplectomorphisms 
and its relation to the existence problem for closed integral curves. 


9.1 Introduction 


A Hamiltonian system is a triple (M,w, H), where (M,q@) is a symplectic man- 
ifold, in the present context called the phase space, and H is a smooth function 
on M, called the Hamiltonian or the Hamiltonian function. In this section we re- 
strict ourselves to autonomous! systems. Moreover, throughout the whole chapter 
we confine our attention to holonomic systems, except for a few remarks on the non- 
holonomic case in this section.” In this context, the above data have the following 
physical interpretation: every point m € M describes a (pure) state of the system. 
Smooth functions on M are observables. The result of a measurement of the ob- 
servable f € C°(M) in the state m is given by the value f (m) of the function at m. 
The dynamics of the system is governed by the Hamiltonian: the time evolution of 
a state m is given by the integral curve through m of the Hamiltonian vector field 
XH generated by H. Let us write down the equation for the integral curves of X y 
in local Darboux coordinates q' and p;. By (8.2.2), we have 


XH = (9p; H)dgi — (gi H) dp; - (9.1.1) 
Thus, the equations for the integral curves t > (q(t), p(t)) of Xy are given by 


sig) _ oH spies Oe 
FO=7(O.PO) — OD =F 7 (a.PO). 1.2) 


These are the Hamilton equations. We see that the whole information about the dy- 
namics of the system is encoded in the flow of the Hamiltonian vector field generated 
by the Hamiltonian function, indeed. 

In physics, a variety of phase space models occur. The most prominent one is 
that of a cotangent bundle,* that is, M = T* Q, see Sect. 8.3. The base manifold Q 
is called the configuration space of the system and dim Q is called the number of 
degrees of freedom. Given the great importance of this model, let us show how it is 
derived from the Lagrangian formulation, cf. Sect. 4.8, via the so-called Legendre 
transformation. For that purpose, let us consider a mechanical system with config- 
uration space Q, described by a Lagrangian L : TQ — R. Denote the canonical 
bundle projection by z7 : TQ — Q and extend L to a mapping 


L:TO> OxR, L(X) = (a1(X), L(X)). 


'That is, the Hamiltonian does not depend explicitly on time. We will show in Sect. 9.3 that the 
framework discussed here can be easily extended to the non-autonomous case. 


As mentioned in Sect. 4.8, in the theory of nonholonomic constraints many interesting branches 
are studied. Here, we limit ourselves to showing that the Lagrangian formulation presented there 
has a counterpart on the Hamiltonian level. 


3Sometimes one is led to go beyond the cotangent bundle model though, notably in the study of 
systems with symmetries (Chap. 10) where coadjoint orbits of Lie groups play an important role 
as phase space models. 
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We treat Q x R as a vector bundle over Q, denote its canonical projection by p: 
Q x R= @Q and note that 


poL=nrz. (9.1.3) 
Now, let us define the mapping 
FL:TQ—> Hom(T@Q, OQ x R)=T*Q, (FL)(X) := Doty (9.1.4) 


Here Ly =L tex '(g) is the restriction of the mapping L to the fibre over q € Q and 


Hom(TQ, Q x R) is the vector bundle defined in Remark 2.4.7. The mapping F'L 
is called the fibre derivative* of L. It is obviously fibre preserving and smooth. We 
have 


(PL), ¥\= “ L(X +tY) 
to 


for all X, Y in the same fibre of TQ. For a typical Lagrangian, given by the differ- 
ence of the kinetic and the potential energy, 


1 
L(X) = 58, X) — V(xr(X)), (9.1.5) 


where g is a Riemannian metric on Q and V is a potential function on Q, the 
Legendre transform reduces to 


((FL)(X), Y) = @(X, Y). (9.1.6) 


Thus, it coincides with the natural vector bundle isomorphism g : TQ > T*Q in- 
duced by the metric. If the fibre derivative FL : TQ — T*Q is a diffeomorphism, it 
is called the Legendre transformation induced by L and the Lagrangian function L 
is called hyperregular. A Lagrangian of the type given by (9.1.5) is always hyper- 
regular. Finally, L is called regular iff FL is a local diffeomorphism. 


Remark 9.1.1 Let us analyze the fibre derivative in local coordinates. Thus, let q 
be local coordinates on Q and let qg', g’ and q', p; be the induced local coordinates 
on TQ and T* Q, respectively. The Lagrangian L is regular iff 


Fn cal ) 40 (9.1.7) 
Cll ssa , ol. 
dq'dq/ 


(Exercise 9.1.1). The fibre derivative F L takes the local form 
a ree 
(FL)(q, q) = agi q) dg (9.1.8) 
(Exercise 9.1.2), that is, it is given by the mapping 


: aL, 
(q, q) > (ar= we (4.@): 
q 


4More generally, if we replace TQ and Q x R by arbitrary vector bundles E and F over Q, 
respectively, then (9.1.4) yields the definition of the fibre derivative Ff : E — Hom(E, F) for any 
smooth mapping f : E — F fulfilling condition (9.1.3). 
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If L is hyperregular, this is the local representative of the Legendre transformation 
known from classical mechanics. 


Now assume that L is hyperregular. In the Lagrangian formulation, the dynamics 
of the system is governed by the Euler-Lagrange equations 


d (aL aL _y 0.1.9) 
dt \ agi agi” a 


compare with (4.8.11). This system of equations can be cast into a coordinate-free 
form in different ways. For our purposes, it is convenient to proceed as follows. We 
take the pullback w, := (F L)*w of the canonical symplectic structure on T* Q, and 
thus endow TQ with a symplectic structure such that F L becomes a symplectomor- 
phism. Next, we define the energy function 


E:TO>R, E(X):=((FL)(X), X)- L(X), (9.1.10) 


and consider the Hamiltonian vector field X_ on TQ generated by E, 
XEso@y, = —dE. (9.1.11) 


We encourage the reader to check that in local coordinates q', q' the equations for 
the integral curves of X~ are equivalent to the Euler-Lagrange equations (9.1.9) 
(Exercise 9.1.3). Now, the Hamiltonian of the system is defined by 


H:T'O>R, H:=Eo(FL)"!. (9.1.12) 

From (9.1.11) we read off 
((FL)~!)*(Xgs@,) = —dH. 
Moreover, Proposition 8.2.9 implies 
(FL).XE= XH. (9.1.13) 

Thus, if Z is hyperregular, the Lagrangian formulation and the Hamiltonian formu- 
lation are equivalent. 
Remark 9.1.2 


1. If LZ is hyperregular, the tuple (TQ, ,, E) is a Hamiltonian system which is 
equivalent to (T* Q, w, H). Usually, by the Hamiltonian formulation one means 
the latter setting. However, it may be convenient to use the former setting, see 
e.g. Example 9.2.1. 

2. For the model class defined by (9.1.5) we obtain 


1 
H(@)= xe E, £)+ V(r()) =TE) + V(x), (9.1.14) 
with 
g'(E.n):=2(g'),.8-'@) 
being the metric induced on T* Q via the vector bundle isomorphism 


g=FL:TO>T*Q. 
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3. In local coordinates, we obtain 
OL or : 
A(q, Pp) = agi qq’ —L@, 4), (9.1.15) 


with q obtained in terms of (q, p) from solving p = ce for q. Thus, H is the 
Legendre transform of L known from classical mechanics. 


Next, let us make a few remarks on linear nonholonomic systems. Recall from 
Sect. 4.8 that a linear nonholonomic constraint is given by a smooth non-integrable 
distribution D C TQ on the configuration space Q of the system and that the Euler- 
Lagrange equations for this case are given by 


d /aL ee é (9.1.16) 
dt\agi) agi 0M @ 
Here, the Aq are Lagrange multipliers and uw“, a= 1,...,5, is a system of local 
1-forms spanning the annihilator D° of D, that is, 
Dm := {X €TmQ :(u“(m), X) =0 for all a}. (9.1.17) 


The solutions ¢ ++ q(t) and the Lagrange multipliers 1, are determined by the Euler- 
Lagrange equations (9.1.16) and the constraint equations 


13 (q(t))q' (t) = 0, (9.1.18) 


cf. (4.8.5). We wish to cast this system of equations into a coordinate-free Hamil- 
tonian form. For that purpose, assume that L is hyperregular. Similarly to (9.1.11), 
the Euler-Lagrange equations (9.1.16) are equivalent to the equations for the integral 
curves of a vector field X E, which, here, is defined by 


XFL =-dE+ hate". 
The coordinate-free form of the constraint equation (9.1.18) is 
(u" om, wo Xz) =0, 


that is, 0 Xp :TQO— TQ takes values in D. We apply the Legendre transforma- 
tion to these equations. Denoting 


Xn =(FL)«Xe, (9.1.19) 
and using 77 = z o (FL), for the first equation we obtain 
Xys@ = —-dH +Agn* yu". (9.1.20) 
In canonical bundle coordinates, the equations for the integral curves of Xp read 
4 9H 4. 0H : 
ga, pre + hat (9.1.21) 
Opi dq' 


These are the Hamilton equations for the case of linear nonholonomic constraints. 
For the constraint equation we obtain 


(u% on, 2’ 0 Xp) =0, (9.1.22) 
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that is, 2’ o X47 :T*Q — TQ takes values in D. The image M = (FL)(D) CT*Q 
of D under the Legendre transformation is called the constraint submanifold. We 
have 


M={é €T*Q:(u"(x(E)), (FL) (E)) =0 for all a}. 


Next, we wish to implement the constraint equation (9.1.22) in an intrinsic man- 
ner, this way removing the Lagrange multipliers. We follow Bates and Sniatycki 
[35], see also [170]. Let us consider the distribution D= (t’)—!(D) on T*Q. In 
terms of the local frame {i} in D®, it is given by 


D={X eTT*Q: (x*y, X)=0 forall a}. (9.1.23) 


Since the constraint equation is equivalent to m'oXy:T*Q > TQ taking values 
in D, Xq must take values in D. This implies, in addition, that Xp is tangent to M 
(Exercise 9.1.4). Hence, x H takes values in 


F:=DNTM. (9.1.24) 


It is easy to show that D = D® © F, see Exercise 9.1.5. Therefore, F is a regular 
distribution on M, called the constraint distribution, and the fibrewise restriction 
wr Of w to F is non-degenerate. Since the restriction of xX H to M, denoted by the 
same symbol, takes values in F’,, the fibrewise restriction of xX H-@ to F coincides 
with X HF. Since, in addition, 2* 1, vanishes on F, restricting (9.1.20) to F we 
obtain 


Xysor =—(dH)p. (9.1.25) 


Since wp is non-degenerate, this equation determines Xj uniquely. This way, we 
have reduced the dynamics to the constraint submanifold M: it is given by the in- 
tegral curves of the vector field Xy on M which takes values in the constraint 
distribution F'. We encourage the reader to work out the description of the above 
structures in the bundle coordinates g', p;, g' and p; on TT* Q induced from bundle 
coordinates qi and p; on T* Q (Exercise 9.1.7). 


Remark 9.1.3 The distribution D is spanned by the Hamiltonian vector fields Z, on 
T* Q generated by the functions 
§ + (ua(w&)), (FL) ©): 
Evaluating the Hamilton equations (9.1.20) on Z“, we obtain 
O=—(Z*,dH)+ > dolx*u%, 2”). (9.1.26) 
b 


It is easy to show that for the class of models defined by (9.1.5), the matrix 
(x* 1“, Z”) is invertible (Exercise 9.1.6). Thus, this equation can be used to elimi- 
nate the Lagrange multipliers A,. The resulting equation reduces to (9.1.25). 


Now, we discuss symplectomorphisms. In the context of physics, a symplecto- 
morphism of the phase space is referred to as a canonical transformation. Canonical 
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transformations play an enormous role in the study of Hamiltonian systems. The 
first simple but very important observation follows from Remark 8.2.5/1, which 
states that the flow of the Hamiltonian vector field Xy consists of locally defined 
symplectomorphisms. This means that the dynamics of a Hamiltonian system can be 
viewed as a time-dependent canonical transformation. This observation can be con- 
sidered the starting point of Hamilton-Jacobi theory, to be presented in Chap. 12. 
Moreover, Proposition 8.1.3 implies 


Theorem 9.1.4 (Liouville) The phase space volume form 824 of a Hamiltonian 
system (M, w, A) is invariant under the flow of the Hamiltonian vector field X ,. 


The following corollary is a direct consequence of Proposition 8.2.9. It states that 
the Hamilton equations are invariant under canonical transformations. 


Corollary 9.1.5 Let (M,w, H) be a Hamiltonian system and let 8: M + M be a 
canonical transformation. Then, 


®,X4 =X y,9-1- (9.1.27) 


Remark 9.1.6 


1. If the canonical transformation @ is given in local Darboux coordinates by 


(q. Pp) +> (G(q. Pp), PCG, p)), 


then (9.1.27) yields the Hamilton equations in the variables (q, p), with Hamil- 
tonian H(q, p) = (Ho &-')q, p). We stress that there exist transformations 
which leave the Hamilton equations invariant but which are not canonical, e.g. 


(q.p, 1) (@=4q, p=ap, HW =aH) 


with an arbitrary constant a 4 0. 

2. With a slight abuse of language, the view on Hamiltonian systems described 
above may be called the Schrédinger picture of classical mechanics, because the 
dynamics of the system is given in terms of the time evolution of the states. The 
time evolution can be shifted to the observables as follows. For f € C°(M), 
define the family tr f(t) := f o ®;, where @ is the flow of Xy7. Then, 


(f (t))(m) = f(G(m)), 


that is, the result of a measurement of f = f (0) in the state ®;(m) coincides with 
the result of the measurement of f(t) in the state m. By (8.2.6), using the stan- 
dard physics notation f = g f(t), we obtain the following equation of motion 


for f(t): 


f=, fh. (9.1.28) 
In particular, the Hamilton equations can be rewritten as 
gi={H,q'}, pi={H, pil. (9.1.29) 


This may be viewed as the Heisenberg picture of classical mechanics. 
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3. The notion of Hamiltonian system generalizes to Poisson manifolds. In this gen- 
eralized sense, a Hamiltonian system is a triple (MV, {, }, H) where (M, {, }) is 
a Poisson manifold and H € C°(M). The dynamics is given by a = {H, f}.In 
local coordinates, this yields the equations of motion 


: 0H 
x! ={F,x"| se aires 


Finally, let us turn to the important notion of constant of motion.> 


Definition 9.1.7 Let (M,@, H) be a Hamiltonian system. A function f € C°(M) 
is called constant of motion if 


d 
qgilv@)=9 


for every integral curve t > y(t) of the Hamiltonian vector field Xj. 


Equivalently, f is a constant of motion iff its pullback @* f under the flow ©, of 
X is independent of f. 


Remark 9.1.8 By (9.1.2), the function p; is a constant of motion iff H does not 
depend on the corresponding coordinate q’. In this case, q' is said to be cyclic. 


Proposition 9.1.9 The Hamiltonian H of a Hamiltonian system (M,w,H) is a 
constant of motion. 


Proof For any integral curve t > y(t) of X7, we have 


d 
Gly) = Xq(H)(y(t)) = (dH, Xx) (y(t) = —@ (Xu, Xu)(y(O) = 0. 


This is, of course, the law of energy conservation for autonomous systems. The 
following proposition characterizes constants of motion in terms of the Poisson 
bracket. 


Proposition 9.1.10 Let (M, w, H) be a Hamiltonian system. 


1. A function f € C°(M) is a constant of motion iff { f, H} = 0. 
2. If f, g € C™(M) are constants of motion, then { f, g} is a constant of motion, 
too. 


Proof Point | follows from 


< Fv) =Xa(f)(y) ={4, fv) 


and point 2 is a consequence of point | and the Jacobi identity. 


5Also referred to as an integral of motion or a first integral. 
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Fig. 9.1 The restriction of M 
Xx to D is tangent to Y R 


Now, assume that e € R is a regular value of H, that is, dH(m) 40 for all m € 
H-'(e). By the Level Set Theorem, H —l(e) is an embedded submanifold of M of 
dimension 2n — 1. The connected components of H~!(e) are referred to as regular 
energy surfaces. Let »’ be such a regular energy surface and let 1: 3’ — M be the 
natural inclusion mapping. By Proposition 9.1.9, X is invariant under the flow of 
Xy. Thus, Xy is tangent to &' and, therefore, induces a vector field X 2 on » 


which is t-related to X 7, 
U(X#(m)) = Xy(u(m)), med. (9.1.30) 


We draw the important conclusion that for every regular value of the Hamiltonian 
we get a reduction of the dynamics to a submanifold of codimension 1, see Fig. 9.1. 
Since dH vanishes nowhere on ’, X = does not have equilibria. 

Let us consider wy := 1*w. This is a closed 2-form on 2’, which is necessarily 
degenerate, because & has dimension (2n — 1). To calculate the rank, we use (7.2.3) 
and Proposition 7.2.4. This yields 


rank(@s)m = dim(T, 5) — dim(Tm © (Tn Z)”) = 2(n — 1) 


for all m € &. Hence, the rank of wy is maximal and the characteristic distribu- 
tion D® = kerwy of wy is regular of rank 1. Moreover, using (9.1.30), we ob- 
tain 


(*o), (X7q(m), Y) = om (Xu (een)), UY) = —(dH (cm), /Y) = 0 


for any Y € T,X’. Thus, Xjos = 0 and hence D®* is spanned by xe. In partic- 
ular, the (images of the) integral curves of X - coincide with the characteristics of 
x Moreover, as a vector bundle, D®~ is trivial. 

The reduction to 2’, induced by a single constant of motion, can be generalized 
to several constants of motion /f|,..., f;. Consider the mapping 


F=(fi,..., fi): M > R* 


and assume that h € R* is a regular value of F. Then, by the Level Set Theorem, 
F7!(h) is a (2n — k)-dimensional embedded submanifold of M. This submanifold 
is invariant under the flow of X;,. Chapters 10 and 11 are devoted to the study of 
the following specific reductions of this type. 


More precisely, they are equivalent as submanifolds of M, because they are integral manifolds of 
the same integrable distribution, cf. Proposition 3.5.15 and Remark 1.6.13/5. 
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(a) If the reduction is induced by a symmetry of the system, the constants of motion 
fj arise as the components of a so-called momentum mapping. There exists 
a general symmetry reduction procedure, called Marsden-Weinstein reduction. 
This theory will be presented in detail in Chap. 10. 

If the constants of motion fj fulfil an additional global integrability condition, 
namely, if all their pairwise Poisson brackets vanish, one says that they are in 
involution. If there exist n constants of motion in involution, the system is said 
to be integrable. All the exactly solvable models usually taught in a course on 
classical mechanics are of this type. This class of systems will be studied in 
Chap. 11. 

We note that, locally, there are always (2n — 1) functionally independent con- 
stants of motion. Indeed, Proposition 3.2.17 provides local charts (U, «) such 
that (X7);y = df. The corresponding coordinate functions (k?,...,«") are 
constants of motion on the open subset U. However, in general these local con- 
stants of motion cannot be extended to the whole phase space in a functionally 
independent way. 


(b 


wm 


As a matter of fact, many examples belong both to classes (a) and (b). 

We conclude this section with a simple example which illustrates that a reduction 
may allow to draw conclusions about the qualitative behaviour of the dynamics of 
the full system. 


Example 9.1.11 Consider the Hamiltonian system with Hamiltonian 
1 
H=- 
5 
on the phase space M = T*(R \ {0}) x T*(R \ {0}). The functions 


054.2, Dy 28 
Py + py +.o7 9g) +0343) 


1 ; 
HH, := 5 (Pi +0747), i=1,2, 
are constants of motion. Every h = (EF), E2) € R2 fulfilling E,, Ez £0 is a regular 
value of (H\, Hz). The corresponding level set &' is a two-dimensional torus. In 
angle coordinates 1, ¢2 defined by 


JV2E; . 
gi = : sin gi, Di = V 2E; cos Gj, 


Qj 


pee is given by 
X77 = 0109, + 024, (9.1.31) 


(Exercise 9.1.8). If m1 /q@z2 is rational, the integral curves are closed and hence iso- 
morphic to S!. If @1/@ is irrational, every integral curve is dense in Y. 


Exercises 
9.1.1 Prove that a Lagrangian is regular iff it satisfies (9.1.7). 
9.1.2 Prove Formula (9.1.8). 
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9.1.3 Prove that in bundle coordinates g' and g' on TQ, the equations for the in- 
tegral curves of the vector field X¢ defined by (9.1.11) are equivalent to the 
Euler-Lagrange equations (9.1.9). 

9.1.4 Show that if the vector field Xy defined by (9.1.19) takes values in D, it is 
tangent to the constraint manifold M = FL(D). 

Hint. Show that, in bundle coordinates qi and g' on TQ, the integral curves 
tr> (q(t), q(2)) of Xz satisfy Sq(r) = q@). 

9.1.5 Prove that the distribution F defined by (9.1.23) satisfies D® @ F = D. 
Hint. Show that the distribution D is coisotropic and that D° ATM =0. 

9.1.6 Show that the matrix (7*w?, Z,) in Eq. (9.1.26) is non-degenerate. 

9.1.7 Analyze the Hamiltonian description of linear nonholonomic constraints, 
given in this section, in terms of the bundle coordinates g', p;, g' and pj; 
on TT* Q induced by coordinates g' on Q. 

9.1.8 Prove Formula (9.1.31). 

9.1.9 Let (M, w, H) be a Hamiltonian system and let mo be a regular point of H. 
Show that there exists a local Darboux chart (U,«) at mo with coordinates 
x = (q, p) such that 


Ko®,ox '(x)=x+(t,0,...,0), pr=Aty 


for all (t,x) € R x R®” for which the left hand side of the first equation is 
defined. From the first equation we read off 


«1(®,(m)) =k\(m) +t, Kj (®,(m)) =xKj(m), 1<i<2hn, 


that is, g! is the local coordinate along the flow line. A chart of this type is 
called a Hamiltonian flow box chart. 


9.2 Examples 


In this section we discuss three classes of examples, which are important both in 
mathematics and in physics. Each of them will be taken up again later on. 


Example 9.2.1 (Geodesic flow) Let (M,g) be a Riemannian manifold. Consider 
the cotangent bundle T*M, endowed with the natural symplectic structure m = dé. 
Using the isomorphism g: TM — T*M, we can transport this structure to the tangent 
bundle:’ 

OL :=28"0, wy := db, = g* a. 


This way, TM becomes a symplectic manifold. As the Hamiltonian, we choose the 
energy function 


E:TMSR, — E(X):= sax, X). (9.2.1) 


7Cf. Remark 9.1.2/1 and recall that g is the Legendre transformation induced by the Lagrangian 
L(X) = 38(X, X). 
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The Hamiltonian vector field of E is referred to as the geodesic vector field of 
es g). Let us analyze the corresponding Hamilton equanons in bundle coordinates 
q' and v' on TM induced by local coordinates g' on M. Let q' and p; be the 
corresponding bundle coordinates on T* M. In these coordinates, g is given by 


(q, v) +> (q, p= g(v)) 
and the tangent mapping is given by 
Big,v) (X! Agi + ¥' 9,7) = X'9qi + (gi (MY! + gijx(q@e'X*)ap,, 
where gij,4 = ee. Therefore, using 6 = px dq*, we obtain 


(91) (q,v) (X' dgi + ¥' Oy) = (6g, y) (X' Ig: + ¥'8,; ')) = gei(qu' X*. 
Thus, 
_ lak a l k land k 
6, = gnu dq”, oy = gidv A dg" + gei,ju dg’ A dq”. (9.2.2) 


In these coordinates, the Hamiltonian takes the form 


1 
E(q, Vv) = 5 8ij (Me ae (9.2.3) 
Thus, 
i i,j a k iaqvk 
dE = ZSij kv v! dq" + gigv' dv". (9.2.4) 
Using (9.2.4) and (9.2.3), from X¢iw, = —dE one reads off the following local 
formula for the Hamiltonian vector field (Exercise 9.2.1): 
Xe =v dg — vv Haye. (9.2.5) 
Here, 
pm: 1 mk 
fae (Sik, j + &jk,i — Sij,k) 
are the Christoffel symbols of the Levi-Civita connection® defined by the metric g. 


Thus, the Hamilton equations take the form 


+k _ 1k kai irk 
q =v, v= —vv lj). 


The corresponding Euler-Lagrange equations are 
q' + q'g/Tk =0. (9.2.6) 


This is the geodesic equation on (M, g). Thus, the projections of the integral curves 
of Xz to M are geodesics of the Riemannian structure. A detailed discussion of 
models of this type can be found in [1], Sect. 3.7. We will come back to the geodesic 
flow in Example 10.6.1. 


8In this example we use some facts from Riemannian geometry which go beyond the material of 
Sect. 4.4. In part II of this book we will give a concise treatment of the theory of Riemannian 
manifolds. For the time being, we refer to standard textbooks, see e.g. [166] or [76]. 
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Example 9.2.2 (Hamiltonian systems on Lie groups) We take up Example 8.3.4 
and consider a Hamiltonian system (T*G, w, H), where G is a Lie group with Lie 
algebra g, w is the canonical symplectic form on T*G and H € C™(T*G). We 
determine the Hamiltonian vector field X in the (inverse) left trivialization G x 
g — T*G given by (8.3.6). For that purpose, for (a, jz) € G x g* we make the ansatz 


Xu (au) = (LA, p) 
with A € g and p € g* and analyze the defining equation 


for arbitrary tangent vectors Y = (L/,B,o) at (a, 4). By (8.3.8), the left hand side 
reads 


o(Xx(a, 1), Y) S (po, B) = (a, A) —(u, [A, B]). 


For the right hand side, we compute 
d 
an H (aexp(tB), uw + to) = (Ha), (0) + (Ay) 0 LB) 
0 


with the induced functions H,:g* > R and H,,:G — R. Viewing the linear map- 
pings (H,)), : 9* > Rand (H,,)/,0L, : g > Ras elements of g and g*, respectively, 
we obtain 


dH (Y) = (0, (Ha)j,) + (Hwa o Ly, B). 
We read off 
A= (Ha)u> p=—(H,)), 0 Li, - ad* ((Ha)),) (9.2.7) 
As a consequence, the Hamilton equations are 
a=L,((Ha)j,)> jt = —(Ay)y 0 Li, — ad*((Ha);,) it. (9.2.8) 


In the special case where H is invariant under the action of G by left translation, 
H i, = 0 and the Hamilton equations read 


a=L,((Ha)j,)> ju = —ad*((Ha)),)- (9.2.9) 
Finally, using (8.3.8) and (9.2.7), for the Poisson bracket of f, g € C°(T*G) we 
obtain 
{f g(a, 7) _ o(X f(a, LL), Xo, 1) 
=(—(fua oly — ad* ((fa)),) Hs (8a)ju) 
—(—(gu)a oLg — ad* (ga), ) os (Fadi) — (o> [fads (au), 
that is, 
(f.gh(a, “) 
=((guyi, Li, (fa)u) — (Cfude LG, (a)y.) + (us [Cadi BaYi,]}- @.2-10) 


Models of this type are relevant, for example, in the theory of the top. 
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Example 9.2.3 (Hamiltonian systems on coadjoint orbits) Let G be a Lie group and 
let g be the Lie algebra of G. By Example 8.2.18/3, the dual vector space g* carries 
a natural linear Poisson structure, which according to (8.2.18) is given by 


{f.h}(w) =(n, [df (4), dh(u)]) 


with « € g* and f,h Ee C™(g*). Let H € C™(g*). As discussed in Sect. 8.2, the 
Hamiltonian vector field X q on the Poisson manifold g* is defined to be the vector 
field corresponding to the derivation {H, -} of C®(g*), that is, XH (f) = {H, f} for 
every f € C™(g*). Since 


(Xu(f))() ={H, f}(u) =(u, [dA (u), df (uw) |) = —(ad* (dH (4) u, df (w)), 
we read off 
Xy(u) = —ad* (dH ()) um. (9.2.11) 


Consequently, the Hamilton equations associated with the Hamiltonian system 
(g*, {,}, H) are 


ju = —ad* (dH (u)). (9.2.12) 
In coordinates jz; with respect to a basis in g*, this reads 
: 1 0H 
hj= ea 


where el ; are the structure constants with respect to the dual basis in g. As a Hamil- 


tonian vector field, Xj is tangent? to the symplectic leaves of g*, cf. the remark 
after Theorem 8.2.20. By Proposition 8.4.3, the leaves coincide with the coadjoint 
orbits. Thus, the dynamics of the (Poisson) Hamiltonian system (g*, {, }, H) reduces 
to the coadjoint orbits, hence inducing a (symplectic) Hamiltonian system on each 
of them. As a consequence of this observation, every Ad*-invariant function on g* 
is a constant of motion. Let us add that if g admits an Ad-invariant scalar product, 
according to Remark 8.4.4, we can identify g* with g and the adjoint representation 
with the coadjoint representation. Under this identification, the Hamilton equations 
take the form 


f=([p, dH]. (9.2.13) 
One says that (4, dH) constitutes a Lax pair. In Chap. 11 we will discuss Lax pairs 


in some detail. 


Exercises 
9.2.1 Prove Formula (9.2.5). 


°This also follows by direct inspection: from (6.2.3) and (9.2.11) we read off that Xy (jw) 
coincides—up to the sign—with the value at jz of the Killing vector field of the coadjoint action 
generated by dH (2). 
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9.2.2 Analyze the Poisson structure of Example 9.2.3 for the case of the Lie group 
SO(3). Compare with the symplectic structures on the coadjoint orbits found 
in Example 8.4.5/3 and write down the Hamilton equations for the quadratic 
Hamiltonian 

1 
H= 5) ain. a; > 0. 
i 


Which physical model is described by this Hamiltonian? 

9.2.3 Consider the Euclidean group E(3), defined in Exercise 8.4.2. Show that the 
Poisson structure on the dual space g* = R° of E(3) is given in standard 
coordinates (j4;, A;), i = 1, 2, 3, by the Poisson brackets 


{His Mj} = ij” Uk, {mi Aj} = e:j* Ak, {Aj,A;} =0. 


9.3 The Time-Dependent Picture 


In this section we develop a calculus where the time variable is naturally integrated 
into the phase space of the system. One of the motivations for doing so is to deal 
with explicitly time-dependent Hamiltonians. 

Thus, let (7, @) be the phase space and let H = {H; : t € R} be a smooth family 
of Hamiltonian functions. To this smooth family there corresponds a smooth family 
of Hamiltonian vector fields, that is, a time-dependent vector field X 7, given by 


(XH)(t,m) = Xp, (m). 


Such Hamiltonian systems are called time-dependent or non-autonomous. The spe- 
cial case of an autonomous system is of course included. The geometric structure 
relevant for the description of such systems is the extended phase space 


M=TRx M, @ := pry w — pri(dE A dr), (9.3.1) 
where ¢ and E denote the standard coordinates on T*R = R x R and 
pr}: M > T*R, pr: M—> M, 


are the natural projections. In what follows, the latter will usually be omitted. In 
local Darboux coordinates g' and p; on M, we have 


© = dp; Adq' —dE A dt. 
Points of M Rx Rx M will be denoted by (t, E,m). 
Remark 9.3.1 For M = T* Q, the extended phase space (M, @) is isomorphic to the 
cotangent bundle of the extended configuration space Q = R x Q. 


Now, let us define the extended!” Hamiltonian function 


H:M—>R, H(t, E,m):= H(t,m)— E, 


!0Sometimes also called the suspended Hamiltonian. 
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and consider the extended Hamiltonian system (M, @, H). This is an ordinary (au- 
tonomous) Hamiltonian system. Thus, the Hamiltonian vector field X of H is 
defined by 


X 7.0 = —dH, (9.3.2) 
and we have LX 5 @ = 0, that is, the flow of Xj is a local symplectomorphism. The 
relation between X , and the time-dependent vector field X y is given by 

xX — + —-— +X (9.3.3) 
(Exercise 9.3.1). Here, Xy is trivially extended to M. Thus, the equations for the 
integral curves s +> (f(s), E(s), y(s)) of X gq are 


d d oH d 
qs ao 7 (t(s), v(s)), ql (9) = Xu (t0), v(9)). 


(9.3.4) 


From the first of these equations we read off ¢t(s) = f9 +s, that is, up to a con- 
stant, the parameter s coincides with the time variable t. Moreover, an analogous 
calculation as in the proof of Proposition 9.1.9 yields 


d ry — 
qt), E(s), y(s)) = 


that is, H is aconstant of motion. Therefore, 
A(t +s, E(s), v(s)) = (to, E), yO) 
and thus 
E(s)=E(0)+ H(t +5, y(s)) — (to, y(0)). 
Consequently, the flows ® of X q and © of Xx are related by 


P(s,(t, E,m)) =(t+s, E+ H(t +5, ®145,,(m)) — H(t,m), Pi45,(m)). 
(9.3.5) 


Remark 9.3.2 In local Darboux coordinates gi and p; on M, the system of 
Eqs. (9.3.4) reads 


dt 
a sare y= “(H(), q(s),p(s)), (9.3.6) 


= ne 
q'(s)= apy q(s), p(s)), Pils) = agi (t(s), a(s), p(s). (9.3.7) 


These are the Hamilton equations on the extended phase space. We note that, in 
contrast to H, H is not a constant of motion. 


Now, let us consider the level sets of H. Since ae = —1, every value of A is 


regular and hence every level set is an embedded submanifold of M. Since any two 


9.3 The Time-Dependent Picture 443 


Fig. 9.2. The induced vector 
field X Zz spans the Er 


characteristic distribution 
D®s= on 


= 


4 Integral curves of XF 
> 


level sets are mapped onto one another by a shift in the variable E, it suffices to 
consider the level set 


y:=A7'(0). 


Let 1: © — M denote the natural inclusion mapping. Since D is defined by the 
equation E = H(t, m), itis mapped to the graph of H under the reordering 


M-— (Rx M)xR, (t, E,m) +> ((t,m), E). 
Suppressing this reordering, we will interpret the natural mapping 
ery : Rx M— M, (t,m) +> (t, H(t,m),m), (9.3.8) 


as the graph embedding of H (therefore the notation). By Remarks 1.6.12/2 and 
1.6.13/5, gr}, is a diffeomorphism onto &. Thus, ¥ is connected iff so is M. Since 
H is aconstant of motion, X 7 1S tangent to » and hence induces a vector field X : 


on & which is u-related to X rE We have 
0 
x = =e(= + Xu). (9.3.9) 


where gr, is viewed as a mapping to » and both 2 ; and X y are viewed as vector 
fields on R x M (Exercise 9.3.2). Note that their ith coincides with the extended 
vector field X77, cf. Sect. 3.4. As shown in Sect. 9.1, the induced 2-form 


3 ta RP 
OF =O 
on » has maximal rank and the characteristic distribution D° = ker w 5 is spanned 


by x see Fig. 9.2. 


Definition 9.3.3 (Time-dependent canonical transformation) A symplectomor- 
phism @ of the extended phase space M is called a time-dependent canonical trans- 
formation of M if (2) Cc XY and &*t =t. 


Time-dependent canonical transformations of M induce time-dependent canoni- 
cal transformations of M: 
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Proposition 9.3.4 Every time-dependent canonical transformation ® of M induces 
a smooth mapping ® :R x M > M by 


O(t, H(t,m),m) = (t, Ho O(t,m), P(t, m)). 
For every fixed t, ®(t, -) is a canonical transformation of M. 

The mapping @ is called a time-dependent canonical transformation of M. 
Proof Since ® leaves © invariant and since gry is an embedding, ® induces a 
diffeomorphism & of Rx M by 

gry =Pogr,. (9.3.10) 


We set ® := pry of. Then, ®, := P(t, -) isa diffeomorphism of M for every t. To 
see that &; is symplectic, define 1, : M > Rx M by t;(m) := (t, m). Using (9.3.10) 
and ® ol; = lt; o ®;, we obtain 


Po (Sty Olt) = (Bry Olt) oO Dy. 
A straightforward computation shows that (gry 0l;)*@ = w. Thus, on the one hand, 
((gry ots) 0 ®;)'@ = DF w 
whereas on the other hand, 


((grz Ol;) 0 D,)*@ = (gry 0l;)* 0 P*O=0. 


This yields the assertion. 


Next, we characterize time-dependent canonical transformations in terms of gen- 
erating functions. Thus, let ® be a time- aependent canonical transformation of M 
and let 73 C M x M be its graph. Let q', pj and q', p; be Darboux coordinates 
on the first and the second copy of M, respectively. For f € C °(M), we write 
f= fo ®. According to Sect. 8.8, the defining relation for a generating function 
S= S(q, G q. t, 7) of the first kind for ® reads 


(p; dg! — Ed?) — (pi dq' — Edt) = —dS. (9.3.11) 


Since i= t and since on © we have E = H and E = H, the restriction S of S to 
Tg (2 x &) satisfies 


(p; dq! — H dt) — (p; dq' — H dt) = —dS. (9.3.12) 
By comparison of coefficients, we obtain the relations 
as as - 0s 
pj = -——, ;=—, H=H+—. 9.3.13 
Pi ag! Pi aq! + at ( ) 


Thus, in particular, for every fixed tr, the function (q, q) +> S(q, q, t) is a generating 
function of the first kind for the canonical transformation ®(t,-) of M. The corre- 
sponding equations for the generating functions of the second, third and fourth kind 
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are obtained by replacing the defining relations (9.3.13) by, respectively, (8.8.4), 
(8.8.5) and (8.8.6). The relation for the Hamiltonian holds for every choice of S. 
Finally, let us study the geometry of Y under the assumption that M = T* Q and 
q@ = d6, where @ is the canonical 1-form on TO; In this situation, according to 
Remark 9.3.1, the extended phase space is given by M =T*(R x Q). Moreover, 


6 = pr3 6 — prt(Edr) (9.3.14) 


is the canonical 1-form on M and we have @ = dé. 


Proposition 9.3.5 The 1-form 1*6 is relatively invariant with respect to the vec- 
tor field X 3 . It endows & with a (strict) contact structure iff the function 
t“(0(X 4) — H) vanishes nowhere on >. 


Proof The first assertion is obvious, because d(*9) = w sy and X 2 spans D®S. We 


prove the second assertion. Since X 3 and X , are t-related, for every m € » we 
have 


(“"6),, (2) = Om) (X fg) = (0(XH) — H)(u(m)). 


Since Xj vanishes nowhere, we conclude that i*@ is non-degenerate on D®= iff 
the function .*(6(X;,) — H) vanishes nowhere. In this case, ker(*6) is a hyper- 
plane distribution on ¥’. To prove that it is a contact structure on 4’, we show that 
((*0) A (d(*))” is a volume form on & and apply Proposition 8.5.12. Since *6 is 
non-degenerate on D®5, the distribution ker(*) is complementary to D°S. Thus, 
it suffices to show that (d(*8))" = Oe is non-degenerate on ker(1*). Since &” is 
non-degenerate, the kernel of o'. = 1*@" has dimension at most 1. Since it contains 


keros = D®*, it coincides with the latter. Thus, (d(c*6))" is non-degenerate on 
ker(v*6), indeed. 


Remark 9.3.6 
1. Obviously, the forms a, @’,...,@" are integral invariants of the extended Hamil- 
tonian vector field X , and their pull-backs 1*@, U@,...,U°@" to D are integral 


invariants of the induced vector field X 3 F 


2. The 1-form 1*6 is called the Poincaré-Cartan integral invariant. Since on © we 
have E = H(t, m), in Darboux coordinates it is given by 


6 = pj dq! — H dt. 


3. Since the characteristic distribution D°S of ws. is spanned by X F , it is trivial 
as a vector bundle and hence orientable. Thus, under the additional assumption 


'IMore generally, we could assume that the symplectic form w is exact. 
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Fig. 9.3 Closed curves y| 
and y2 in ©’ which have the q 
same flow cylinder 


that ©’ be compact, it is a hypersurface of contact type. The reader should com- 
pare Proposition 9.3.5 with Example 8.5.18/5, which states that T*Q x R is a 
strict contact manifold without any further assumptions. The reader should also 
convince himself that, for the class of models defined by (9.1.14), the mysterious 
function 6(X 7) — H is nothing but the Legendre transform L o (F L)~! of the 
Lagrangian L. If it does not vanish, (@(X 7) — H 7, oy 3 is the Reeb vector field 
of the contact form “6, cf. Remark 8.5.17. 

4. The reader who feels uncomfortable with the distinguished role which the global 
time variable tf plays in this section should consult Sect. 16 of Chap. V in the 
book of Libermann and Marle [181] for a more intrinsic treatment. 


The Poincaré-Cartan integral invariant has the following interesting property. To 
formulate it, we observe that, under the flow of X a , every closed (oriented) curve y 


in Y whose tangent vectors are nowhere parallel to X generates a two-dimensional 


oriented submanifold diffeomorphic to S! x R, called the flow cylinder of y. Here, 
the orientation of R is induced by the time evolution. Proposition 4.2.16 implies 


Corollary 9.3.7 If y; and y2 are closed curves in © whose flow cylinders coincide 
as oriented submanifolds, see Fig. 9.3, then Ly a i uO. 


For a further discussion of the Poincaré-Cartan integral invariant we refer to the 
book of Arnold!? [18]. 


Exercises 


9.3.1 Prove Formula (9.3.3). 
9.3.2 Prove Formula (9.3.9). 


!2Tn this book, Arnold develops an almost philosophical attitude towards this integral invariant by 
showing that it can be taken as a starting point for building Hamiltonian mechanics. Then, also 
symplectic geometry is a derived structure. From a more modern point of view, the reader can 
find a lot of interesting thoughts about the unifying power of symplectic and especially of contact 
geometry in another paper by Arnold [22], which ends with the statement that “contact geometry 
is all geometry”. 
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Hint. Show that the mapping grj,, given by (9.3.8), satisfies 


cL 
Tr => Ty. 
Poa hae” ae Oy 
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In this section we study the geometry of regular energy surfaces. From this, we 
derive some basic statements about the existence of periodic integral curves in au- 
tonomous systems. 

Thus, let (M, w, H) be a Hamiltonian system of dimension 2n and let ® denote 
the flow of the Hamiltonian vector field X q. For a regular energy surface »' of H, let 
t: &’—> M be the natural inclusion mapping and X 2 the vector field on X’ induced 
by Xy. As before, denote wy := *w and let D®” := kerwy be the characteristic 
distribution of wy, cf. Sect. 9.1. Recall that D°* is spanned by X 5 and that integral 
curves of sections of D®» are called characteristics of D®*. 

On a regular energy surface, dH does not vanish. This implies 


Proposition 9.4.1 Let (M,q@, H) be a Hamiltonian system and let &' be a regular 
energy surface of H. The symplectic form w induces a natural volume form 4 on 2’. 
This form is invariant under the flow of the induced vector field xz. 


Proof Let $2, be the canonical volume form on M induced by a, cf. (8.1.3). Since 
dH vanishes nowhere on 2’, it vanishes nowhere on some open neighbourhood U Cc 
M of » as well. Hence, there exists a (2n — 1)-formo on U such that 2, =dH Ao 
on U. We put yz :=¢*o. This form does not depend on the choice of o: let a’ be a 
second local (2n — 1)-form fulfilling 2,, = dH Ao’ on some open neighbourhood 
U' of X. Then, (o —o’) \dH =00n UNU’, so that there exists a 2(n — 1)-form p 
on UN U' satisfying o —o' =dH A p. But i*(o — o') = 1* (dH A p) = 0, because 
(“A is constant. It remains to show that yz is X ¢, -invariant. Since 


0= Ly, Qo =dH A Lx,0, 
there exists a 2(n — 1)-form t on U such that “y,,0 =dH At. Thus, 
Lye (=F yo = dA AT) =0. 


The existence of an invariant volume form on a regular energy surface has the 
following important consequence. 


Theorem 9.4.2 (Recurrence Theorem of Poincaré) Let (M,w, H) be a Hamilto- 
nian system and let &’ be a compact regular energy surface. With respect to the 
measure on & defined by the induced volume form 1, almost every point m € & is a 
recurrent point, that is, there exists a sequence {t;} with t; > 00 such that the flow 
® of Xx fulfils 


lim ®,, (m) =m. 
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Proof We denote the measure on » defined by the induced volume form jz by 
the same symbol. Since Y is compact, the flow of Xy is complete. Let © denote 
the subset of & of points which are not recurrent. We show that u(y ) = 0. Let 
{By :a@ = 1,2,3,...} be a basis of the topology of ’. For every a, let By, denote 
the subset of B, of points m for which there exists tg such that ®;(m) ¢ By for all 
t > to. We claim that 


s= U By. (9.4.1) 


The inclusion D is obvious. Conversely, assume that m ¢ Jo-, By. The numbers 
a such that m € By form a sequence {a; : j = 1,2,3,...}. Since m € Ba; \ iy, 
for every j, we can construct a sequence {tj} such that t; > oo and ®;,(m) € Ba,. 
Since {Ba;} is a neighbourhood basis for m, this implies that limj_,o0 P;, (m) = 
m, that is, m is recurrent. Thus, (9.4.1) holds, indeed, and it suffices to show that 
[L(Ba) = 0 or, equivalently, 

ji By \ Ba) = Ba) (9.4.2) 
for all a. For that purpose, let a be fixed and consider the sequence of subsets 

BE = U Cig SOA, Boies 
jzk 

Since Dz (BE) = BS and since ® preserves the measure, we get u(BE) = (Bo). 
Then, by compactness of 2’, we have (B®) < oo and hence u(Bo \ Bk) = 0 for 
all k. Since By C B®, this implies (By \ B‘) = 0 for all k and hence 


(Ue \ 8) =0. 


k>0 
In view of the disjoint decomposition By = (Ba ( )xs0 BE) (Uxks0(Ba \ B*)), we 


conclude that 
(Ba) = u(. () Bi). 


k>0 


Using By so Be = By \ Bes we obtain (9.4.2). This proves the theorem. 


From the proof of Theorem 9.4.2 it is clear that instead of requiring 2 to be 
compact, it is enough to assume that X 2 be complete and that 5H < ©. It is 
also clear that a similar statement holds for the Liouville measure on M and any 
measurable subset of M which is invariant under the Hamiltonian flow. 

Obviously, the Recurrence Theorem does not imply the existence of closed inte- 
gral curves. From the point of view of physics, the problem whether a Hamiltonian 
system possesses periodic integral curves is of great importance.!* This turns out to 


‘This problem has its origin in celestial mechanics. In particular, it is interesting to ask whether 
our planetary systems admits periodic orbits, that is, whether there exist initial conditions to which 
the planets would return after a finite time. 
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be a deep and difficult question, which has been intensively studied. For a compre- 
hensive presentation we refer to the books of Hofer and Zehnder [139] and McDuff 
and Salamon [206]. Here, we give an introduction to the subject. 

We start with the crucial observations that the existence of periodic integral 
curves on a regular energy surface does not depend on the choice of the Hamil- 
tonian. 


Lemma 9.4.3 


1. Let H and F be Hamiltonian functions on M and let X& be a regular energy 
surface of both H and F. Then, there exists a nowhere-vanishing function x: 
& — R such that xe = Axa. Up to reparameterizations, Xe and xe have the 
same integral curves. 

2. Let H be a Hamiltonian function on M and let X be a regular energy surface 
of H. Up to a reparameterization, every non-constant integral curve of a vector 
field on M with values in D®* coincides with an integral curve of X H. 


Proof 1. Under the assumptions made, dH; 5 and dF; ys are nowhere-vanishing sec- 
tions of the annihilator of TX’ in TM; y. Since the annihilator has dimension 1, there 
exists a nowhere-vanishing smooth function A on such that dF = A dH and hence 
xe = Xe Let 6” and &” denote the flows of xe and x with domains 4 
and J, respectively. Making the ansatz oF (m) = OU my) for some smooth 


function t: J” — R satisfying t(0, m) = 0, we find 


d d d 
Xz (GO) (m)) = arm) = a Pettam) () a t(s,m)X 7 (GF (m)) 


and hence 

d 

rraAG m) =1(®/ (m)), 
which yields 


t 
t(t,m) =) a(GF (m)) ds. 
0 


Since A is nowhere vanishing on & we may interchange F and H in this argu- 
ment. This shows that the mapping (f, 7m) +> (t(t,m), m) is a bijection (in fact, a 
diffeomorphism) from Y* onto J”. 

2. Let X be a section of D®* and let y be a non-constant integral curve of X. 
Then, X does not have equilibria in a neighbourhood of y. By possibly modifying 
X outside this neighbourhood, we may assume that it vanishes nowhere on 2’. Since 
D®* is spanned by X e there exists a nowhere-vanishing smooth function A on 
such that X =~X a Now, by replacing X Zi by X in the proof of point 1 we obtain 
the assertion. 


Since the integral curves of Xy coincide with the characteristics of D®*, cf. 
Sect. 9.1, the question 


Does a regular energy surface &' admit a periodic solution of X y? 
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can be formulated purely geometrically: 
Does a regular energy surface &' admit a closed characteristic of D®* ? 


An important class of energy surfaces for which this question can be successfully 
addressed is that of energy surfaces which are hypersurfaces of contact type, cf. 
Definition 8.5.23. The following proposition shows that in the typical Hamiltonian 
systems met in physics, such energy surfaces appear frequently. 


Proposition 9.4.4 Let (T*Q,d6, H) be a Hamiltonian system with H being of the 
form 


H@)=TE)+V(x@), &ET*Q, 


cf. (9.1.14), and let &' be a regular energy surface which does not intersect the 
zero section of T*Q. Then, & is a strict contact manifold with contact form v0. If, 
additionally, ’ is compact, then it is a hypersurface of contact type. 


Proof By assumption, 0(X 7) = 2T vanishes nowhere on 2’. Hence, °0(X4) is 
nowhere-vanishing function on » and the proof that >’ is a strict contact manifold 
with contact form 1*@ is completely analogous to that of the if-direction of Proposi- 
tion 9.3.5. It is, therefore, left to the reader (Exercise 9.4.1). The rest is obvious. 


Now, let X' be a hypersurface of contact type. Concerning our question, the 
first fundamental results were obtained in 1978 by Rabinowitz [249] and Weinstein 
[306]. They showed that under some additional topological assumptions, a hyper- 
surface of contact type in R7” admits a closed characteristic. In 1979, Weinstein 
invented the general definition of a hypersurface of contact type, cf. [307], and for- 
mulated the 


Weinstein conjecture Every hypersurface ¥ of contact type fulfilling H'() =0 
carries a closed characteristic. 


The next milestone is due to Viterbo [299]. He proved that every hypersurface 
of contact type in IR?” carries a closed characteristic. Here, the assumption that 
H'() =0 is not necessary. A breakthrough was made by Ekeland, Hofer and 
Zehnder who observed that a special type of symplectic invariants, called capacities 
[84, 138], can be used to tackle the Weinstein conjecture. Let us give an introduction 
to this approach. For a detailed presentation we refer again to the books of Hofer 
and Zehnder [139] and McDuff and Salamon [206]. Let 


Bun (r) = {(q.p) ER”: a +l pI? <r?f. 
Zon(r) = {(a, p) €R™ : (q')° + (pi)? <r?] 


denote, respectively, the open ball and the open cylinder of radius r in R2”, with the 
symplectic structure induced from the canonical symplectic structure wp on R2”. 
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Definition 9.4.5 A symplectic capacity is a mapping c from the class of all sym- 
plectic manifolds, possibly with boundary, of a fixed dimension 2n to the positive 
real numbers (including infinity) fulfilling the following axioms: 


1. Monotonicity: if there exists a symplectic embedding g : (M, w) > (N,o), then 
c(M,w) <c(N,o). 


2. Conformality: c(M, aw) = |a|c(M, @) for alla ER, a 40. 
3. Nontriviality: c(Boy (1), @o) = 1 = c(Zan(1), wo). 


Every capacity is a symplectic invariant. Indeed, if g : (M,@w) > (N,o) isa 
symplectomorphism, then g~! is a symplectomorphism, too. Application of Ax- 
iom | to both of these mappings yields c(M, w) = c(N, a). While the symplectic 
volume is a capacity for n = | (Exercise 9.4.2), Axiom 3 implies that it cannot be a 
capacity for n > 1, because it is infinite for the cylinder. Thus, provided capacities 
exist, they yield new symplectic invariants, different from the volume. For two open 
subsets U and V of (M, a) fulfilling U C V, Axiom 1 implies c(U) < c(V). To 
extend c to arbitrary subsets A C M, one puts 


c(A) = inf{c(U) :ACU open}. 
From the axioms we read off the capacities of balls and cylinders of radius r in 
(R2”, wo) (Exercise 9.4.3): 
c(Bon(r)) = mr? = c(Zan(r)). (9.4.3) 


This implies in particular that the closure of the open ball of radius r has capacity 
2 
wr“, too. 


Remark 9.4.6 Assuming that a capacity exists, as an immediate consequence of 
(9.4.3) one obtains the famous Nonsqueezing Theorem of Gromov, which states 
that the ball Bo,(1) cannot be embedded symplectically into the cylinder Z2,(r) 
unless r > 1. 


Now, let us introduce the capacity of Hofer and Zehnder: consider a symplec- 
tic manifold (M,@, H), possibly with boundary. Let #(M, w) denote the set of 
Hamiltonian functions H satisfying the following. 


(a) There exists a compact set K C Int(M) such that H is constant on M \ K and 
0< H(m) < H(M\ K) forallme M. 
(b) There is a nonempty open subset U C M on which H vanishes. 
Under these assumptions, H(M \ K) = max(H) and Xy is complete. We call 
a Hamiltonian function H € #(M,w) admissible iff it does not have a periodic 
integral curve of period T < 1. Let us denote the set of admissible Hamiltonians by 
Had(M, w). The Hofer-Zehnder capacity is defined by 
cuz(M, w) := sup{max(H) : H € Hq(M, o)}. (9.4.4) 
Note that cy z can be expressed in terms of the Hofer norm (8.8.14), 


cuz(M, ) =sup{|| H ||: H € Ha(M, o)}. 
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Theorem 9.4.7 (Hofer-Zehnder) The function cyz is a symplectic capacity. 


While it is easy to show that cyz fulfils the axioms of monotonicity and confor- 
mality (Exercise 9.4.4), it is hard to prove that it fulfils the axiom of nontriviality, 
see [139]. The proof requires techniques from the calculus of variations. We also 
refer to [139], Sect. 3.5, for examples where cyz can be calculated explicitly. 

Let us derive a criterion for the existence of closed integral curves in terms of the 
Hofer-Zehnder capacity. 


Lemma 9.4.8 Let (M,w, H) be a Hamiltonian system and let © C H~'(e) be a 
compact regular energy surface. There exist ¢ > 0, an open neighbourhood U of X 
and a diffeomorphism @ : (e — €,e + &€) x &’ — U mapping {E} x & to a regular 
energy surface d'g of H of energy E. 


Proof Choose a Riemannian metric on M and denote the corresponding gradient 
vector field of H by VH. There exists an open neighbourhood V of &’ where VH 
is nowhere vanishing. There, we can define the normalized gradient vector field 

‘ VH 

A 
| VA || 


As in the proof of Proposition 8.9.6, we can extend X from some smaller open 
neighbourhood W c V of » to a vector field on M with compact support whose 
flow @ satisfies 


H(®,(m)) =e+t 


for all m € X and t such that ®,(m) € W. It follows that there exists e > 0 and an 
open neighbourhood U C W of » such that @ defines a diffeomorphism 


g:(e—&,e+e)x YoU, g(E,m) = Pgz_-(m). 


By construction, g({E} x 2’) is a regular energy surface of H with energy E. 


As a consequence of the Lemma, U is foliated by the energy surfaces 2p, 


U= U Zp. 


Ee(e—€,e+e) 


Theorem 9.4.9 (Hofer-Zehnder) Let (M,w,H) be a Hamiltonian system and let 
YC H~!(e) be a compact regular energy surface. Assume that there exists an open 
neighbourhood U of X with finite Hofer-Zehnder capacity. Then, there exists e > 0 
such that for a dense set of parameters E € (e — €,e + €) the energy surface defined 
by E contains a periodic integral curve of X y. 


Note that the periodic solutions provided by this theorem in general do not lie on 
>» but only nearby. 
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Proof We follow [139, §4.1, Thm. 1]. According to Lemma 9.4.8, by possibly 
shrinking U we may assume that there exists ¢ > 0 and a diffeomorphism ¢ : 
(e—€,e+e)x XY > U satisfying g({E} x X) = X'g. Moreover, by Lemma 9.4.3/1, 
we have the freedom to choose a convenient Hamiltonian function which has 2 as 
an energy surface. Thus, let us choose a real number 6 fulfilling 0 < 6 <e anda 
smooth function f:(e — e,e +e) > R fulfilling 


f(E)=cyzU,o)+1 forE<e-—dorE>e+6 


5 r) 
E)=0 f —-<E< 2 
f(E) we 5s ao 
f'(E) <0 Le =a ee 
6 
f'(E)>0 foret+><E<et+d 


and let us consider the function F = f o H:M — R. Since F € #(M,w) and 
max(F) > cyz(U,@), Xf possesses a periodic integral curve y (with period 
T <1). This curve must be contained in the open subset 


U:= U LE 


Ee(e—6,e—§)U(e+$,e+8) 


of U, because X f vanishes outside. On the other hand, since dF = (f’0 H) dH and 
since f’ o H is nowhere vanishing on U, F and H have the same energy surfaces 
in U. Now, Lemma 9.4.3/1 yields that, up to a reparameterization, y is an integral 
curve of Xj. Thus, for arbitrarily small 0 < 6 < ¢, we find a periodic integral curve 
of X 7 with energy between e— 6 and e+ 6. In the above argument, e can be replaced 
by any value e’ € (e — €,e + €). Then, 6 has to be chosen so that 


0<d<min{e’-—e+¢e,e+e-e'}. 


This yields the assertion. 


Now, let us turn back to hypersurfaces of contact type. Corollary 8.5.26 implies 


Corollary 9.4.10 Let (M, w) be a symplectic manifold and let &' be a hypersurface 
of contact type. If &’ admits an open neighbourhood U with finite Hofer-Zehnder 
capacity, then the characteristic distribution D®= of &’ possesses a closed charac- 
teristic. 


Proof By Proposition 8.5.25 and by the monotonicity of cyz, we may assume that 
on U there exists a Liouville vector field Z transversal to &’. Then, by Corol- 
lary 8.5.26, there exists ¢ > 0 such that the flow ® of Z induces a diffeomorphism 


g:(—&,€)x YOU, g(t,m) = ®,(m). 


Define H: U > R by A := pr, og! and consider the Hamiltonian system 
(U, w, H). By construction, the hypersurfaces g({E} x +’) are the energy surfaces 
of H. By Theorem 9.4.9, X17 possesses a periodic integral curve y on the hyper- 
surface Lip := g({E} x 2) for some E € (—¢, €). Since X'g¢ is an energy surface 
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of H, the induced vector field X - takes values in the characteristic distribution 
of wy, and y is a characteristic of the latter. By point 2 of Corollary 8.5.26, then 
pro oy! oy is acharacteristic of D®=. 


Remark 9.4.11 


1. There are examples of hypersurfaces in R®”, n > 3, which do not admit any 
closed characteristic [104, 105]. Thus, the condition on » to be of contact type 
cannot be removed. 

2. Using the Hofer-Zehnder capacity, one can give an alternative proof of the result 
of Viterbo [299] stating that every hypersurface of contact type in R*” admits a 
closed characteristic, see [206, Thm. 12.32] or Sect. 4.3 of [139]. 

3. For a discussion of the special case M = T* Q, endowed with the canonical sym- 
plectic structure, and H belonging to the class defined by (9.1.14) we refer to 
[139]. In this case, some nice geometrical ideas can be applied. In particular, for 
E > max V the problem of finding closed characteristics reduces to the problem 
of finding closed geodesics for a special metric on T* Q, called the Jacobi metric, 
which is constructed from the Hamiltonian. 


More generally, one can consider the case of an abstract contact manifold, that 
is, one can discard the Hamiltonian and the ambient symplectic manifold. Then the 
Weinstein conjecture reads as follows. Does a Reeb vector field on a compact con- 
tact manifold admit a periodic integral curve? There are partial results concerning 
this question, see [135] and [136]. For a survey on the Weinstein conjecture and re- 
lated aspects, in particular those of nearby and so-called almost existence theorems, 
we refer to [106]. Some aspects concerning the behaviour of a Hamiltonian system 
near a critical integral curve will be presented in the next section. 


Exercises 

9.4.1 Carry over the proof of Proposition 9.3.5 to Proposition 9.4.4. 

9.4.2 Show that for any 2-dimensional manifold (M, w), the total area | Af M @| isa 
symplectic capacity. 

9.4.3 Prove Eq. (9.4.3). 

9.4.4 Prove that the Hofer-Zehnder capacity, defined by (9.4.4), fulfils the axioms 
of monotonicity and conformality. 


9.5 The Poincaré Mapping and Orbit Cylinders 


The aim of this section is to study the dynamics of a Hamiltonian system near a pe- 
riodic integral curve. An important tool for this is the Poincaré mapping introduced 
in Sect. 3.7. As in that section, depending on the context, by an integral curve we 
mean the curve itself or the corresponding submanifold, cf. Proposition 3.2.11. If y 
is periodic, all of its points are regular with respect to H. Therefore, when study- 
ing the behaviour of the flow in the vicinity of y, by modifying H outside of some 
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neighbourhood of y we may assume H to be a submersion if necessary. In particu- 
lar, there exists an open neighbourhood of y in H~!(H(y)) which is an embedded 
submanifold of M. We will refer to such a neighbourhood as a local energy surface 
of y. 

Our first aim is to show that in the case of a Hamiltonian system the decom- 
position of the phase space into energy surfaces induces a decomposition of every 
Poincaré mapping into a family of symplectomorphisms. 


Proposition 9.5.1 Let (M,w, H) be a Hamiltonian system and let y be a periodic 
integral curve of Xq. Let mo € y and let (PY, HW, ©) be a Poincaré mapping of y 
at mg. 


1. Yan be shrunk so that it is foliated by embedded submanifolds 
Pri= POH'(E) 


with E in some open interval I containing Ey = H(mo). 

2. For every E € I, the submanifolds Pz and We := WO YF are symplectic sub- 
manifolds of (M,@) and the mapping Og : We > Ye, induced by restriction 
of ©, is a symplectomorphism onto its image. 


The triple (Yz,, Wz,, Ox,) is a Poincaré mapping for y with respect to the flow 
induced on a local energy surface of y. Accordingly, it is called an isoenergetic 
Poincaré mapping for y. 


Proof 1. Since X y is transversal to Y, all points of W are regular with respect 
to H. Hence, H; g is a submersion and the Constant Rank Theorem yields a local 
chart (U,«) on F at mp with image B x IC R2"-! where B is some open ball in 
R2"-2 and I is some open interval containing H (mo) such that «(H “l(E)NU)= 
B x {E}. If we replace Y by ANU, we obtain the assertion. 

2. Let E € J and let j : Az — M denote the natural inclusion mapping. With- 
out loss of generality, we may assume that H is a submersion on M. Then, Wz 
is a submanifold of some regular energy surface Y¢ C H~!(E). Since the charac- 
teristic distribution of wy, is spanned by the induced vector field X a , the kernel 
of (Wm) }T,,Z_ 18S spanned by Xq(m). Since Xy(m) ¢ Tm Ye and since the codi- 
mension of T, Ye in Ty X'g is 1, we conclude that (@m) r,, Ap = (J*@)m is sym- 
plectic. This shows that Wg is a symplectic submanifold of M. Next, since © is 
defined by the flow of Xy and H is a constant of motion, © restricts to a mapping 
Or: Wr —> Ae. Since © is a diffeomorphism onto its image and since Wg and 
A» are embedded submanifolds of W and “, respectively, Og is a diffeomor- 
phism onto its image, too. It remains to prove that Og is symplectic, that is, 


O;(j*a) = j*o. 
By Formula (3.7.2), for m € Wr, we have 
Jj 0° Og(m) = Orn) (j (m)), 
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where t denotes the first return time function of Y. To calculate the tangent map- 
ping of j o Og at m, let X € T, Wz and let t > S(t) be a curve representing X. 
Then, 


d 
(j 0 Oz) in (X) = a Pro (i(5@)) 
to 


d ; d ; 
= di pO (j(m)) + dt Perms (5(t))) 


=1'(X)Xy (Prim) (i (m))) a (Prim) 2 j'(X) 
= (Prem) {t! (X) Xu (im) + OOF, 
where t’(X) € R. Using this and (®,)*o@ = a, for X, Y € Ti, We we obtain 


(02 (j*@)),, (X,Y) = @jim(t! (4) Xn (jm) + j(X), eV) Xn (j(m)) + iW) 
= 1 (X)@j¢m) (Xu (j(m)), J’ )) 
+ t/(Y¥)@jcm)(j'(X), Xu(j(m))) + (j*e),, (X,Y). 


Up to a factor, the first term yields dH(j’(Y)) = Y(H 0 j) =0, because Ho j = E. 
In the same way, the second term vanishes. Thus, @¢ is symplectic, indeed. 


Remark 9.5.2 


1. The Uniqueness Theorem 3.7.5 implies a uniqueness theorem for the isoener- 
getic Poincaré mapping (Exercise 9.5.1). 

2. Periodic points of © correspond to periodic integral curves of X 7. In particular, 
Og, (mo) = mo, where H(mo) = Eo. We show that the eigenvalues of the tangent 


mapping 
(Op) ing: Tmo Pi > Tiny YEo (9.5.1) 


i 


coincide with the Floquet multipliers of y, cf. Remark 8.2.11/1. Since (O£))n, 
is given by the restriction of ©/,, to the invariant subspace Ting Pe, C Tiny Y 
and since the codimension of this subspace is 1, the spectra of (Og, nig and 
cae differ by a single eigenvalue 4. On the other hand, by Proposition 3.7.6, 
the spectra of ©,,, and (®r),,, differ by the single eigenvalue 1. Hence, the 
spectra of (Ox,)i, ; and (®r),,, ; differ by the two eigenvalues 4 and 1. Since 
both mappings are symplectomorphisms, by Proposition 7.4.1/4, they have unit 
determinant. Thus, A = | and hence the eigenvalues of (©z,),,,, coincide with 
the Floquet multipliers, indeed. 


/ 
mo 


Next, we will prove that periodic integral curves of Hamiltonian systems are not 
isolated. Rather, they appear in 1-parameter families called orbit cylinders. 


Definition 9.5.3 (Orbit cylinder) Let (MV, w, H) be a Hamiltonian system. An orbit 
cylinder of (M,w, H) is an embedding ¢ : S' x I —- M, where I is some open 
interval, such that ¢(S! x {c}) is the image of a periodic integral curve of X y for 
every cé€ 1. If Hof(S! x {c}) =c, then ¢ is called regular. 
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For a regular orbit cylinder, the parameter labelling the periodic integral curves 
coincides with the energy. Therefore, a regular orbit cylinder intersects every en- 
ergy surface transversally. This implies, in particular, that regular orbit cylinders are 
symplectic submanifolds: since their tangent spaces are pointwise spanned by X 
and some vector Y satisfying dH (Y) 4 0, the 2-form ¢*w vanishes nowhere. 


Theorem 9.5.4 (Existence of orbit cylinders) Let (M,w, H) be a Hamiltonian sys- 
tem and let y be a periodic integral curve of X y. If | is not a Floquet multiplier 
of y, then there exists an open interval I containing Ey = H(y) and a regular orbit 
cylinder € :S! x I > M such that VE) =Y- 


Proof Choose mg € y. According to Proposition 9.5.1, there exists a Poincaré map- 
ping (¥, W, ©) for y at mo such that Y is foliated by embedded submanifolds 
Pr = PO H-'(E). We choose a local chart (U,«) on F at mo, adapted to 
this foliation. Without loss of generality, we may assume that x(mo) = (0, Eo) 
and «(U) = R?"~* x I with some open interval J containing Eo. Denote Wr = 
pr, ok (We) C R*"~? and let 


O=Kxo@ok!: Wy x 1> R”"? x1 
be the local representative of ©. Consider the mapping 
0: Wap x I> R-?, 0 (x, E) := pr, (O(x, E) —x). 


Since ©(mo) = mo, this mapping vanishes at the point (0, Eg). We claim that » 
fulfils the assumptions of the Implicit Function Theorem of classical calculus at 
that point. To see this, let gz, denote the induced mapping x+> (xX) := (x, Eo) 
and let Y € R2”~? be such that (pE9)9(¥) = 0. Then, Oo, Ey) (¥> 0) = CY, 0), that 
is, (Y, 0) is an eigenvector of Oo, Eo) with eigenvalue |. Since the eigenvalues of 


the restriction of Oo Ep) '© the invariant subspace R*"~* coincide with the Floquet 
multipliers of y and since, by assumption, | is not among them, we conclude Y = 0. 
Therefore, (Yx,)p is bijective. Now, the Implicit Function Theorem yields that / 
can be shrunk so that there exists a differentiable mapping h : I > Wire satisfying 
g(h(E), E) = 0 and hence O(h(E), E) = (A(E), E) for all E € J. Let us denote 
W(E) = «—!(h(B), E). By construction, y yields a fixed point of ©, that is, a point 
on a periodic integral curve, for every energy value E € /. Finally, using w and the 
flow ® of X, we construct the desired orbit cylinder: let t be the first return time 
function of Y. Consider the mapping 


ft: Rx I> M, fo(t, E) := Pir wwe) (W(E)). 
Since ®,(y(£))(W(E)) = O(w(E)) = w(4), this mapping is 1-periodic, 
fot + 1, E) = Serre (W(E)) = Piece (W(E)) = Colt, E). 


Thus, it induces a smooth mapping ¢ : S'! x J + M. Obviously, ¢ is an injective 
immersion. By construction, ¢(S! x {E}) is the image of a periodic integral curve 
of Xj and ¢(S! x {Eo}) = y. It remains to prove that ¢ is an embedding, that is, 
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that it is open onto its image. By shrinking J once again we may assume that ¢ 
extends to a continuous mapping ¢ on the closure S! x T. Since S! x T is compact, 
we can apply the argument of Remark 1.6.13/2, which showed that every compact 
submanifold is embedded, to prove that ¢ is open onto its image. Then, so is ¢. 


Remark 9.5.5 The orbit cylinder at y provided by Theorem 9.5.4 is unique in the 
following sense. If ¢ : s! x I — M is another orbit cylinder containing y, there 
exists an open interval FOINI containing Eo and a smooth family of diffeomor- 
phisms QE: s'=+ S|, Fe I, such that f(x, E)=C(¢e(x), E) for all x € S! and 
Eel. 


Now, we combine orbit cylinders with the Poincaré mapping. 


Proposition 9.5.6 Let ¢:S! x I > M be a regular orbit cylinder, let Ey € I and 
let (P,W, ©) be a Poincaré mapping for yg,. Assume that F is foliated by level 
sets Pg of H according to Proposition 9.5.1, with E in some open interval J con- 
taining Eo. 


1. There exists a unique embedding 1: I J > M such that PN ye =«(E) for all 
EelndJ. 

2. ForallE € INS, (A, W, O) is a Poincaré mapping for yg and (Yr, Wr, Ox) 
is an isoenergetic Poincaré mapping for yz. 


Proof The embedded submanifolds Y and ¢ (S! x 1) are transversal, because X y is 
transversal to the former and tangent to the latter. Thus, by the Transversal Mapping 
Theorem 1.8.2, the intersection is an embedded submanifold of M. Obviously, the 
intersection can be parametrized by E € IM J. The proof of point 2 is left to the 
reader (Exercise 9.5.2). 


Since Proposition 9.5.6 implies that the isoenergetic Poincaré mappings © de- 
pend smoothly on E, we conclude 


Corollary 9.5.7 The Floquet multipliers of the periodic integral curves yr of a 
regular orbit cylinder depend smoothly on E. 


In general, a regular orbit cylinder cannot be extended to all E € R. Rather, it will 
meet periodic integral curves having | among their Floquet multipliers or it may 
degenerate to an equilibrium. Under certain conditions, the orbit cylinder can be 
continued nonetheless, though in a nonregular way, see e.g. [1, Thm. 8.2.4]. Usually, 
however, bifurcation phenomena like the degeneration to an equilibrium already 
mentioned or a splitting into several new orbit cylinders will occur. A discussion 
of the types of bifurcations present in Hamiltonian systems with two degrees of 
freedom can be found in [1, §8.6]. The following classical result of Lyapunov [188] 
which dates back to 1907 describes bifurcation of an orbit cylinder from a linearly 
stable equilibrium under certain non-resonance conditions. The periodic solutions 
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can be viewed as nonlinear continuations of the linear normal modes of the system. 
They have periods close to the periods of the linearized system. 


Theorem 9.5.8 (Lyapunov Centre Theorem) Let (M,w, H) be a Hamiltonian sys- 
tem, let mo be an equilibrium of X q of energy Eg = H(mo) and let {A,...,An} 
be a basis set'* of characteristic exponents for mo. Assume that i. = ia with a > 0 
and that 4; is not an integer multiple of 4 for j =2,...,n. Then, there exists 5 > 0 
and a regular orbit cylinder with energy interval (Eo, Eo + 5) such that in the limit 
E — Eo, the integral curves yg approach mo and the period of yz approaches 2m 
Moreover, the set of Floquet multipliers of yz, counted with multiplicities, tends to 
fea *2, ea, re ein ‘ eat} 


Proof Since the statement is of local character, we may assume (M, w) to be given 
by R” with the standard symplectic structure. Moreover, we can put mo = 0 and 
Ey = H(mo) = 0. We denote Q := 5H" (0) and view it as a quadratic form on 
IR2". Moreover, we denote A := Hessjn)(X #7). One can show that the Hamiltonian 


vector field XQ coincides with the linear vector field on IR?” represented by A (Ex- 
ercise 9.5.4). The basic idea of the proof is to rescale!> the Hamiltonian as 


1 
(x) = 52 HEX), xe R*”, 


with e € R. Then, Hp = Q= 5H (0). A brief computation reveals 
eXy,(x) = Xy(ex), xe RR, 


Thus, if e 4 0 and if y is a periodic integral curve of Xy,, then ey is a periodic 
integral curve of Xy with the same period. Hence, we may proceed as follows. 
First, we determine a periodic integral curve yo of Xy, = XQ = A. Then, we use 
the Implicit Function Theorem to generate from yo a family of closed curves y, for 
small values of ¢ such that y, is an integral curve of Xy,. Finally, we pass to the 
curves €y, and express € in terms of the energy E, which yields the desired integral 
Curves YE. 

To find a periodic integral curve yo of A, we use the fact that A has a pair of 
eigenvalues given by tia. Accordingly, there exists a symplectic basis {e1, ..., e2n} 
such that A has the form 


0 ald 
0 O|Ar 


14See Remark 8.2.11/2. 


‘This is a standard tool from bifurcation theory also called the blowing up technique, see [1] for 
historal references. 
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with respect to the decomposition R*” = F, @ Fy ® F, where F; = Re, Fo = 


Re,,+1, and F is spanned by e2,..., €;, €n42,---, €2n- Then, 
cos(at) sin(at)|} 0 
b2=e'A =| —sin(at) cos(at)| 0 
0 0 |etAr 
hence all points of F, © F2 are periodic with period To := =. Thus, for yo we may 


choose the integral curve of A through the point x9 = e;. To compute Q(xo), we 
observe that A}, F, 1s the Hamiltonian vector field of the quadratic form (q, p) > 
5a(q? + p’) on the symplectic vector space F; ® F>. Since a quadratic form is 
uniquely determined by its first partial derivatives, and hence by its Hamiltonian 
vector field (in case it is defined on a symplectic vector space), we conclude that 
Qinem(q,. Pp) = $(q? + p’). Thus, Q(xo) = $. 

Next, to generate the curves y, from yo, we use the same idea as in the proof of 
Theorem 9.5.4, that is, we look for solutions of the equation 


oP (x) -x=0 (9.5.2) 


in the variables x close to xg and t close to 7p for small e. Now, for ¢ = 0, the 
set of solutions (t,x) of this equation is given by the continuum t = 7p and x € 
F, ® F». In order to obtain (7p, x9) as an isolated solution for ¢ = 0, one must 
impose appropriate constraints on the set of solutions of (9.5.2). First, we require 
x € F| & F, because this subspace is transversal to yp. Second, we require 


H,(x) = - (9.5.3) 
To implement this condition, write x = ge + y € F| @ F asa pair (q, y) withg ER 
and y € F, and consider (9.5.3) as an equation in the indeterminate q. Since for 
€ = 0 and y=0 we have the unique solution g = | and since 


dq l(q=l,y=0,e=0) 
the Implicit Function Theorem yields a smooth function (y, €) + q(y, €), defined 
for y in a neighbourhood of 0 in F and for small ¢, such that H-(q(y, €), y) = 5. 
Finally, since H, is invariant under @"e | it suffices to consider Eq. (9.5.2) on the 
energy surface defined by (9.5.3). This can be achieved by applying the orthogonal 
projection pr : R2” — F> ® F, because the latter is bijective in a neighbourhood of 
the point (¢(0, €), 0) in that energy surface. Thus, we arrive at the equation 


gly, t; €) := pr(®;/* (q(y, €), y) — (aly, €).y)) =0 


in the variables y € F and ¢ € R close to y = 0 and ¢ = 7). For the matrix of the 
partial derivatives of g = (y'?, y”) with respect to y and f at the solution y = 0, 
t = Tp and ¢ = 0 we find 


ay", p") E 0 | 
ay) = (9.5.4) 
act, y) ly=0,t=T7p,e=0 0 elvAr — Ip 


=a 0, 
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Since, by assumption, none of the eigenvalues of Af is an integer multiple of a 
and since 7p = 2m the linear mapping e/°4F does not have 1| as an eigenvalue. 
Therefore, (9.5.4) is bijective. Now, the Implicit Function Theorem yields a map- 
ping ¢ (y(e), T(e)) for small ¢ such that p(y(e), T(e), ¢) = 0. Thus, we obtain 
a mapping € + x(e) := (g(y(e), €), y(e)) € IR2” such that x(e) is a periodic point 
of Xy, with period T(e). Let y, be the corresponding periodic integral curve of 
Xy,. As explained above, then ey, is a periodic integral curve of Xy with period 
T (e). Obviously, for e — 0, these integral curves approach mg and the period T (¢) 
approaches 7p = an Moreover, (oF (e)x(e) approaches Co y= e404. Taking the 
affine subspace ex(¢) + F as a Poincaré section for the integral curve ey,, it is 
easy to see that for ¢ — 0, the Floquet multipliers of this integral curve tend to the 


eigenvalues of elvAr , that is, to 


eloh2 : elo, ie elohn ‘ e@loAn. 
It remains to express ¢ for e > 0 by the energy E with respect to H: since 
2 14 
E= HA (&ye) = €° He(Ye) = 7° a, 
we obtain e(E) = ,/ 3k and thus yg := €(E)y¢(g) is a periodic integral curve of X 


of energy EF with period T (¢(£)). Now, the construction of the orbit cylinder from 
the curves yf is completely analogous to that in the proof of Theorem 9.5.4. 


Remark 9.5.9 


1. From the above proof it is clear that the point mp9 complements the orbit cylinder 
to a two-dimensional embedded topological submanifold. One can show that if 
H is of class C’*”, the embedding is of class C” and if H is analytic, the embed- 
ding is analytic [273]. Moreover, this submanifold is symplectic, because orbit 
cylinders are symplectic submanifolds and the tangent space at mg coincides with 
the eigenspace of Hess,,(X 77) corresponding to the pair of eigenvalues ia. 

2. If the non-resonance condition in the Lyapunov Centre Theorem is violated, no 
periodic solutions need exist, see Example 9.5.10 below. However, under certain 
additional assumptions, remarkable generalizations have been found. If one ad- 
ditionally assumes that the Hessian of H is positive or negative definite, that is, 
H”(0) > 0 or H’’(0) < 0, then at least n geometrically distinct periodic solutions 
exist. This result belongs to Weinstein [304] and Moser [224]. Moreover, Fad- 
dell and Rabinowitz showed the existence of periodic solutions under the weaker 
assumption sign(H”(0)) 4 0 [88]. 

3. One can replace the equilibrium mg € M by a non-degenerate Morse-Bott min- 
imum along a closed symplectic submanifold N Cc M and one can ask whether 
every level set of H near N carries at least one periodic integral curve. The so- 
called generalized Weinstein-Moser conjecture states that this is true. For some 
particular cases, this conjecture has been proven, but up to our knowledge, in the 
general case the problem is still open. We refer to [106] for a thorough discus- 
sion. 
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Example 9.5.10 Following Moser [226], we consider the Hamiltonian 
2 1 
H(@,2) = 5(Izol” —Izil?) + (Izal? + lzil”) Reza) (9.5.5) 


on R*, where z; = qi +ip;. One can show the following (Exercise 9.5.5). The point 
Z| = Z2 = 0 is an equilibrium. There are no periodic solutions. The eigenvalues 
of the linearized system are +i and the non-resonance condition is not satisfied. 
Moreover, the Hessian of H has signature 0. 


By the above theorems, for a given periodic integral curve y, generically there 
is a continuum of periodic integral curves nearby. However, each of these curves 
lies on a different energy surface. It is interesting to ask whether there exist periodic 
integral curves near y lying on the same energy surface. Thus, our next task will be 
the study of the behaviour of our system on a given energy surface near a critical 
integral curve. This will be one of the topics of the subsequent section. 


Exercises 

9.5.1 Prove the statements of Remark 9.5.2. 

9.5.2 Prove point 2 of Proposition 9.5.6. 

9.5.3. Prove Remark 9.5.5. 

9.5.4 Complete the proof of Theorem 9.5.8 by showing that the Hamiltonian vector 
field XQ coincides with the linear vector field Hessp(X 7). 
Hint. Show that in the standard Darboux coordinates q' , pion R2" , both linear 
vector fields are represented by the matrix 


aH aH 
qi dp; Opi 9Pj 


aH eH 
aq'dqi apidg! 
9.5.5 Prove the statements of Example 9.5.10. 


9.6 Birkhoff Normal Form and Invariant Tori 


The basic idea for our further analysis is to bring the Hamiltonian system to a nor- 
mal form. This idea goes back to Poincaré, see [240]. It has been deeply analyzed 
and further developed by Birkhoff, see [47]. First, we will prove the normal form 
theorem for a symplectomorphism in a neighbourhood of a fixed point. Then, we 
will state the analogous theorem for a Hamiltonian near an equilibrium. In both 
cases, the unperturbed normal forms give rise to a foliation of the phase space into 
invariant tori, whose physical meaning will be discussed in some detail. Next we 
comment on KAM theory, which guarantees the persistence of many of these tori 
under the perturbation which is caused by considering the full mapping or the full 
Hamiltonian. Moreover, using the normal form, we will prove the Birkhoff-Lewis 
Theorem, which states the existence of infinitely many periodic points in the neigh- 
bourhood of a given periodic integral curve lying on the same energy surface. 
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Definition 9.6.1 (r-Elementarity) 


1. A fixed point mp of a symplectomorphism is called r-elementary if some (and 
hence any) basis set!® {A1,...,An} of characteristic multipliers of mo has the 
property that 


k kn 
ee a a 


for all integers ki,...,k, with 0 < am , |ki| <r. Analogously, a periodic inte- 
gral curve y of a Hamiltonian vector field is called r-elementary if some basis 
set of Floquet multipliers has this property. 

2. An equilibrium of a Hamiltonian vector field is called r-elementary if some basis 
set (41,..., fn) Of characteristic exponents of mo satisfies 


kyu tess +kaln AO 
for all integers kj, ..., ky with 0 < )“"_, |kil <r. 
Theorem 9.6.2 (Birkhoff normal form) Let (M,w) be a symplectic manifold of 
dimension 2n and let W be a local symplectomorphism'’ of open subsets of M. Let 


mg be an elliptic and 4-elementary fixed point of YW. Then, in a neighbourhood of 
mo there exist Darboux coordinates such that © is given by 


Gk = COS(&x) qx — Sin(ax) Pk + fx, Pk = sin(az) gx + cos(arx) pe + gx (9.6.1) 
with 
‘ I) 5 2 
Ak = ag + a Bul, = 5 (ai + Dj )s (9.6.2) 
Here, {ela ee elan} is a basis set for the characteristic exponents of mo, Bai is a 


real (n x n)-matrix and fx, gx are C™-functions of the variables qj and p; such 
that all their partial derivatives vanish at the origin up to order 3. 


Denoting x = (q, p)!, X = diag(a1,...,4y,41,...,@,) and 
n 
B=diag(h1,...,bnibi,....bn), be = > Bul, 
l=1 


up to fourth order, Formula (9.6.1) can be rewritten as 
ey ee, eee (9.6.3) 
The following proof is along the lines of Theorem 7 in [225]. The reader should 


recall the notion of a generating function of a symplectomorphism, see Sect. 8.8. 


Proof Since the statement of the theorem is local, we may assume M = R2", en- 
dowed with the canonical symplectic form given by J. Moreover, we can assume 


'6Cf. Remark 8.2.11. 
NIt is enough to assume the W is of class C3 [225]. 
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that the fixed point coincides with the origin of R*”. Since mo is elliptic, the eigen- 
values Ax of A := W’(0) lie on the unit circle. By 4-elementarity, they are all distinct. 
Hence, Proposition 7.4.5 yields a symplectic basis such that 


A=e/*, (9.6.4) 


Let us decompose YW = A 0 ¢. Then, ¢’(0) = id. In what follows, for a function F 
we denote Fy := (F’(x))!. 


Lemma 9.6.3 There exists a homogeneous polynomial Y of order v + 1>3 such 
that the Taylor expansion to order v of at the origin is given by 


o(x) =xt+ JAK) +... (9.6.5) 


Proof of Lemma 9.6.3 Let us denote (x) = (q, p)’. Since $'(0) = id, @ has a 
generating function of the second kind of the form (q, p) > q'p + S(q, p), where 
S has vanishing partial derivatives at the origin up to order 2. Then, 
- a _ = 0 7 
G=a+>—-SGQP), Pi=Pi—- ~—SG.P)- (9.6.6) 
OPi Ogi 
Expanding S(q, p) and @¢ into a Taylor series at the origin and comparing coeffi- 
cients in (9.6.6), one finds that Y is given by the first non-vanishing Taylor term 
of S. 


Lemma 9.6.4 Let F'(q, p) be a homogeneous polynomial of order v + 1 (the same 
order as #) and let t be the canonical transformation defined by the generating 
function of the second kind (q, p) > q'p + F(q, p). Then, the canonical transfor- 
mation 


W=rt !oWor (9.6.7) 


can be represented by VW = A o ¢, where the Taylor expansion to order v of @ at the 
origin is given by (9.6.5) with FY replaced by 


P(x) = P(x) — F(Ax) + F(x). (9.6.8) 


Proof of Lemma 9.6.4. We rewrite (9.6.7) as tT o W = W ot and compare the terms 
of order v in the Taylor expansion of both sides. The right hand side yields 


W ot(x)=Aogor(x) 
= Aoog(x+ JFx(x)) 
= A(x+ JFy(x) + JAg(k+ JFR(®)) 
and the left hand side yields 
ToW(x)=1t0Ao g(x) 
= 1(Ax+ AJ Yx(x)) 
= Ax+ AJ Py(x) + J Fy(Ax t+ AJ Px(x)). 
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Thus, in order v we obtain the relation 
AJ (F(x) + Px (x)) = AJ Ax(X) + J Fy (AX). 
Multiplying from the left by J~'A7~! and using that A is symplectic, A7 /A = J, 
we obtain 
(P(x) — P(x) + F(Ax) — F(x), = 


Since all terms in the bracket are homogeneous of degree v + 1, the assertion fol- 
lows. 


Proof of Theorem 9.6.2 (continued) Now, we will analyze (9.6.8) in order 3 and 4. 
For that purpose, it is convenient to pass to complex variables zx = qx + ipx. In 
these variables, the homogeneous polynomials F, Y and F take the form 


Pa) Pipe e Bay Fee Bay eae: 


where 


n 
=| las See +o) =v tl. 
k=1 k 


Moreover, by formula (9.6.4), in these variables the mapping xt> Ax is given by 
(Zk, Zk) > (Anz, Ay Zk). 
Thus, Eq. (9.6.8) takes the form 
(AP-° —1) Fog = Poo — Poe (9.6.9) 


where A?~? =[]p_, Ay °*. The factor A°~° — 1 vanishes iff 


n 

Yo (ox —op)ap = 20m (9.6.10) 

k=1 
for some integer m. The order we have to start with is v = 2. Here, > (ox + ox) = 3. 
This implies, in particular, that }* px, 4 >> ox. On the other hand, since the fixed 
point is 4-elementary, the only solution of (9.6.10) is o =o. Thus, A?~° — 1 does not 
vanish and we can choose F’pg such that Poo vanishes. By performing the canonical 
transformation t defined by F we obtain a new canonical transformation for which 
v = 3. Here, a similar analysis applies, but the solution p = o is allowed now, so 
that P, ‘pp = Pop and F,, remains undetermined. For p 4 0, we choose Fy, such 
that Poo vanishes and we fix F by setting F,, = 0. Thus, Pisa homogeneous 
polynomial of order 2 in z%Z,% = 21k: 


- | 
Pq, Pp) =- 2 Bitlet I= 5eete = 5(4e + Pi). 9.6.11) 


To summarize, by a transformation of the form (9.6.7) we have brought to the 
form W = Ao@, with ¢ given by 


XHX+JA4+-:-=x—JBx+---. 
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Using (9.6.4) we conclude that ¥ is given by 
xh eX (x — JBx+--) =e JOB +... 


because X, B and J commute. This is the normal form up to order 3 stated in the 
theorem. 


Remark 9.6.5 


1. Formula (9.6.1) is called the Birkhoff normal form of the symplectomorphism 
W in the neighbourhood of the fixed point mo. The coefficients ag and fx; are 
uniquely determined by W. Therefore, they are called the Birkhoff invariants 
of W. 

2. It is obvious from the above proof that the procedure described can be iterated to 
arbitrary order. 

3. Theorem 9.6.2 applies in particular to the isoenergetic Poincaré mapping of an 
r-elementary periodic integral curve of a Hamiltonian system (but of course not 
to the period mapping). Along the corresponding orbit cylinder, the Birkhoff 
invariants depend smoothly on the energy E. The argument is the same as for the 
Floquet multipliers, cf. Corollary 9.5.7. 


Definition 9.6.6 Let W be a local symplectomorphism of a symplectic manifold 
(M, w). Let mo € M be an elliptic 4-elementary fixed point of W given in normal 
form (9.6.1). Then, mo is called non-degenerate if det 8B 4 0. 


Now, for a moment, let us ignore the higher order terms f; and gy in (9.6.1) and 
let us consider the symplectomorphism 


W(x) = eT X+B)y 


cf. (9.6.3). Moreover, let us assume that the fixed point mo is non-degenerate. It is 
clear that the functions (1), I2,..., J,) define a foliation of the open submanifold 


R™ \ {(q,p) €R™ : , =0 for some k} = T" x (Ry)” 


into n-dimensional tori. By formula (9.6.2), the assumption that mg be non- 
degenerate can be rewritten as 


that is, the tori can be also labelled by the set (a1, @2,..., @,) of angles of rotation. 
By this labelling, these tori fall into two classes: a torus is called non-resonant, if 
the angles a; are rationally independent, that is, if for k € Z” we have 


n 
YS kia ¢ 2nZ. (9.6.12) 
i=l 


Otherwise, the torus is called resonant. Both the resonant and the non-resonant tori 
constitute dense subsets. Moreover, the set of non-resonant tori has full measure 
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in phase space. Note that the set of resonant tori further decomposes into subsets 
consisting of tori, for which there are n — 1,n —2,..., 1 rationally independent 
angles ax. All these subsets are also dense and for the case of one independent angle 
every point on the corresponding torus is necessarily periodic. From this geometric 
picture the following important conclusions can be drawn: 


(a) Every neighbourhood of the fixed point contains a periodic orbit of W. Indeed, 
since the set of resonant tori with one independent angle is dense, every neigh- 
bourhood of the fixed point contains such a torus. On this torus, every point m 
is periodic of some period NV. Then, (wk m), k=1,...,N — 1} is the corre- 
sponding periodic orbit. 

(b) Every neighbourhood of mp contains uncountably many invariant tori with the 
property that every orbit of W is dense in its torus. Indeed, the set of non- 
resonant tori contained in a given neighbourhood is dense in this neighbour- 
hood. For each torus of this type, every orbit of W is dense. Density statements 
of this type can be shown using arguments from ergodic theory, see e.g. §51 
of [18]. 


Next, let us consider a Hamiltonian system (M,@, H) and let us assume that 
mo is an elliptic equilibrium of Xz, fulfilling a similar non-resonance condition. 
Then, by a completely analogous procedure as in the proof of Theorem 9.6.2, in a 
neighbourhood of mo one can bring the Hamiltonian to the following normal form. 


Theorem 9.6.7 Let (M,w, H) be a Hamiltonian system and let mo € M be a 
4-elementary elliptic equilibrium of X q. There exist Darboux coordinates such that 


1 
H(a.p)= dah +5) | Bulelito-, (9.6.13) 
I kl 


with I given by (9.6.11) and Bx; being a uniquely determined n x n matrix. 


Proof See Theorem 1.8.11 in [139]. 


Let us discuss the dynamics near the equilibrium by ignoring the higher order 
terms in the normal form. The resulting Hamiltonian will be denoted by 


~ 1 
H=) al + > Yo Belch. 
1 kd 


We complement the J; to symplectic polar coordinates by choosing angle coordi- 
nates 2; on the tori: 


Pk=vV 21k sin Vx, Gk = V 21k cos Dx. 
Then, 


So dpe A dqe = dh Add (9.6.14) 
k k 
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and the Hamilton equations take the form 


pat? i,=0 (9.6.15) 
k= alk ’ k= * Ue 
Thus, the flow of X 4 leaves the tori T” x {7} invariant and on each torus we have a 
quasiperiodic motion 


Pe=vV 21k sin(V_ + @xt), Gk = V 21k cos(V_ + wt), (9.6.16) 


with characteristic frequencies 


aH 
= —— = an + > Bul. (9.6.17) 
Ol ; 
As above, let us assume that the system is non-degenerate, that is, det 6 4 0. Then, 
since 


OWk 

an,’ 

the invariant tori can be labelled by the frequencies and we have a completely anal- 
ogous picture of resonant and non-resonant tori as described after the proof of The- 
orem 9.6.2. If a torus X17; is non-resonant, that is, if the frequencies w on this torus 
are rationally independent, 


Bu = 


n 

by wk; #0 forall k; €Z, (9.6.18) 

i=1 
each integral curve is dense in 2. This can be proved using elementary arguments 
from ergodic theory, see §51 of [18]. Since the resonant tori are also dense, for any 
neighbourhood of the equilibrium there exists a resonant torus on which we have a 
periodic solution. Since, by (9.6.14), the variables J; Poisson-commute, they consti- 
tute a system of n commuting constants of motion, that is, the system defined by H 
is integrable. This notion was already introduced in Sect. 9.1 and will be analyzed 
in detail in Chap. 11. In this context, the coordinates (J;, %%) are called action and 
angle variables. To summarize, the Birkhoff normal form procedure yields a method 
for transforming a Hamiltonian system near an equilibrium into an integrable sys- 
tem up to a higher order perturbation. 


Remark 9.6.8 (KAM Theory) One can pass to the full Hamiltonian by consider- 
ing the higher order terms in (9.6.13) as a small perturbation of H. Applying the 
celebrated KAM!® theory, one can show that many!? among the non-resonant tori 
survive” the perturbation procedure, that is, for these tori the quasiperiodic motion 


ISK AM refers to original work of Kolmogorov [168, 169], Arnold [11, 12] and Moser [217, 219, 
220). 


!°Those which fulfil an additional, stronger non-resonance condition. 


0They are only getting slightly deformed. More precisely, in each step of the perturbation proce- 
dure the variables (/x, 3%) are getting modified. 
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persists. For a nice and transparent qualitative discussion we refer the reader to the 
book of Arnold [18], paragraph 4 of Appendix 8. There, one can find 6 versions of 
the theorem on invariant tori, including the two cases (behaviour near an equilibrium 
of an autonomous Hamiltonian and behaviour near a fixed point of a symplectomor- 
phism) discussed above. We also recommend the classical textbooks [273], $36, 
and [26], Chap. 5. For a more recent overview containing a lot of further references 
we refer to [63]. For an abstract discussion, which in particular goes beyond the 
Hamiltonian setting, we refer to [270]. We also recommend the pedagogical pre- 
sentation by Péschel [246], which contains a complete proof of the KAM Theorem 
in its classical form. In this version, small time-independent perturbations of an au- 
tonomous integrable system are considered. We will discuss this case in some detail 
in Chap. 11. In all these variants, normal forms of either a symplectomorphism or a 
Hamiltonian function play a basic role. There is a beautiful paper by Douady [75], 
where the relation between these two aspects is discussed. 


Remark 9.6.9 If the characteristic multipliers are r-elementary, by iterating the pro- 
cedure of Theorem 9.6.7, one obtains a normal form of order s in the variables [;, 
with s denoting the integer part of 5. On the other hand, if the characteristic expo- 
nents satisfy certain resonance conditions, the normal form becomes more compli- 
cated. Roughly speaking, it is then impossible to express all of the contributions to 
the Hamiltonian up to a given order in terms of variables J, only. There is a huge 
literature on this subject, see [18], Sects. 4, 5 and 6 of Appendix 7 and [26], Sect. 3 
of Chap. 7, where the reader can also find a lot of further references. In the case of 2 
degrees of freedom, the phase portraits in the neighbourhood of resonant equilibria 
can be analyzed in detail, see Sect. 3.2 of Chap. 7 in [26]. An example can be found 
in Exercise 9.6.1. 


Another important consequence of the Birkhoff Normal Form Theorem is the 
following. 


Theorem 9.6.10 (Birkhoff-Lewis) Let (M,@) be a symplectic manifold and let V 
be a local symplectomorphism of M. Let mo be an elliptic,’! non-degenerate and 
4-elementary fixed point of Y. Then, every neighbourhood of mo contains infinitely 
many periodic orbits of Y. The number of periodic orbits with period smaller than 
a given constant is finite. 


We give a proof of this theorem, except for an analytic estimate (Lemma 9.6.11) 
which can be found in [225] or in the appendix of Sect. 3.3. in [165] (written by 
Moser). The original work of Birkhoff and Lewis is contained in [48]. We note that 
this theorem can be viewed as a generalization of the classical Poincaré-Birkhoff 
Theorem (Theorem 9.9.3) which will be discussed in connection with the Arnold 
conjecture in Sect. 9.9. 


2! This assumption can be weakened. It is enough to assume that at least one of the Floquet multi- 
pliers lies on the unit circle and is different from | [225, Thm. 1]. 


470 9 Hamiltonian Systems 


Proof By Theorem 9.6.2, we may assume that mpg is the origin of M = R", the lat- 
ter endowed with the canonical symplectic structure, and that W is given by (9.6.1). 
For given ¢ > 0, we define rescaled symplectic polar coordinates J, 0% by 


gk = V 2€ I cos Ux, Pe=v 2elk sin ¥x. 

In the sequel, we use the notation 8 for the matrix with entries 6;; and 
I= (h,..., In), 0 =(V1,...,0n), 
a=(d1,...,dn), Oo = (Q1,..., Qn). 


Choose 0 <r < ar and let 


n 1 2 
={IeR":) (2h -— 
k=1 
be the open ball with radius 5 centred at the point eO :— (x siseiy a). Due to 
: Dies 0<|q\>+|pl? =2¢ do 1, - 
——=f< oS r, < = 78 <é€] = nr} <é&, 
on On nee ENG 


T” x B, is an open subset of T” x IR" contained in the open e-ball in R?” centred 
at the origin. To begin with, one has to prove the following estimate for the iterates 
wi of W. Let W denote the mapping (9.6.1) with J; replaced by ¢/; and the higher 
order terms f; and gx ignored. 


Lemma 9.6.11 For every 0 <r’ <r and every c, > 0, there exists 9 > 0 such that 
wi (T x Br) CT x B, 
foralll<j<% + and all 0 < € < &9. Moreover, 
wi-Wi =e7'0, (1, 8,6), 
where 0, is some smooth function on T” x B,: such that e~'o, and its first partial 


derivatives with respect to I, and 0% tend uniformly to zero with € tending to zero. 


For the proof, see ae Peauna 1]. 
Next, letO0 <r” <r’ <r < 5,. We show that W has a periodic point in T” x B,”. 


That is, there exist I € B,» ihe some positive integer N such that 
(27)~'!Na(eD € Z”, (9.6.19) 


where we view ow = a(I), cf. Formula (9.6.2). Then, every point of the torus T” x (DB 
is a fixed point of WY and hence a periodic point of W. To prove this, consider the 
open ball B,” centred at e. Using ||Bx|| > ||87!||7'lx|] for all x € R”, where 
\|8—'|| denotes the operator norm, it is easy to see that the ae cil this ball under 
the mapping I+ a(eI) contains a ball of radius ¢ || B~! ||~! r”, centred at a + 
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epe. Thus, the image of B,” under the mapping Ib (27)~!Na(el) contains a 
ball of radius 
R= (22) "|p" 'Wer”, 
1 
and hence a cube of side length 2n~ 2 R. Thus, given r” and e, we can choose N so 
that 
1 
eNr" > mn? |b" |. 
1 

Then, the side length satisfies 2n~ 2. R > 1, so that the cube must contain a point of 
the lattice Z”. This proves the existence of periodic points of W in T” x B,. 

Finally, we prove that there exists a fixed point for some iterate of the full map- 


ping W. With the shorthand notation (Caal a )) = W (8,1, we must show that 
the system of equations 


IM =f, eo) —94+27h, heZ", (9.6.20) 


has a solution. For that purpose, first we show that there exists a torus which by 
an iterate YW is mapped radially, that is, in such a way that the second equation in 
(9.6.20) holds. By Lemma 9.6.11, 


o™) _ 9 = Na(el) +¢7!01(1, 9, ¢€). 


On the other hand, as we have just shown, there exists Te Bpv,a positive integer N 
and h € Z” such that 


Na(el) = 2zth. 


Using the estimate of Lemma 9.6.11 once again, as well as a version of the Implicit 
Function Theorem22 which provides estimates for the domain of the solution, one 
obtains that there exists a solution Ip = I+ e~!o1(e) € B,’ solving the equation 


o™) _ ¥ = Na(el) + 7! 01 (1, 8, ©) = 2th. 


Let us denote the torus corresponding to Ip by Tj. It remains to show that the first 
equation in (9.6.20) has a solution on To: By (9.6.14), we have 


a( Ely am) =o. 
k 


Hence, for any two-dimensional submanifold & of M with boundary y C T5, 
Stokes’ Theorem yields 


/ Yendn= fve(Denaon) = f vo= | w= | Detdon. 
wy) | y k = 2 Yok 


Since (9.6.20) implies d#) = d#, we conclude that 


[LU nyan =o 
Yok 


22See Lemma 3 in Sect. 2 of [225]. 
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for any closed curve y in Tj. Thus, there exists a smooth potential S = S(@#, €) on 
Tp such that 

_ os 

ae 


Since every smooth function on a compact manifold has a critical point, there exists 
I=I(#, ) satisfying I =I. This completes the proof of the Theorem. 


N 
EO atid 


Remark 9.6.12 Let y be an elliptic, r-elementary and non-degenerate periodic in- 
tegral curve of energy E of a Hamiltonian system (M,@, H). Application of the 
Birkhoff-Lewis Theorem to an isoenergetic Poincaré mapping (Yr, We, Og) at 
mo € y yields that every neighbourhood of mo in We contains infinitely many pe- 
riodic points of Oz, where the number of periodic points with period smaller than 
a given constant is finite. Consequently, every neighbourhood of y in the energy 
surface X’¢ contains infinitely many points located on periodic integral curves of 
XH. Note that this does not imply that the neighbourhood contains these periodic 
integral curves. 


Example 9.6.13 (Two degrees of freedom) The geometric picture following from 
Theorems 9.6.7 and 9.6.10 becomes especially transparent for the case of two de- 
grees of freedom, which has been analyzed in detail by Siegel and Moser [273, 
§§32-35]. In this case, every Poincaré section Ag is 2-dimensional, the invari- 
ant tori are circles with radius 2/ enclosing the fixed point (0, 0) € R? and the /- 
dependent angle of rotation is given by a(J) =a+ I. Here, non-degeneracy means 
B 40. One can show that the concentric circles whose radii fulfil the condition 


ae (9.6.21) 


for all k,l € Z with k > 1 survive the perturbation caused by passing to the full 
mapping W. Here, C and y are positive numbers, which do not depend on k and /. 
Condition (9.6.21) is the strong non-resonance condition, also called Diophantine 
condition, which we alluded to in Remark 9.6.8. 

It is instructive to try to construct the invariant curves for the full mapping by us- 
ing a formal power series expansion for f and g, see §32 in [273]. This way one gets 
insight into the role played by the Diophantine condition (9.6.21). Performing this 
procedure one gets a formal power series for the invariant curve, indeed. However, 
it is impossible to verify the convergence of these series.** Therefore, a stronger 
tool must be used—the rapidly convergent iteration scheme of Kolmogorov. As a 
consequence, for the tori fulfilling inequality (9.6.21) one gets convergence, indeed. 


3This problem is due to the occurrence of “small divisors” in these power series. We met such 
divisors in the proof of Theorem 9.6.2, cf. Eq. (9.6.9). There, in order to solve for the coefficients 
Fpq One has to divide by the number 4°~° — 1, which is different from zero under the non- 
resonance assumption. However, for the convergence of the above power series, non-vanishing of 
such numbers is not sufficient. They have to be sufficiently large. 
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These tori constitute a Cantor set with non-vanishing measure in R. In particular, 
there exist uncountably many such curves. 

The circles for which a is resonant do not survive the perturbation.”4 However, 
from the Birkhoff-Lewis Theorem we know that we still have periodic integral 
curves for the full system. These periodic integral curves cannot lie on the surviving 
tori, they must be located between them. In the next section we will draw con- 
clusions on stability from this fact. In general, among the periodic integral curves 
provided by the Birkhoff-Lewis Theorem there are curves of hyperbolic and elliptic 
type. Among the elliptic ones in general we can find again integral curves which are 
non-degenerate and 4-elementary. Around them, the above picture is being repro- 
duced on a smaller scale. The dynamics in the neighbourhood of hyperbolic integral 
curves turns out to be more complicated. For an illustration of these phenomena we 
refer to Figs. 8.3-2 and 8.3-3 in [1], which go back to Arnold [12]. 


Exercises 
9.6.1 Analyze the flow of the Hamiltonian 


1 
H= 56" + p’) + (q° — 3qp’). 


Hint. Consider the cases ¢ < 0, ¢ = 0 and ¢ > 0 separately. 
Note. This is an example of a resonance of third order taken from Sect. 4 
of [18]. 


9.7 Stability 


For a Hamiltonian vector field, the problem of stability is a rather difficult task. To 
understand this, first recall that to every characteristic exponent of an equilibrium 
y with negative real part there corresponds a characteristic exponent with positive 
real part, cf. Proposition 8.2.10. Accordingly, to every characteristic multiplier of 
a periodic integral curve y inside the unit circle there corresponds a characteristic 
multiplier outside. Hence, y cannot be asymptotically stable,~> and if it is stable, 
then all of its characteristic exponents have zero real part (in case y is an equilib- 
rium) or all of its characteristic multipliers have modulus | (in case y is periodic). In 
this case, the stable and unstable manifolds S~ (vy) and St (y) coincide with y itself 
and the whole manifold is a centre manifold. As a consequence, the usual stability 
theory for hyperbolic critical integral curves is pointless for Hamiltonian systems, 
and little is known in the general case. 

In this section, we will limit our attention to autonomous systems and discuss 
three special situations. First, we will show that under a special assumption the 


4This was already observed by Poincaré. 


25 Of course, this also follows from the Liouville Theorem 9.1.4. 
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Hamiltonian H possesses the properties of a Lyapunov function. Then, we will an- 
alyze the case of an equilibrium with purely imaginary characteristic exponents. 
Finally, we will discuss periodic integral curves in two degrees of freedom. 

Thus, let (M@,@, H) be a Hamiltonian system and let y be a critical integral 
curve. To find a criterion under which H is a Lyapunov function for y, consider the 
second derivative H,” of H at a point m € y, viewed as a symmetric bilinear form 
on T,,M. Since H is constant along y we have H,/(X, Y) =0 whenever X or Y is 


tangent to y. Consequently, H,’ induces a mapping 


Nmny X Nny > R, (LX],[Y]) AY (X,Y), (9.7.1) 


nm 


called the normal second derivative. If y is an equilibrium, then Nyy = TM and 
the normal second derivative coincides with H/’. 


Proposition 9.7.1 Let (M,q@, H) be a Hamiltonian system and let y be a critical 
integral curve. If for all m € y the second normal derivative of H at m is positive 
definite, y is stable. 


Let us add that this assumption also implies that H is a Morse-Bott function in 
some neighbourhood of y. 


Proof Denote c := H(y). Choose a covering of y by flow box charts (Uj, «;). Since 
the chart k; maps y M U; onto an open interval of the x!-axis, the normal second 
derivative of H at a point m € y NU; is given in this chart by the second derivative 
of the restriction of H o ic to the hyperplane defined by x! = ke (m). Since, by 
assumption, the normal second derivative is positive definite, the point x; (m) is a 
local mininum of the restriction. Since this is true for all m € y NU; we conclude that 
there exists a neighbourhood V; of y MU; in U; where H(m) > c for all m € V; \ y. 
Then, V :=; V; is a neighbourhood of y in M with H(m) > c for allmeV\y. 
Thus, the function H},, —c is a Lyapunov function for y and Theorem 3.8.16 yields 
the assertion. 


Example 9.7.2 Let M = R?” with the canonical symplectic structure and the stan- 
dard Darboux coordinates q' and p;. Consider a Hamiltonian of the form 


1 _ 
H(q, p) = 5PM ‘p+ V(q), 


where M is some positive symmetric invertible matrix and V is an arbitrary smooth 
function on R”. Obviously, (qo, po) is an equilibrium of X y iff po = 0 and qo is a 
critical point of V. The second derivative at (qo, po = 0) is 


M 0 
H"( =| av |. 
LO agragT 0) 


It is positive definite iff so is the second derivative of V at qo. Thus, Proposi- 
tion 9.7.1 yields that if qo is a local minimum of Vo, the equilibrium (qo, 0) is 
stable under the flow of X 7. 
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Next, we analyze the case of an equilibrium whose characteristic exponents are 
purely imaginary. The following theorem is a direct consequence of the Lyapunov 
Centre Theorem 9.5.8 and the fact that the orbit cylinder provided by this theorem 
combines with the equilibrium point to a two-dimensional invariant embedded sub- 
manifold of M, cf. Remark 9.5.9/1. 


Theorem 9.7.3 (Lyapunov Subcentre Theorem) Let m be an equilibrium of a 
Hamiltonian system (M, w, H), satisfying the following assumptions. 


1. Zero is not a characteristic exponent. 

2. If X is a purely imaginary characteristic exponent, it has multiplicity 1. 

3. If A4 and A2 are purely imaginary characteristic exponents, they are rationally 
independent. 


Then, for every purely imaginary characteristic exponent ia there exists a two- 
dimensional X y-invariant embedded symplectic submanifold Cy of M containing 
m with the following properties: 


1. Tn Cg is the eigenspace of the characteristic exponents ia and —ia. 

2. Cy \ {m} is the image of a regular orbit cylinder whose integral curves yg ap- 
proach m and whose periods Tz approach 2m in the limit E > H(m). Thus, Cy 
is a union of periodic integral curves and {m}, that is, it is diffeomorphic to a 
2-dimensional disc. 


The submanifold Cy, is referred to as the subcentre manifold associated with the 
pair of characteristic exponents tia. 

Besides the fact that the Lyapunov Subcentre Theorem provides insight into the 
behaviour of the flow near m whenever characteristic exponents satisfying the as- 
sumptions exist, it yields the following information about stability: if m is elliptic 
and if the characteristic exponents, counted with multiplicities, are pairwise ratio- 
nally independent, then m is stable in each subcentre manifold Cy. Indeed, given 
a (contractible) neighbourhood U of m in Cy we find an energy E such that yz is 
contained in U. The subset of Cy, enclosed by yg is invariant under the flow and 
hence it stays in U for all times. To be stable in each subcentre is, of course, only a 
necessary condition for m to be stable in M. Whether or under which conditions it 
is sufficient is a hard (and open) question, which remains to be studied. 


Example 9.7.4 (Subcentres in two degrees of freedom) Assume that dim M = 4 and 
that m fulfils the assumptions of the Lyapunov Subcentre Theorem. Then, charac- 
teristic exponents can be grouped into two pairs +A; and +Ap. In the trivial case 
where none of them is purely imaginary, these pairs are complex conjugates of one 
another. If one of the pairs is purely imaginary, A; = ia with a > O say, we are in 
one of the following two cases. 


1. Az = wis real and positive: since —ju < 0 < py, both the stable manifold and the 
unstable manifold have dimension one and the subcentre manifold Cy is in fact a 
centre manifold for m. Correspondingly, in the 2-dimensional subspace spanned 
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by the eigenspaces of the real characteristic exponents +y, the linearized flow 
has the form of a saddle (type 2(c) in Example 3.6.13). By the Lyapunov Centre 
Theorem, in the limit E — H(m), the Floquet multipliers of the periodic inte- 


gral curves yg of Cy approach eter, which are positive real numbers distinct 
from |. Since, for dimensional reasons, the Floquet multipliers can only arrive 
at the real axis if they pass through 1 or —1, there exists ¢ > 0 such that for all 
E with |E — H(m)| < e, the Floquet multipliers are real and distinct from | and 
the periodic integral curves yg are hyperbolic on their energy surfaces. 

2. 42 =iB with 6 > O and rationally independent from a: here, we have two sub- 
centre manifolds, Cy and Cg, consisting of periodic integral curves y; and y 4 ; 
respectively. Since a and # are rationally independent, m is r-elementary for 
any r, and Remark 9.6.9 yields that we can find Darboux coordinates q1, q2, P1, 
p2 at m such that H is a polynomial in the action variables J; = 5(q7 + P?) 
whose lowest order term is given by 


1 
H(q, p)= 5 (tel + Bh)t:.. 


Since H can be expressed entirely in terms of J; and J, it follows that the sub- 
centre manifolds Cg and Cg are given by g2 = p2 = 0 and q; = p; = 0, respec- 
tively. If the contributions of a and 6 have the same sign, m is a local minimum 
or maximum of H and the energy surfaces »’¢ near m are three-spheres con- 
taining two periodic integral curves y; and ve each, which collapse to m as E 
approaches H(m). If the contributions of a and f have opposite sign, m is a 
saddle point of H and 2’; are hyperboloids, each of which contain one periodic 
integral curve. To be definite, let us assume that the contribution of a has positive 
and that of 6 has negative sign. Then, »’¢ contains yz for E > H(m) and ve for 
E < H(m). As one increases the energy starting from a value below H(m), the 
periodic integral curves ve shrink, degenerate to m at E = H(m) and reappear 
as the periodic integral curves y;, though in another dimension. This could be 
imagined as an “evolution” of orbit cylinders, where Cg “dies” at E = H(m) and 


is “reborn” as the orbit cylinder Cy. In either case, for E > H(M), the Floquet 
20 4320 
multipliers of y approach ei and those of ve approach e~ # “, 


The authors of [1] call case | the phantom burst and case 2 the stable burst catas- 
trophe. For a further discussion of bifurcations in the case of two degrees of freedom 
we refer to Sect. 8.6 of [1] (where this example was taken from). 


Finally, we discuss the stability of a periodic integral curve y of a Hamiltonian 
system (M, w, H) with two degrees of freedom, that is, with dim M = 4. Assume 
that y is elliptic, 4-elementary and non-degenerate. From Example 9.6.13 we know 
that every neighbourhood of y in its (local) energy surface 2’; contains uncount- 
ably many 2-tori which are invariant under Xy and in which every integral curve 
is dense. The latter property implies that the tori cannot intersect. Therefore, they 
divide the 3-dimensional energy surface 2’ into disjoint subsets which are also in- 
variant under Xz. As a consequence, y is stable within its energy surface 2'r. If 
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one combines this observation with the existence of an orbit cylinder, one finds that 
y is stable: 


Proposition 9.7.5 Every elliptic, 4-elementary and non-degenerate periodic inte- 
gral curve of a 4-dimensional Hamiltonian system is stable. 


Proof Denote the periodic integral curve under consideration by yz,, where Eo = 
H(yg,). Since H does not have critical points in some open neighbourhood of 
VE, in M, and since stability is a local property, for convenience we may assume 
that H is a submersion and hence M is foliated by regular energy surfaces 2'p. 
Since, by assumption, | is not a Floquet multiplier of yz,, Theorem 9.5.4 yields 
that yz, is contained in a regular orbit cylinder ¢ : S! x I + M. Consider energy 
values E € J. Since the conditions on yg to be elliptic, to be 4-elementary and to 
be non-degenerate are all open conditions on the Birkhoff invariants*° of yg and 
since the latter depend smoothly on E, by possibly shrinking 7 we may assume 
that yz is elliptic, 4-elementary and non-degenerate for all E € J. Now, let U bea 
neighbourhood of yg, in M. By possibly further shrinking J, we may assume that 
c(S! x 1) CU. Then, for all E € J, UN Xz is a neighbourhood of yg in the en- 
ergy surface d'p. Since yz is stable in d'p, we find a neighbourhood Vz of yg in 
>'f where the flow of X y is defined for all times and whose points never leave U 
under this flow. Then, V = LU) ge, VE is aneighbourhood of y in M with the same 


property. 


For an application of the above stability analysis to celestial mechanics we refer 
to [273], $34. For example, the motion of an asteroid in the asteroid belt between 
the planets Mars and Jupiter can be viewed as a restricted 3-body problem,”’ with 
the three bodies being the asteroid, Jupiter and the sun and with the mass of the 
asteroid being completely neglected. If one assumes that the asteroid moves in the 
same plane as the sun and Jupiter, this system has two degrees of freedom. The 
ratios between the frequencies w, of the asteroid and wy of Jupiter for which the 
Floquet multipliers of the corresponding periodic integral curve do not fulfil the 
non-resonance condition are i z, t and 7 For these ratios one finds well-defined 
gaps in the distribution of asteroids, known as the Kirkwood gaps.7® Hence, these 
orbits are not stable, indeed. After the long history of the solar system, the asteroids 
have disappeared from them. For an extensive discussion of these and further gaps, 
see [216]. 


Remark 9.7.6 (Arnold diffusion) For systems with more than two degrees of free- 
dom, the invariant tori have codimension greater than | in their energy surface. Con- 
sequently, these tori do not divide the energy surface into disjoint subsets, so that an 


6Defined by an isoenergetic Poincaré mapping for yz, cf. Remark 9.6.5/1. 
27See also Sect. 4.4 of the book of Thirring [286]. 


28Named after the astronomer who first observed them in 1866. 
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integral curve can escape from a neighbourhood of y without running through such 
a torus. Thus, the above stability argument does no longer apply. This phenomenon 
is known as Arnold diffusion. 


9.8 Time-Dependent Systems. Parametric Resonance 


For the remainder of this chapter we depart from autonomous systems and con- 
sider time-dependent Hamiltonian systems (M,w, H). Here, the Hamiltonian H is 
a smooth function on R x M, cf. Sect. 9.3. We restrict attention to the case where 
the time dependence is periodic, that is, where there exists a minimal positive real 
number 7, called the period, such that H;7 = H; as functions on M for all t. In 
the present section, we study the problem of stability of equilibria and in the next 
section we address the problem of the existence of periodic integral curves. 

Let us start with some introductory remarks. The Hamiltonian vector field X q 
generated by a T-periodic Hamiltonian function H is also T-periodic, because 


(Xw)47 _ XHyyr = Xu, = (XH)t 
for all t. Let ® denote the flow of X . From Sect. 3.4 we know that ©@ satisfies 
:+7,0 = 1,0 0 Pr.0, Pe7,0 = PF (9.8.1) 


for all t € R and k € Z. The local symplectomorphism ®7 9 is called the period 
mapping of X 7. Note that the period mapping is a Hamiltonian symplectomorphism 
in the sense of Definition 8.8.7 and that every integral curve is invariant under ®7 0. 
In particular, equilibria of X y are fixed points of ®7 9. 


Proposition 9.8.1 A critical integral curve of a periodically time-dependent Hamil- 
tonian system is stable iff it is stable under the period mapping. 


Proof 1n the proof of the equivalence of points 1 and 2 of Proposition 3.8.5, allow 
y to be an equilibrium as well and replace the period mapping ®7 of the periodic 
integral curve given there by the period mapping ®7 9 of Xz. 


Now, we turn to the study of the stability properties of an equilibrium of Xy. 
Since stability is a local concept, we may assume that M = R2” with the standard 
symplectic structure and that the equilibrium under consideration is given by the 
origin. We restrict attention to the case where H; is a quadratic form on R?” for 
all t. This is a model for a system of coupled harmonic oscillators with T-periodic 
characteristic frequencies and coupling constants. Since H is quadratic in the vari- 
able x € R2”, the Hamiltonian vector field X H 1s linear, and so are the flow ® and 
the period mapping ®7.9. Thus, ®7.9 € Sp(n, R). For the elements of Sp(n, R), 
Proposition 3.8.9/2 yields the following simple stability criterion. 


Proposition 9.8.2 Let a € Sp(n, R). If all eigenvalues of a have absolute value 1 
and multiplicity 1, there exists a neighbourhood of a in Sp(n, R) such that the origin 
is stable under all symplectomorphisms from this neighbourhood. 
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Since the multiplicity of an eigenvalue +1 must be even, the assumptions imply 
that the eigenvalues of a are distinct from +1. Note that the proposition provides 
both a criterion for the stability of the origin under a symplectomorphism and a 
criterion for the structural stability”? of a family of linear symplectomorphisms with 
respect to the property to have the origin as a stable fixed point. 


Proof It is enough to show that a possesses a neighbourhood in Sp(n, R) whose 
elements have non-degenerate eigenvalues lying on the unit circle, because any au- 
tomorphism b of R?” with this property satisfies spec(b) = specé (b), so that sta- 
bility follows by Proposition 3.8.9/2. To construct the desired neighbourhood of a, 
we choose pairwise disjoint neighbourhoods U),, of the eigenvalues 2 of a in C, 
satisfying 


U,-1 =U,'=U,, 


where U;, means the complex conjugate subset. Since the eigenvalues depend con- 
tinuously on the elements of Sp(v, R), there exists a neighbourhood V of a in 
Sp(z, R) whose elements have exactly one eigenvalue in each neighbourhood Uj. 
For every element b of this neighbourhood, the eigenvalues necessarily have mul- 
tiplicity 1. We show that they have absolute value 1. Indeed, if jz is an eigenvalue 


of b, then so is ries If 4 is the eigenvalue of a such that yu € U;, then 


—_-j _>7-l 
gp eu, =U, 
Since U, contains only one eigenvalue of b, we conclude 7z~! 


|u| = 1. 


= pw and hence 


Remark 9.8.3 In the case of degenerate eigenvalues on the unit circle, Proposi- 
tion 9.8.2 does not provide information about stability. In this situation, the origin 
may be stable as in the case a = 1 or unstable as in the case 


fii 
el (et as Ee 


(Exercise 9.8.1). The proposition implies, however, that the transition between sta- 
ble and unstable linear symplectomorphisms is only possible via degenerate eigen- 
values. This transition can be imagined as follows. Two pairs of eigenvalues on the 
unit circle run (in opposite directions) into one another, meet at a certain eigenvalue 
pair and then escape along the two radial lines defined by this pair into opposite di- 
rections.*” Thus, for a linear symplectomorphism with degenerate eigenvalues, even 
a small perturbation can cause a transition between stability and instability. This 
phenomenon is referred to as parametric resonance. Note, however, that the fact 
that a has degenerate eigenvalues does not necessarily imply that in every neigh- 
bourhood of a there exist stable and unstable linear symplectomorphisms. For a 
criterion in that case, established by M.G. Krein, we refer to [18, $42]. 


2°Cf. Remark 3.8.19. 
30Compare with Fig. 7.1. 
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In what follows we study the case of one degree of freedom, that is, M = R?. 
Under the assumption that H;, be a quadratic form on R? for all t, H is of the form 


2 
mee @?, (9.8.2) 


with T-periodic frequency w. The positive sign refers to an attractive and the nega- 
tive sign to a repelling force. The Hamiltonian vector field is 


+ 1 4 
HA~(q, p= aa pot 
m 


X ps = Fmo(t)2qdp + - ay: (9.8.3) 


Denote the flow of X q+ by ©”. The stability criterion of Proposition 9.8.2 takes 
the following form. 


Proposition 9.8.4 Leta € Spd, R). 


1. If | tr(a)| < 2, there exists a neighbourhood of a in Sp(1, R) such that the origin 
is stable under all symplectomorphisms from that neighbourhood. 

2. If |tr(a)| = 2, then a= +1 and the origin is stable under a. 

3. If |tr(a)| > 2, there exists a neighbourhood of a in Sp(1, R) such that the origin 
is unstable under all symplectomorphisms from that neighbourhood. 


As a consequence, the equation | tr(a)| = 2 defines the boundary of the subset of 
Sp(1, R) of symplectomorphisms under which the origin is stable. 


Proof Let A, and Az be the eigenvalues of a. By the Symplectic Eigenvalue The- 
orem 7.4.3, A2 must coincide with A; or age Therefore, the following cases can 
occur: 


(a) Ay =Aa: here, A; = Ag = +1 and hence | tr(a)| = 2. 
(b) |Ay| = |A2| = 1 but Ay # Ad: here, A2 =A, #+1 and hence 


|tr(a)| = [Ay +. Ay] = 2|Re(A,)| <2. 


(c) |Aq| 4 |A2|: here, Az = as where i is real and distinct from +1. Hence, 
= 2 _ 
O< [Ail '(laal— 1) = laa + a7! — 2 = |r@| - 2. 


It follows that the cases (a), (b) and (c) are equivalently characterized by the 
conditions, respectively, | tr(a)| = 2, | tr(a)| < 2 and | tr(a)| > 2. Thus, for the cases 
| tr(a)| = 2 and |tr(a)| > 2, the assertion is obvious and for the case | tr(a)| < 2, it 
follows from Proposition 9.8.2. 


Combining Proposition 9.8.4 with Proposition 9.8.1, we obtain 


Corollary 9.8.5 For a Hamiltonian system on R* whose Hamiltonian function is of 
the form (9.8.2), the origin is stable iff | r(@#5)| <2. 


This way, the stability analysis for systems with Hamiltonian function (9.8.2) 
reduces to the problem of calculating the quantity r(o7 ). 
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Remark 9.8.6 In the case where w(t) performs small oscillations of period T about 
a constant value wo, Proposition 9.8.4 yields further information about the qualita- 
tive behaviour of the system. Write w(t) = we(t) with wo(t) = wo and, correspond- 
ingly, H~ = H;-. Assume that the mapping € +> w¢ is continuous with respect to 
the supremum norm for functions on [0, 7], that is, ¢, > € implies>! 


sup{ |e, (t) — we(t)| :t € [0, T]} > 0. 


H+ : He 
Then, | tr(®;‘9)| depends continuously on ¢, too. Let us compute | tr(®,- )I- In 
the case of an attractive force, the flow of the harmonic oscillator with constant 
frequency wo is given by 


Hy cos(wot) wn sin(@ot) 
#0 —mag sin(wot) cos(wot) 


@ 


hence 
juror )| = 2|cos(woT)|. 


Thus, if T ¢ op 2? Proposition 9.8.4 implies that the origin remains stable for suf- 
ficiently small ¢. If T € +Z, the origin is stable for ¢ = 0, but a transition to in- 
stability is possible for arbitrarily small values of ¢. For such values of the period, 
parametric resonance can occur. In the case of a repulsive force, 


a cosh(wof) wn | 


P95 = : 
—map sinh(q@ot) cosh(wof) 


and hence 
[tr(e7,)| = 2|cosh(woT)]. 


Since wo > 0 and T > 0, we have | tr(@7 | > 2, so that the origin remains unstable 
for sufficiently small values of e. 


Now, we consider an example where oF 7 can be calculated explicitly. This al- 
lows to go beyond the qualitative statements of Remark 9.8.6 and to analyze the 
stability of the origin for arbitrary values of «. 


Example 9.8.7 (Swing) Let 


e |re[kT, 7 


@¢(t) =@0 + 
: -e |re(r,e7I 


, kEZ, 0<e <a. (9.8.4) 


With this time-dependence of the frequency, the Hamiltonian (9.8.2) yields a simple 
model for a swing. To see this, consider a planar pendulum with time-dependent 


31Note that we do not assume @,(t) to be continuous in t. 
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length /(t). Let g be the gravitational acceleration and let ¢ be the angle of deviation 
from the lower equilibrium point. Then, the equation of motion reads 

. Id 

rot 4 + sing =0. (9.8.5) 
The mixed term 3 will be ignored. The motion of the person sitting on the swing 
is modelled by the following law for /(f): 


2 
Ib —O | re (kT, HT 
in , keZ, a= fF. 
0 


w 2k—-1 
oi |e (47, kTI 


2 
Here, /o oe < Ig is the pendulum length which effectively occurs if the person on 


the swing leans forward and J rs “ iad Io is the effective length if the person leans 
backward. For small deviations about the lower or the upper equilibrium point, sin 
can be approximated by +4, respectively. Under this approximation, Eq. (9.8.5) is 
equivalent to the Hamilton equations for the Hamiltonian (9.8.2) with p := md, 
q :=1@ and with frequency given by (9.8.4). 

First, let us consider Ay , which corresponds to small deviations from the lower 
equilibrium point. Integration of the Hamilton equations yields (Exercise 9.8.2) 


ge 
cos(eT)}. (9.8.6) 
— ¢2 


Hy @ 
|tr(P7)| =2 ee = cos(wP) — 
We keep the period T fixed and study the stability of the origin, that is, of the lower 
equilibrium point of the swing, depending on the values of the parameters wo and ¢. 
Regions in the wo—e-plane, for which the origin is stable or unstable, respectively, 
will be called stable or unstable regions, respectively. The boundaries between such 
regions are called stability boundaries. According to Proposition 9.8.4, these bound- 


2 
nat) 


+ 
aries are defined by | rp" .)| = 2. In our example, this corresponds to the equation 


w(1 + cos(woT)) = €7(1 + cos(eT)), (9.8.7) 


see Fig. 9.4. We see that, apart from the points wp = sem the wo-axis is enclosed by 
stable regions, that is, there the origin remains stable for sufficiently small values 
of e. At the exceptional points wp = An the unstable regions touch the wo-axis. 
Thus, at these points the origin is unstable for arbitrarily small values of ¢. For 
the swing, this means that when swinging with a multiple of half the characteristic 
frequency of the swing, an arbitrarily small perturbation suffices to set the swing in 
motion. Indeed, for @pT = (2k + 1)z, Eq. (9.8.6) yields 


ae wo te ? cos(e MEEUZ ) 
tr(®,4 )| =2 >2, O0<e< ——, 
| ( r.0)| | we — 62 Mk +1 
and for @pT = 2k we obtain 
2k 
yt We —€ * cos(e AZ — wo 
tr(®,% )| =2 >2, O<e<—. 
ier) =) : 
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Fig. 9.4 Stable (grey) and € 
unstable (white) regions in the 
@o—€-plane for the harmonic a 
oscillator with attractive force Tr 
and time-dependent 
frequency give by (9.8.4) 
2n 
T 
x 
T 


7 ® 


sab 
sf 
es 


However, Fig. 9.4 shows that the opening angles under which the unstable re- 
gions touch the wo-axis depend drastically on whether an integer multiple or a half 
integer multiple of the characteristic frequency is approached. In the first case, one 
must meet the characteristic frequency rather exactly or change the effective length 
heavily in order to reach the unstable region.*” On the other hand, for a half inte- 
ger multiple of the characteristic frequency it is enough to meet the characteristic 
frequency approximately and to produce a small change of the effective length by 
swinging. To calculate the opening angles, we write wo = sae +x and expand the 
left and the right hand side of Eq. (9.8.7) to the lowest non-vanishing order about 
x = 0 and ¢ = 0, respectively. Using that the sign in (9.8.7) is positive for odd k, 
we obtain a + O(x?) for the left hand side and 2e” + O(e*) for the right hand 


side. Thus, e ~ + My and the opening angle is 


(<4) 
Qa = arccos| ———~——_ ] . 


k?n2 +4 


A similar calculation for even k yields ¢ ~ + ry and hence a = 0. 


Next, let us consider the case H; , which corresponds to small deviations from 
the upper equilibrium position. Here, one finds (Exercise 9.8.2) 
Hi; 0% e 
|tr(P74)| =2) =—  cosh(woT) — 4— cosh(eT)]. (9.8.8) 


= = ee 
Wo & ® éE 


Since the expression under the absolute value signs is larger than 1 for all 0 < 
€ < wo, the upper equilibrium point remains unstable for any choice of the time 
dependence of the pendulum length. 


32The reader should try to set a swing in motion by leaning forward and backward over a whole 
period each time. 
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In a similar manner, the reader may discuss the following example (Exer- 
cise 9.8.3). 
Example 9.8.8 Let 
wop+e? |telaT, =H7[ 


W(t)? = : 
° wae |re( ra 


keZ. (9.8.9) 


The corresponding Hamiltonians (9.8.2) model small deviations from the equilibria 
of a planar pendulum whose suspension point moves vertically according to 


= 2. 2 
e21(L By) T) fee: ey 


a(t) = keZ. 


= 2 2 = 3 
eure — Fy |e (7, xr 


Finally, we discuss an example which can be worked out explicitly by using the 
theory of Mathieu functions. The latter is treated in great detail in [209]. 


Example 9.8.9 Let 


7 er? _ 2m 
Q(t)” = @p — € cos(2t), Q= eo (9.8.10) 
The corresponding Hamiltonians (9.8.2) model small deviations from the equilibria 
of a planar pendulum whose suspension point moves vertically according to 
ge 
a(t) = ou cos({2r). 


The equation of motion is 


Xk (6 ne cos(@2t))x =0, 


where the positive (negative) sign stands for the lower (upper) equilibrium. Setting 
= ot and denoting the derivative with respect to s by x’, for both cases we obtain 
the Mathieu equation 


xy (a —2q cos(2s))x =0, (9.8.11) 
with parameters 
we = 
ae on q= +257: 


In the theory of the Mathieu equation one shows that there exist families a,, r = 
0,1,2,...,andb,,r=1,2,..., of real-valued functions such that Eq. (9.8.11) has 
an even periodic solution for every g € R and a = a,(q) and an odd periodic solution 
for every g € R and a = b,(q). After an appropriate normalization, these solutions 
are called Mathieu functions and are denoted by ce, and se;, respectively. 


Let us calculate @;: . under the simplifying assumption that b,(g) 4 a #a,(q). 
In this case, the general solution of the Mathieu equation can be shown to have the 


form 


x(s) = Ael’S P(s) + Be” P(—s), (9.8.12) 
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with P being a z-periodic function and v = v(a, q) being a complex-valued an- 
alytic function.*? Under the assumptions made, either v(g,a) € R or Re(v) € Z 
and Im(v) ¥ 0 [3, $20.3]. The first case occurs if g > 0 and a;(q) <a < b-+41(q) 
or gq < 0 and a,(q) < a < b,(q), whereas the second case occurs if g > 0 and 


by (q) <a <a,(q) org <0 and b2,_1(g) <a < ba, (q) or az (q) < a < Gor41(Q). 
In particular, v(a, g) is not an integer. With p(t) = mlx(t), from (9.8.12) we read 


off 
x(t)= acl p(Sr) + Be p(-Ze), 


po) =mr| ae (inp( Fr) aa =P (S4)) 
2 2 2 
— Be ivP t)+—P t , 
2 2 2 


with vp =v Z The constants A and B are determined by the initial conditions 


x(0) = (A+ B)P(0), p(0) =ml*(A — B) (wre + +P')). 


Since P(0) 40, we can choose P(0) = 1. Then, 


elt p(2 pet pP(_2t) elt p(y) eH" p(_ Bt 


Ae 2 Imi F (0) 
®, (0) deg Ka ‘de be 
, m2 (el! F(t)—e7'"! F(—1)) el”! F(t)-+e7!" F(—1) 
2 2F(0) 


with 


F(t) =ieP 2 2 py 2 


For t = T we obtain P($T) = P(x) = P(O) = 1 and F(T) = F (0). Thus, 


ite cos(zv) “Sea aad 
7.0 Limi? F (0) sin(tv) cos(srv) 


and, therefore, 


[(O7"5)| — 2|cos(z v)| = 2,/cos? (Re(v)) + sinh*(Im(v)). 


Ht ; 
In the case where v(a, q) € R, we have | tr(®;9)| < 2, whereas in the case where 


Re(v) € Z and Im(v) + 0, we find | r(OP)| > 2. Thus, the boundaries of stability 
are given by the curves a,(q) and b,(q). The corresponding curves in the wo-e- 
plane are given by 


Q 2 
ao(e) = > f 252} f =ar, by, 


Obviously, v is determined up to addition of 2k, k € Z. 
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Fig. 9.5 Stable (grey) and unstable (white) regions in the wo—e-plane for the harmonic oscillator 
with time-dependent frequency (9.8.10) and attractive (/eft) or repulsive force (right) 


with the positive sign corresponding to small deviations from the lower equilibrium 
point and the negative sign corresponding to small deviations from the upper equi- 
librium point, see Fig. 9.5. The reader can analyze this figure in the same manner as 
in Example 9.8.7. 


Remark 9.8.10 Let us consider the influence of friction, thus leaving the realm of 
Hamiltonian systems. We restrict attention to the case of the lower equilibrium point. 
Assuming the friction force to be proportional to the velocity, the equation of motion 
reads 


X4+2BX + a-(t)x=0, B>O. 


The corresponding vector field is linear, 


X= 5a, — (moo()?x + Bp) dp. 
and so is its flow ®*. Therefore, the origin is the only equilibrium. Of course, this 
vector field is not Hamiltonian and thus the stability criterion of Proposition 9.8.4 
cannot be applied. Nonetheless, Proposition 9.8.1 carries over to the present case, 
and Proposition 3.8.9/2 yields that the origin is asymptotically stable under PF 9; 
and hence under ®*, if the absolute values of all the eigenvalues of ©. are smaller 
than |. Let us calculate the eigenvalues of Pro for e = 0, that is, for w,(t) = wo. 
According to Example 3.6.13, 
cos(@r) + & sin(21) 1 sin(Qt) 
DF = aad max z 
— moo sin(2r) cos(@r) — & sin(21) 


where {2 =, | — pf. Thus, the eigenvalues are A = e FT (cos(QT) tisin(2T)) 
and their absolute value is 


Ax] =e7P?. 


Since P 0 and, therefore, also its eigenvalues are continuous in €, we conclude that 
for every wo, there exists €9 > 0 such that the origin is stable, and hence asymptoti- 
cally stable, under ®* for all ¢ < ¢9. This means that under the influence of friction 


9.9 On the Arnold Conjecture 487 


the regions of instability cease to touch the wo-axis, that is, for every period T we 
need a finite ¢ for reaching instability. 


Remark 9.8.11 If we give up the assumption on H to be a quadratic form, the flow 
@® and consequently also the period mapping ®7 9 become non-linear. There exists 
a certain iteration procedure which can be used for calculating the period mapping 
approximately. Using this idea, a lot of non-linear Hamiltonian systems have been 
studied in the literature, see e.g. [153, §§ 7.4.2, 7.5.1, 7.5.2]. 


Exercises 
9.8.1 Show that the matrix 


{lol 
[0] 
defines a linear symplectomorphism on R? under which the origin is unstable. 

9.8.2 Verify Eqs. (9.8.6) and (9.8.8). 

9.8.3. Work out Example 9.8.8. 

9.8.4 Consider a mathematical pendulum whose suspension point oscillates verti- 
cally according to a(t) = are? cos(S2t), cf. Example 9.8.9. Show that the 
approximations of the equation of motion for small deviations from the equi- 
libria are equivalent to the Hamilton equations of the Hamiltonian systems 
(9.8.2) with time-dependent frequency (9.8.10). 
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In this section, we turn to the problem of the existence of critical integral curves 
for a periodically time-dependent Hamiltonian system (M, w, H). More precisely, 
we will discuss a special type of critical integral curves which is related to the 
fixed points of the period mapping ®7.9. For every fixed point m of ®7.9, we have 
@147,0(m) = ®;(m) for all t, so that the maximal integral curve through m is either 
an equilibrium or periodic of period es for some positive integer k. A typical exam- 
ple is the motion of a harmonic oscillator acted upon by a periodic external force. 
Therefore, periodic integral curves whose period is an integer part of that of H are 
referred to as forced oscillations. 


Remark 9.9.1 For convenience, one may restrict attention to time-dependent Hamil- 

tonian systems of period |. Indeed, if H has period T, the function 
H:RxM-=>R, H(t,m):=T-H(Tt,m), 

has period | and generates the flow 


H _@H 
Ds, = Pry TH: 
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Thus, up to a simultaneous rescaling of the time parameter and the energy scale, one 
may assume that H has period T = 1. In this situation, the period mapping ®1 9 is 
usually referred to as the time-1 mapping of the flow. 


For what follows, it is important to notice that the period mapping of the flow of a 
periodic Hamiltonian is a Hamiltonian symplectomorphism, cf. Definition 8.8.7. In 
this sense, any statement about Hamiltonian symplectomorphisms applies in partic- 
ular to the period mappings of periodic Hamiltonians.** The following conjecture, 
formulated by Arnold in the 1960s, predicts a universal lower bound on the number 
of fixed points of a Hamiltonian symplectomorphism and hence on the number of 
critical integral curves of a periodically time-dependent Hamiltonian system: 


Arnold conjecture Let (M, w) be a compact symplectic manifold and let ¥ : M > 
M be a Hamiltonian symplectomorphism. Then, Y must have at least as many fixed 
points as a function on M must have critical points. If the fixed points are all non- 
degenerate, then their number is at least equal to the minimal number of critical 
points of a Morse function on M. 


Remark 9.9.2 


1. The Arnold conjecture is certainly true in the time-independent case, that is, 
for symplectomorphisms W which are given by the flow ® of an autonomous 
Hamiltonian function H on M, taken at an arbitrary instant of time, say t = 1. 
Indeed, every critical point of H is an equilibrium of the Hamiltonian vector field 
Xy and hence a fixed point of ;. Hence, W has at least as many fixed points as 
FH has critical points. If in addition the fixed points of W are non-degenerate, so 
are the critical points of H. Therefore, H is a Morse function, and W has at least 
as many fixed points as a Morse function on M@ must have. 

2. The Morse inequalities of Remark 8.9.10/3 imply the following weaker version 
of the Arnold conjecture. If the fixed points of W are all non-degenerate, their 
number is greater than or equal to the sum of the Betti numbers of M. 


The Arnold conjecture may be viewed as a higher-dimensional generalization of 
the following theorem, which was formulated by Poincaré in 1912, shortly before 
he died, and proved by Birkhoff [45]*> in 1913. 


Theorem 9.9.3 (Poincaré-Birkhoff Theorem) Every area preserving homeomor- 
phism of an annulus 
A={(q, p)€R*:a<q’ +p’ <b} 


which leaves the boundary circles invariant but twists them in opposite directions 
possesses at least two fixed points. 


34Qne can also show the converse, that is, every Hamiltonian symplectomorphisms can be repre- 
sented as the period mapping of some periodic Hamiltonian, see Exercise 11.8 in [206]. 


35See also [46] for a generalization to ring shaped regions with arbitrary boundary curves. 
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This theorem is also known as Poincaré’s Last Geometric Theorem. Poincaré 
was led to formulate it when he studied the problem of periodic solutions of the 
restricted three body problem in celestial mechanics. As mentioned in Sect. 9.6, the 
Birkhoff-Lewis Theorem 9.6.10 generalizes this theorem in a different direction. 
We note that both the requirement of being area preserving and the twist condition 
are important, see Exercise 9.9.1. Following McDuff and Salamon [206], we give a 
beautiful and simple proof of this theorem under the stronger condition of monotone 
twist. 


Proof Denote the homeomorphism under consideration by ¢. In polar coordinates 
¢ andr on A, itis given by g(¢, r) = (f (¢,1r), h(¢,r)), where f and h are contin- 
uous functions on A satisfying 


f@+2x,r)=f.r)+2x, h+2x,r)=h(g,r). 


Under the assumption of monotone twist, that is, 


r<r => f@r<f(¢r) (9.9.1) 
for every angle @ there exists a unique radius r = F(@) € (a, b) such that 
f(¢. F@)) =¢. 


The mapping F is continuous and 27:-periodic. All points on the curve ¢ bt 
(@, F(@)) in A are mapped radially under g. Since @ preserves the area, this curve 
must intersect its image under ¢@ at least twice. Since all points of the intersection 
are fixed points of g, this proves the theorem. 


Remark 9.9.4 We draw the attention of the reader to the fact that the above intersec- 
tion argument is a simple version of the argument used in the last step of the proof 
of the Birkhoff-Lewis Theorem. In more detail, the counterpart of ¢g is the iterate 
Ww") and the counterpart of the annulus A is T” x B,. Thus, to f and to h there 


( 


correspond af and J ee respectively. 


Now, let us prove the Arnold conjecture in the simplest nontrivial case. 


Proposition 9.9.5 The Arnold conjecture holds for Hamiltonian symplectomor- 
phisms of a compact symplectic manifold which are sufficiently close to the identity 
in the C'-topology. 


Proof Let (M,qw) be a compact symplectic manifold. By Propositions 8.8.4 
and 8.8.10, in the C!-topology, there exists a neighbourhood of idy in Symp(M, w) 
and a local homeomorphism from this neighbourhood onto a neighbourhood of 
the zero section of T*M in Z'(M), mapping Hamiltonian symplectomorphisms ¢ 
to exact 1-forms df on M. Fixed points of » thereby correspond to zeros of d/. 
Thus, fixed points of g are in one-to-one correspondence with critical points of f. 
Obviously, they are non-degenerate iff f is a Morse function. 
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The Arnold conjecture has attracted the attention of leading mathematicians over 
the last decades. It was first proved by Eliashberg [85] for Riemann surfaces and 
next by Conley and Zehnder [66] for tori of arbitrary dimension. Conley and Zehn- 
der showed that the standard torus T?” has at least 27” distinct fixed points, pro- 
vided they are all non-degenerate. In the degenerate case there are at least 2n + 1 
fixed points. A breakthrough was made by Floer [93, 94], who developed a new 
approach to infinite-dimensional Morse theory*® and applied it to prove the weak 
Arnold conjecture for so-called monotone symplectic manifolds. Thereafter, Hofer 
and Salamon [137] generalized this result to the so-called weakly monotone case. 
Finally, using Floer theory, the weak version of the Arnold conjecture has been 
proven for arbitrary symplectic manifolds independently by Fukaya and Ono [100] 
and Liu and Tian [186]. To our knowledge, the strong version is still open. 


Exercises 


9.9.1 Take the annulus A = {(q, p) € R* :a <q? + p* < b} with polar coordinates 
@ and r and consider the mappings 


1 1 
Wb. (o+ 5). va.) =(o4r-5.77), 


Find out whether these mappings fulfil the requirements of the Poincaré- 
Birkhoff Theorem. Do they possess critical points? 


36Called Floer homology, see [260] or [269] for a review. 


Chapter 10 
Symmetries 


Symmetries play a fundamental role in the study of the dynamics of physical sys- 
tems, because they give rise to conserved quantities. These can be used to eliminate 
a number of variables. In the Hamiltonian context, this procedure is called symplec- 
tic reduction of systems with symmetries.! It is the aim of this chapter to present 
this procedure in a systematic way. In the Hamiltonian context, the conserved quan- 
tities corresponding to the symmetry of the system are encoded in a mapping from 
the phase space of the system to the dual space of the Lie algebra of the symmetry 
group. It is called momentum mapping, because it generalizes well-known constants 
of motion, like momentum or angular momentum etc. In Sect. 10.1 we discuss this 
notion in detail, including a number of examples. Next, in Sect. 10.2, we present 
some algebraic basics needed for the symmetry reduction procedure. Then, the clas- 
sical result of Marsden, Weinstein and Meyer on symmetry reduction is discussed. 
It states that the reduced phase space, obtained by factorizing a level set of the 
momentum mapping with respect to the freely acting residual symmetry group, car- 
ries a natural symplectic structure and that the dynamics of the systems reduces to 
this space. In particular, also the relation to orbit reduction is studied. In Sect. 10.4 
we present the Symplectic Tubular Neighbourhood Theorem.” This is an important 
technical tool for generalizing the above classical result to the so-called singular 
case, where the assumption about the free action of the residual symmetry group 
is removed. The theory of singular reduction is presented in detail in Sect. 10.5. 
In Sects. 10.6 and 10.7 the reader will find a large number of applications. First, 
we discuss the following examples from mechanics: the geodesic flow on the three- 
sphere, the Kepler problem (including the Moser regularization), the Euler top and 
the spherical pendulum. Section 10.7 contains a model of gauge theory, which can 
be viewed as obtained from approximation of gauge theory on a finite lattice. Fi- 
nally, we give an introduction to the study of qualitative dynamics of systems with 
symmetries in terms of the energy-momentum mapping. 


‘In the sequel, we shall cite important contributions to the subject. However, for a quite exhaustive 
list of references and also for a lot of historical remarks, we refer the reader to [196] and [232]. 


7 Also called the Symplectic Slice Theorem. 


G. Rudolph, M. Schmidt, Differential Geometry and Mathematical Physics, 491 
Theoretical and Mathematical Physics, 
DOI 10.1007/978-94-007-5345-7_10, © Springer Science+Business Media Dordrecht 2013 
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10.1 Momentum Mappings 


Recall that an action W of a Lie group G on a symplectic manifold (M, ) is called 
symplectic if 


Vio=o (10.1.1) 


for all g € G and that, in this case, the tuple (MV, w, W) is called a symplectic G- 
manifold (Definition 8.6.2). Correspondingly, an action? yy of a Lie algebra g on 
(M, @) is called symplectic if the vector field w(A) is symplectic for all A € g. 
Note that the action W of a Lie group G induces an action w of its Lie algebra g by 
w(A) = Ax, with A, denoting the Killing vector field generated by A € g. If W is 
symplectic, differentiation of (10.1.1) yields 


L4,0=0 


for all A € g, hence y is symplectic, too. 


Definition 10.1.1 A Hamiltonian system (M, w, H) is called symmetric 


1. under a symplectic action © of a Lie group G if ut =H forall geG, 
2. under a symplectic action w of a Lie algebra g if w(A)H = 0 for all A € g. 


If a Hamiltonian system is symmetric with respect to a Lie group action, then it 
is also symmetric with respect to the induced Lie algebra action. Thus, every Lie 
group symmetry of a given Hamiltonian system induces a Lie algebra symmetry. 
The converse need not be true, because not every Lie algebra action integrates into 
a Lie group action. 

As we have seen, Killing vector fields of a symplectic G-action are automatically 
symplectic, but they need not be Hamiltonian. This special case is, however, of 
particular importance. In this case, for every A € g there exists a function J4 € 
C™(M) such that X7, = Ax. 


Definition 10.1.2 (Momentum mapping) Let (M, w) be a symplectic manifold and 
let W be an action of a Lie group G on M. A mapping J: M — g" is called a 
momentum mapping’ for W if 


Xj, = Ax (10.1.2) 
for all A € g, where the functions J4: M — R are given by 
Ja(m) = (J(m), A). (10.1.3) 


The action W is called Hamiltonian if it is symplectic and if there exists a momentum 
mapping. In this case, the tuple (M, w, W) is called a Hamiltonian G-manifold.° 


3See Definition 6.2.6. 
4The notion of momentum mapping has a long history, see [309] and [194]. 


5If a momentum mapping J is fixed, we may include it in the tuple, thus writing (M, w, Y, J). 


10.1 Momentum Mappings 493 


Since @ is non-degenerate, Formula (10.1.2) is equivalent to 


Ayio@=—dJ,. (10.1.4) 


Proposition 10.1.3 Let (M, w) be a symplectic manifold and let © be an action of 
a Lie group G on M. A momentum mapping for W exists iff all Killing vector fields 
are Hamiltonian. 


Proof If every Killing vector field is Hamiltonian, then, for a chosen basis {e;} in g, 
there exist smooth functions f; € C°(M) such that (e;), = X 7. Using the dual 
basis {e*'}, we define 


J:M—>g*, JI(m):= filme". 
Then, for every A € g, we obtain J4 = (e*", A) f; and hence 
Xt, =X esi_ay , =(C", A) Xp, =(C", Al(ei)e = ((e", Ales), = Ax. 


Thus, J is a momentum mapping for W. The converse direction is obvious. 


Remark 10.1.4 


1. We see that a momentum mapping exists if only condition (10.1.2) is fulfilled 
for each A € g separately, with an arbitrary smooth function J4. As shown in the 
proof of Proposition 10.1.3, the functions J4 can always be chosen so that the 
mapping At> Jy is linear. 

2. Let (M,w,W) be a Hamiltonian G-manifold and let J and J be momentum 
mappings. By (10.1.4), we have 


d(J4 — Ja) =0 


for all A € g, that is, Ja — Ja is constant on every connected component of 
M. Thus, if M is connected, the momentum mapping is fixed up to an additive 
constant 19 € g*, 


J=J+ 0. 


3. Definition 10.1.2 uses only the Lie algebra g and its dual vector space. Therefore, 
it extends automatically to the case of the action of a Lie algebra. If there exists 
a momentum mapping, the action is symplectic. In this case, the tuple (M, w, wv) 
is called a Hamiltonian g-manifold. In this book we have in mind Lie algebras 
of finite-dimensional Lie groups only. However, these notions can be extended 
to infinite-dimensional Lie algebras as well. 


Proposition 10.1.5 Let (M, w) be a symplectic manifold and let V be an action of 
a Lie group G on M. If G is connected and if there exists a momentum mapping 
for ©, then © is symplectic (and hence Hamiltonian). 


Proof If there exists a momentum mapping J, for every A € g we have 


£4, 0 = Ayidw + d(Ayi@) = —d(dJ4) =0 
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and hence, by (4.1.28), 
d 
— Way = Wexp(tay-L An ® = 0. 


dt 
It follows that Dstt Av =o for all t. Since G is connected, by Proposition 5.1.7, 


it is generated by a neighbourhood of the identity. Thus, es o=ao forall g eG. 


Remark 10.1.6 As a consequence of Propositions 10.1.3 and 10.1.5, for an action 
of a connected Lie group on a symplectic manifold to be Hamiltonian it is sufficient 
that all Killing vector fields be Hamiltonian. In particular, the condition that the 
action be symplectic is automatically fulfilled then. 


We note that for arbitrary symplectic G-manifolds a momentum mapping need 
not exist, because the Killing vector fields of a symplectic group action need not be 
Hamiltonian, see Remark 8.2.7. The following proposition provides conditions on 
M and G under which a momentum mapping exists. 


Proposition 10.1.7 Let (M, w, WV) be a symplectic G-manifold. A momentum map- 
ping for W exists iff the linear mapping 
F:/lg,9]> H'(M,R), —F([A]) := [Asso] (10.1.5) 
vanishes identically. 
Proof We give the proof for a left action. The mapping F is well defined, that is, if 
A =[B,C] for some B,C € g, then A, _1@ is exact: since W is symplectic, we have 
-£3,0 =0= Lo, 0, 


that is, B, and C, are locally Hamiltonian. According to Proposition 8.2.6/2, then 
[Cy, By] = Xo c,,B,)- Thus, using Proposition 6.2.2/2, we obtain 


Ay 10 =([B, C]y 10 =[Cy, Byliw = —d(w(Cy, By)). 


The linearity of F is obvious. Now, according to Proposition 10.1.3 and For- 
mula (10.1.4), a momentum mapping J exists iff A,4@ is exact for all A € g, that 
is, iff [A,i@] = F([A]) =0. 


Corollary 10.1.8 Let (M, w, W) be a symplectic G-manifold. In each of the follow- 
ing two cases, a momentum mapping for W exists. 


1. The first de Rham-cohomology group H'!(M, R) is trivial. 
2. There holds g = [g, g]. This means that the first cohomology group of the Lie 
algebra g is trivial. 


The most important property of momentum mappings is that they provide con- 
stants of motion. Recall that a function f ¢€ C™~(M) is called G-invariant if 
f oW, =f for all g € G and that the subspace of G-invariant functions is denoted 
by C©(M)S. 
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Theorem 10.1.9 (Noether) Let (M, w, WY, J) be a Hamiltonian G-manifold and let 
H €C®(M)°. For A € gq, the function J, is a constant of motion for the Hamilto- 
nian system (M, w, H), that is, 


* ja(y() =0 
a Y = 
for all integral curves y of the Hamiltonian vector field X 1. 


Proof For all m € M, we have 


d 
(Ja, H}(m) = Xj,(H)(m) = Ax(A)(m) = Te H o Wexpta(m) = 0. 
0 


Hence, the assertion follows from Proposition 9.1.10/1. 


Corollary 10.1.10 Jf the action is free, for every basis {e;} in g, the functions Je, 
constitute a linearly independent system of constants of motion. 


Next, we study the transformation properties of momentum mappings with re- 
spect to the action of G on M and the coadjoint action of G on g*. For that purpose, 
let (M, w, YW, J) be a left Hamiltonian G-manifold and let A € g and g € G. Using 
Proposition 8.2.9 and point | of Proposition 6.2.2, we obtain 


X Jyo = (Wo-1)a XJ = (Wo-1) Ax = (Ad(g~') A), = XJ, 


and hence 


d(g7!)A 


O= (Xjyow, — XJ 


Ad(g7!)A 


Jso= —d(J4 ° WV, = Jad(g-!)A)- 


Thus, for all A € g the function J4 o Wg — Jaq¢g-1y, iS constant on every connected 
component of M. Using 


Inde a(m) = (Jn), Ad(g~') A) =(Ad*(g) 0 J(m), A), 
we can rewrite this function in the form 
me (J o W.(m) — Ad*(g) 0 J(m), A). 


Thus, if M is connected, the linear functional J o W,(m) — Ad*(g) 0 J(m) on g does 
not depend on m and we obtain a mapping 


o:G> g", o(g) := J o W.(m) — Ad*(g) 0 J(m), (10.1.6) 


where m is an arbitrarily chosen point of M. Obviously, o = 0 iff J is equivariant 
with respect to the coadjoint action of G, that is, iff 


JoW, =Ad*(g)o J. (10.1.7) 
In this case, Gm C G sqm) and Proposition 6.2.4/2 implies 
PtAaA oF. Aee, (10.1.8) 


where Ae is the Killing vector field generated by A under the coadjoint action. 
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Remark 10.1.11 The mapping o defined by (10.1.6) is a measure for the momentum 
mapping to fail the equivariance property. It satisfies 


o (gh) = o(g) + Ad*(g)a(h). (10.1.9) 


A mapping o : G > g* with this property is called a coadjoint 1-cocycle with values 
in g*. A cocycle t is called a coboundary if there exists an element jz € g* such that 


t(g) = — Ad*(g)u. (10.1.10) 


The set of g*-valued 1-cocycles carries a vector space structure and the cobound- 
aries form a vector subspace. The quotient vector space H'!(G, g*) is called the first 
cohomology group of G with values in g*. 


Proposition 10.1.12 For a connected Hamiltonian G-manifold (M,w,W), the 
class [o] € H\(G, g*), defined by (10.1.6), does not depend on the momentum map- 
ping J. An equivariant momentum mapping exists iff [o] = 0. 


Proof Let J, J be momentum mappings for Y and let o and o denote the corre- 
sponding coadjoint 1-cocycles. By Remark 10.1.4/2, we have J = J + tuo for some 
ito € g*. Thus, 


o(g) =(J + 10) o Wg(m) — Ad*(g) o (J + wo)(m) = &(g) + wo — Ad*(g) 10 


and hence [o ] = [a]. Moreover, if an equivariant momentum mapping exists, then 
obviously [o] = 0. Conversely, if [o] = 0, there exists an element jz € g* such that 
o(g) = — Ad*(g)y. Then, J = J — p is an equivariant momentum mapping. 


Remark 10.1.13 If (M, @, W) is aconnected Hamiltonian G-manifold with [ao] 4 0, 
a given momentum mapping J can be made equivariant by the following modifica- 
tion of the coadjoint action on g*: 


®:Gxg*>g*, O(g,u):=Ad*(g)u +a(g). (10.1.11) 


Using (10.1.9), one can check that @ is a left action. Moreover, by construction, we 
have J oW, = @, o J, that is, J is equivariant with respect to ®. 


Proposition 10.1.14 Let (M,w,W, J) be a left Hamiltonian G-manifold. If J is 
equivariant, for all A, B € g one has 

{Ja, JB} = JB, a, (10.1.12) 
that is, J is an anti-homomorphism® of the Lie algebras g and (C°(M), {, }). 


Proof By equivariance of J, we have 


(J 0 Wexpra(m), B) = (Ad*(exptA) o J(m), B). 


Tf W is a right action, J is a homomorphism. 
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Differentiating this equation with respect to ¢ at tf = 0, we obtain 


d 
dij (J 0 Yexpra(m), B) = Ax(Je)(m) = Xj, (Ja)(m) = (Ja, Ja}(m) 
0 


for the left hand side and 


d 
aj (Ad (expt) 0 Fm), B)=(J(m), 1B, Al) = Jia,ai(m) 


for the right-hand side. 


Corollary 10.1.15 Let (M, aw, W, J) be a left Hamiltonian G-manifold and let J be 
equivariant. Letm € M and J(m) = p, lett: G-m— M be the natural inclusion 
mapping and denote the coadjoint orbit through ys by G,, C g*. Then,’ 


fos s*o”s, (10.1.13) 
where w« denotes the negative Kirillov form on @,,, see Theorem 8.4.1. 


For a right action, one obtains the positive Kirillov form on @,.. 


Proof Since w and w°" are G-invariant, it is enough to prove (10.1.13) at m. On the 
one hand, using (10.1.12) and the fact that T,,(G -m) is spanned by Killing vector 
fields, we get 


Om (Ax, Be) = Om(XJ,, X Jy) = (Ja, Ja}(m) = Jpp,ay(m) = —(u, [A, BI) 
for all A, B € g. On the other hand, the equivariance property (10.1.7) yields 
J 0 Wn(g) = J 0 Wy(m) = Ad*(g) 0 J(m) = Ad*(g)u, 
that is, 
Jn 0 Wy, (A) = ad* (A), 
where ad*(A)yu is the value of the Killing vector field a at uu. Now, we have 
(J*a%) (Ax, Bs) = (07) (Tj Ars EB) 
= (0%), (42 Bo) 


= —(u,[A, B]) (10.1.14) 


forall A, Beg. 


Definition 10.1.16 A Hamiltonian G-manifold (g-manifold) is said to be strongly 
Hamiltonian if there exists a momentum mapping satisfying (10.1.12). 


By Proposition 10.1.14, every Hamiltonian G-manifold which admits an equiv- 
ariant momentum mapping is strongly Hamiltonian. 


TSince G,, is an initial submanifold of g*, J restricts toa smooth mapping J : G-m — G@,, denoted 
by the same symbol. 
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For a connected Hamiltonian G-manifold (M, w, W, J), we can derive a criterion 
for J to satisfy (10.1.12) in terms of the mapping o : G > g* defined by (10.1.6). 
For that purpose, we determine the tangent mapping o/:Te-G = g > Tog* = g*. By 
the calculation in the proof of Proposition 10.1.14, we obtain 


d 
(0}(A), B) = a (o(exptA), B)={Ja, Jg}(m) — Jp,aj(m). (101.15) 
0 


This means that J satisfies (10.1.12) iff of vanishes identically, that is, iff J is 
infinitesimally equivariant. 


Remark 10.1.17 Choose a point m € M and define the following antisymmetric 
bilinear form » on g: 
D(A, B):={Ja, Ja}(m). (10.1.16) 


Using (A, B) = @m(Ax, Bs), Proposition 4.1.6 and point 2 of Proposition 8.2.6, 
one can check that 


Z({A, B],C) + Z([B,C], A) + Z((C, A], B)=0 — (10.1.17) 


for arbitrary A, B,C € g (Exercise 10.1.1). An antisymmetric bilinear form on g 
with this property is called a coadjoint 2-cocycle on g. By the Jacobi identity, every 
jt € g* defines a 2-cocycle du by 


5(A, B) := w([A, B]). (10.1.18) 


A 2-cocycle of this form is called a 2-coboundary.® The 2-cocycles form a vector 
space and the 2-coboundaries form a vector subspace. The quotient vector space is 
called the second cohomology group of the Lie algebra g and is denoted by #7 (g). 
One can show that another choice of the point m in (10.1.16) yields an equivalent 
cocycle (Exercise 10.1.1). From (10.1.15) we read off 


(o/(A), B) = E(A, B) + 8(J(m))(A, B), (10.1.19) 
that is, [2] = [o/] as elements of #77 (g). 


The above discussion yields (Exercise 10.1.1) 


Proposition 10.1.18 For a connected Hamiltonian G-manifold (M,w,W), the 
class [XZ] € #/7(g), defined by (10.1.16), does not depend on the momentum map- 
ping J. The action W is strongly Hamiltonian iff [&’| = 0. 

Remark 10.1.19 

1. Let (M, w, W) be a symplectic G-manifold. If g is semisimple, then 


[I9,9]=9 and #7(g)=0, 


8 And 6 is called the coboundary operator. 
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see [314] and [315]. According to Corollary 10.1.8, the property [g, g] = g im- 
plies that a momentum mapping exists. The property .#7(g) = 0 ensures that 
(M, w, W) is strongly Hamiltonian. 

2. Let (M,w,W) be a Hamiltonian G-manifold. If [o] vanishes, there exists an 
equivariant momentum mapping. By Proposition 10.1.14, then (M,@,W) is 
strongly Hamiltonian, so that [X’] must vanish, too. Conversely, if [X'] = 0, 
there exists a momentum mapping J satisfying (10.1.12). Then, (10.1.15) yields 
a, = 0. Hence, by plugging h = exp(tA) into (10.1.9) and differentiating at 
t = 0, we obtain 

Oo, ° Ly = Ad*(g)o0,=0. 
We conclude that o is constant on each connected component of G. Thus, if 
G is connected, then [0] = 0. To summarize, if G is connected, there exists an 
equivariant momentum mapping iff W is strongly Hamiltonian. 


In what follows we discuss Hamiltonian G-manifolds admitting an invariant 
symplectic potential. 


Proposition 10.1.20 Let (M,w, W) be a left symplectic G-manifold. If w = dO for 
some W -invariant 1-form 0, then 

J:M = g*, (J(m), A) = Omn(As), (10.1.20) 
is an equivariant momentum mapping. 
Proof Obviously, for every m € M, J(m) is a linear functional on g. G-invariance 
of 6 implies 

O= L4,0 = Axi dd + d(A,40) 
and hence A, 1@ = —dJg for all A € g. Equivariance follows from 
Om (Ax) = (WO) (Ax) = Ow, (m) (YgxAx) = Ow,(m)((Ad(g~')A),.), 


where we have used Proposition 6.2.2/1. 


An important class of Hamiltonian G-manifolds which admit an invariant sym- 
plectic potential is constituted by the cotangent bundles of G-manifolds. Let (Q, w) 
be a left G-manifold and let z : T*Q — Q denote the canonical projection. Accord- 
ing to Example 6.1.2/5, for every g € G, the diffeomorphism yw, : Q > Q induces 
a point transformation YW, : T*Q — T*Q by 


(¥e(), X)=(E Wl 1X), EETO, XETyor@O, — (10.1.21) 
see also (8.3.9) and (8.3.10). The assignment g+> W, defines a mapping 
VW:GxTO-T* QO, 


which is a left G-action. By construction, the mappings Wg are vector bundle auto- 
morphisms covering the diffeomorphisms y,, 


1 OW, = Wor. (10.1.22) 
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In particular, the canonical projection z is equivariant. An action W on T*Q with 
the property (10.1.22) is called a lift of the action y to T*Q. 

By Proposition 8.3.6, the canonical 1-form 6 on T*Q, and hence the natural 
symplectic form w = dé, is invariant under W. Thus, (T*Q, w, W) is a symplectic 
G-manifold and the assumptions of Proposition 10.1.20 are fulfilled. This yields 


Corollary 10.1.21 Jn the case of the symplectic G-manifold (T* Q,w, WY) associ- 
ated with the G-manifold (Q, yw), the equivariant momentum mapping defined by 
(10.1.20) is given by 


I:TO>g*, (JE), A)=(E, AY (x), (10.1.23) 


where AY is the Killing vector field generated by A under the action wy on Q. 


Proof For A € g, let AY denote the Killing vector field generated by A under W. 
Since z is equivariant, Proposition 6.2.4/2 yields 


(J(), A) = ¢(AY) = (6. 4/(AY ©) = (6. AK (r@))). 
Example 10.1.22 (Momentum) Let Q = R? and let G = R° act by translations: 


Wa(X) :=x+a. 


Under the identification TQ = R? x R?, the tangent mapping of ya is given by 
wi(x, y) = (x +a, y). Hence, the lift ¥ to T*Q@ = R? x R? is given by 


W(x, p) = (x +a, p). 


Using Example 5.3.15, we compute the Killing vector field generated by b € g= 
R?: 


v d 
by (x) =— (x+tb)=b. 
dt fo 


Consequently, for the momentum mapping (10.1.23) we obtain 
(J ((x, p)), b) = p-b, 
that is, 
J((x, p)) =p. 
Thus, J coincides with momentum. This explains the origin of the name momentum 


mapping. 


Example 10.1.23 (Angular momentum) Let Q = IR? and let G = SO(3) act by ro- 
tations: 


WeX = gx. 
Since W leaves invariant the Euclidean scalar product, the lift W is given by 


W(x, p) = (gx, gp). 
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By Example 6.2.5/1, the Killing vector field generated by A € g = 50(3) is 
Av (x) = Ax. 
Hence, for the momentum mapping (10.1.23) we obtain 
(J (x, p), A) =p- Ax. 
Using 
p:Ax=(x~xp)-A, (10.1.24) 


where x denotes the vector product and A € R? denotes the vector corresponding 
to A via the isomorphism (5.2.6), we obtain 


I(x, p) =X x p, 
that is, J coincides with angular momentum. 
Example 10.1.24 (Translations on a Lie group) Consider Q = G with G acting by 
left translation: 
w:GxG-oG, w(g,a):= ga, 
that is, Wy = Ly. The action induced on T*G is given by 
VW:GxT*GoTG, Wy (E) = (Ly-1)7(€), 
and the associated equivariant momentum mapping (10.1.23) is given by 
ETE 9", (JG), A=. (AY) ee). 


Here, Au denotes the Killing vector field of A with respect to the action by left 
translation. According to Example 6.2.5/2, AL coincides with the right-invariant 
vector field generated by A. Hence, AG (a) = (Rg); (A) and we obtain 


J(€) = (Ra@)7 6). (10.1.25) 


In the trivialization x:G x g* — T*G induced by left translation, cf. (8.3.6), W is 
given by the action 


L:=x | oWox:G x (Gx g*) > (Gx g"*), L(a, v) = (ga, v), 
(10.1.26) 


and the momentum mapping J #:=Jo x has the form 
J“ (a, v) = Ad*(a)v. (10.1.27) 


We encourage the reader to check that for G = R, this example boils down to 
Example 10.1.22. 

Analogously, one deals with the right action induced by right translation on G. 
Here, the momentum mapping is given by 


J(E) = Lae) ). (10.1.28) 
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The corresponding left action is induced by right translation with the inverse group 


element, 
wiGxG>G, w(g,a) = ag! =R,-1(a). 
It has the momentum mapping 
J(§) = —(Lxe))" €). 
In the trivialization x, this action is given by 
B&:G x (G x g*) > (G x 9"), %,(a, v) = (ag—', Ad*(g)v), 
and the momentum mapping (10.1.29) takes the form 


JA (a, v)=—Dv. 


(10.1.29) 


(10.1.30) 


(10.1.31) 


Example 10.1.25 (Inner automorphisms of a Lie group) Consider Q = G with G 


acting by inner automorphisms: 
WiGxG>G,  (g,a):=gag", 

that is, Wg = C,. The Killing vector field generated by A € g is 

AS (g) =R/,(A) — L,,(A) 
and the induced action on T*G reads 

W:GxTG>TG,  %(E)=(C,-1) TE). 
For the equivariant momentum mapping (10.1.23), we obtain 
(J(), A) = (Raw) (&) — Cae) (8), 4), & ETEG, 

that is, 

J) =pb—Ad*(r())u, 
with 4 = (Rr@)'" (&). 


(10:1:52) 


Example 10.1.26 (Coadjoint orbits) Let g be a Lie algebra and let @ C g* be a 
coadjoint orbit. According to Theorem 8.4.1, @ endowed with the Kirillov form 


w° (Ax, Bu)() = (u, [A, B]), ME 0,” 
is a symplectic G-manifold. Every A € g defines a linear function 
Ja:g* >R, Ja() = —(, A). 
For B € g, we find 


d 
(Bo)n(Ja) =a (Ad*(exp(tB)) u, A) = —(w, [A, B]) = —w0 (As, Bs). 


0 
Hence, Ay.1w? = —dJ4. This means that the mapping 
J:6—> 9", J(u) =p, 


is a momentum mapping. It is obviously equivariant. 


(10.1.33) 
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Exercises 


10.1.1 


10.1.2 


10.1.3 


10.1.4 


10.1.5 
10.1.6 


10.1.7 


Show that the cocycles on g defined in (10.1.16) by the help of differ- 
ent points differ by a coboundary. Prove Formula (10.1.17) and Proposi- 
tion 10.1.18. 

Let (V,@) be a symplectic vector space and let G C Sp(V, w) be a closed 
subgroup. Show that 


IV gt, (10), A} = —Fo(Av, »), (10.1.34) 


is an equivariant momentum mapping for the action of G on (V, w). (This 
applies in particular to the isotropy representation at an arbitrary point of a 
symplectic G-manifold.) 

Let (M, ow, W) be a Hamiltonian G-manifold with equivariant momentum 
mapping J. Let @ C g* be a coadjoint orbit, endowed with the Kirillov 
form w?. Consider the direct product of G-manifolds M x @ and let pry 
and pry denote the natural projections to the factors. Show that 


© =priot+ prio” (10.1.35) 
is a G-invariant symplectic form on M x @ and that 
K:Mx@—> 9", K(m, pt) := J(m) — p, (10.1.36) 


is an equivariant momentum mapping. (This momentum mapping is used for 
the so-called shifting trick, which will be explained in Remark 10.3.9.) 

Let G be a Lie group and let H C G be a closed subgroup. Consider the 
action of H on G by left translation. Determine the induced action on T*G 
and the corresponding equivariant momentum mapping (10.1.23). 

Prove Formula (10.1.24). 

Let M = R? and consider the action of G = R* on M by translations, 


W:GxM—-M, W(a,x):=x+a. 
Show that 
EM => ff zr, J (x): At= Ajxo — Aox1, 


is amomentum mapping. Calculate [o] and the modified G-action (10.1.11) 
on g* for which J is equivariant. 

Show that the phase space of the n-dimensional isotropic harmonic oscilla- 
tor admits a symplectic action of the unitary group U(n) which leaves the 
Hamiltonian 


New? 
2 2 
invariant. Find a momentum mapping. 
Hint. Identify T*R” with C” via (g, p) + aq + ibp with appropriately cho- 
sen constants a,beR. 
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Let (M, ao, Y, J) be a Hamiltonian G-manifold. From the Noether Theorem 10.1.9 
we know that J is constant along the integral curves of the Hamiltonian vector field 
generated by a G-invariant function H. Thus, the level sets 


Mi:=J'WCM, peg", 


are invariant under the flow of Xy. This means that the dynamics can be reduced 
to such level sets, which leads to an elimination of some of the variables. It can 
then be further reduced by factorizing with respect to the residual symmetry. In this 
section, we provide the algebraic basics needed for this reduction procedure. For that 
purpose, we investigate the algebraic structure of the tangent spaces of M induced 
from the tangent mapping J’ and from the orbit structure of the action W. The key 
for this analysis is the Witt-Artin decomposition induced by the kernel of J’. In 
what follows, we assume J to be equivariant. 

Let m € M and let w = J(m). Let G, be the stabilizer of 4 under the coad- 
joint action, let G,, be the stabilizer of m under W and let g,, and g» denote the 
corresponding Lie algebras. Since J is equivariant, we have 


Gin C Gu, Gm C Gy- 
For the convenience of the reader, we recall the following. 


(a) The orbits of G and G,, through m are denoted by G- m and G,, - m, respec- 
tively. 

(b) G,, acts on T,,M via the isotropy representation, cf. Proposition 6.1.5/4. 

(c) The w,,-orthogonal complement of a subspace V C T,,M is given by 


Vom = {Xm E€TmM : Om(Xm, Ym) =0 for all Yn € V}. 
(d) The annihilator of a vector subspace h C g in g” is given by 
b° = {uw € 9": (uz, A) =0 for all Ac b}. 
The following lemma characterizes the kernel and the image of J;,. 


Lemma 10.2.1 Let (M,w,W, J) be a Hamiltonian G-manifold with equivariant 
momentum mapping and letm € M and 4 = J(m). Then, we have 


ker J), =Tn(G +m)", (10.2.1) 
imJ), =9,, (10.2.2) 
(ker J.) ker, = TG a). (10.2.3) 


Proof For Ym € TmM and A € g, we calculate 

w(Ax(m), i) = (XJ, (m), Yn) = —dJ4(¥in) = —¥n (Ja) = (Ji, (Ym), A). 
Since the Killing vector fields of W span T,,(G - m), this proves (10.2.1). Next, for 
the mapping (J/,)': g > T*,M dual to J/,, we find 


m? 


((J/,)"(A), Yn) = (Jf, Yin), A) = 4 Yn) = — (Ax tm; Yn) 
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and thus (J/,)"(A) = —Ax1@m. Due to Proposition 6.2.2/3, this implies 
T 
ker(J;,) = {A €g: Axiom =0}= {A € g: A,(m) = o} = Gm- 


Since the image of a linear mapping coincides with the annihilator of the kernel 
of the dual mapping, we obtain im J/, = (ker(J/,))° = oe which proves (10.2.2). 
Finally, according to (10.2.1), the elements of (ker J/,)®" can be written in the form 
Wi (A) with A € g. By Proposition 6.2.4/2 and (6.2.3), the equivariance property 
(10.1.8) implies 


Ji oW! (A) =ad* (A). 


Thus, W/,(A) € ker J, iff ad*(A)w = 0. By Proposition 6.2.2/3, this holds iff A € 
G- Since the Killing vector fields of g,, span the tangent spaces of the orbit G,, -m, 
we obtain (10.2.3). 


Corollary 10.2.2 Let (M,w, YW, J) be a Hamiltonian G-manifold. 


1. The rank of J}, is equal to the dimension of the orbit through m. 
2. The momentum mapping J is a submersion at m iff Gj» is a discrete group. If V 
is proper, this is equivalent to Gm being a finite group. 


Proof 1. Using (10.2.1) and point 2 of Proposition 7.2.1, we obtain 


dim(T,,M) = dim(ker J;,) + dim(im J;,) 
= dim(T,(G -m)®”) + rank Jj, 
= dim(T,, M) — dim(Tin (G.- m)) + rank J/, (10.2.4) 
and hence dim(T,,(G -m)) = rank J},. 

2. The momentum mapping J is a submersion at m iff its rank is maximal, that 
is, iff rank J’, = dimg*. By point | and dimT,,(G - m) = dimg — dimgy, this is 
true iff dim g,, = 0, that is, iff G,, is discrete. If W is proper, then G,, is compact 
and, therefore, finite. 


Remark 10.2.3 


1. Since Gy», is closed, one has dimg,, = 0 for all m € M iff W is locally free, 
which means that for every m € M there exists an open neighbourhood U of the 
identity e € G, such that 


UNGm = {e}. 


Thus, if W is locally free, point 2 of Corollary 10.2.2 yields that the momentum 
mapping is a submersion and hence every value of J is regular. This is of course 
in particular true for a free G-action. 

2. In complete analogy, the action of G,, on M,,, viewed as an action of a topolog- 
ical group on a topological space, is locally free iff dim g,, = 0 for all m ¢ M,,, 
that is, iff u € g* is regular. 
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Now, we will discuss the Witt-Artin decomposition of the symplectic vector 
space T,,M, m € M, with respect to the subspace W = ker J;,. Choose subspaces 
E and F such that 


W=E@WNW”™, we =FeWnNWwW. 


By Theorem 7.3.3, E and F are symplectic and we have the following direct sum 
decomposition of T,, M@ into symplectic vector subspaces: 


TnM=E@FO(ES@F)?. (10.2.5) 


Note that (E @ F)® contains W 1 W®" as a Lagrangian subspace. According to 
Lemma 10.2.1, 


W=Tn(G-m)",  W°"™=T (Gem), WOW" =Tn(G,-m). 


For what follows, we assume that G acts properly. Then, the stabilizer G,, is 
compact and there exists a G,,-invariant scalar product (-,-) in T;,M, cf. Propo- 
sition 5.5.6. Let us choose E and F as orthogonal complements of the subspace 
WO W® with respect to this scalar product. 


Lemma 10.2.4 The symplectic subspaces E, F and (E ® F)® are Gm-invariant. 


Proof By Remark 6.2.10/1, W°" = T»(G-m) is Gm-invariant. Since @m is Gm- 
invariant, the w,,-orthogonal complement W = T,,(G -m)®” is invariant, too. This 
proves the invariance of WM W®”. Now, the assertion follows from the Gy,- 
invariance of the scalar product and of @m. 


Since Gy», is compact, there exists an Ad(G,,,)-invariant scalar product in g and 
we obtain Ad(G,,,)-invariant orthogonal vector space decompositions 


Gu = Im OM, G=Gn BMG q. (10.2.6) 
There correspond Ad*(G,,,)-invariant decompositions of the dual vector spaces, 
9. =GnOem*, g*=g,Om* oq". (10:27) 


Here, g;,, m* and q* are identified with the annihilators in g* of, respectively, m@ q, 
Gm ® q and g,, ® m. Let us rewrite the Witt-Artin decomposition (10.2.5) using the 
subspaces m and q. First, it is clear that the mapping W/, : g > TM induces vector 
space isomorphisms 


WOW =Tm(Gy-m) =m, we =Tn(G-m)=m@q. (10.2.8) 
Second, if we choose the G,,-invariant scalar product on T,, M so that 
(Yn (A), Yn (BY) = (A, B) 
for all A, Be m@ q, then ¥/, induces an isomorphism 


F>q. 


10.2 The Witt-Artin Decomposition 507 


Third, by choosing a Lagrangian complement L of W1 W®” in (E @ F)®, from 
Proposition 7.2.9 we obtain a symplectomorphism 


(E@ FY? =Wwnw’ @(Wnwen)* 
and hence, by (10.2.8), a symplectomorphism 
jm (E® F)? > mem". (10.2.9) 


The latter induces a linear embedding 4, : m* — T,,M with image L. Then, the 
inverse of jm is given by the mapping 


(Yin) im BAL: m@m* —> Ty, M. (10.2.10) 


Finally, we introduce the standard notation V,, = E. To summarize, from the Witt- 
Artin decomposition (10.2.5) we obtain the @,,-orthogonal decomposition 


TnM =q® (m@ m*) @ Vin, (10.2.11) 


given by the vector space isomorphism (Y%/,) tq ® (Wi tm @® AL Blm, where ty : 
Vin > TmM is the natural inclusion mapping. This decomposition is usually re- 
ferred to as the G,,-invariant Witt-Artin decomposition of the tangent space.” 


Proposition 10.2.5 The Witt-Artin decomposition (10.2.11) induces the following 
decomposition of the symplectic form w atm € M,,: 

om = (Ita) + jeeoMO™ + wher, (10.2.12) 
Here, w is the (negative) Kirillov form on Cu, w@™™ denotes the canonical 


symplectic form on m ® m* given by (7.1.5) and w'™ is the restriction of @m to the 
symplectic subspace Vin. 


Proof Since the decomposition (10.2.11) is @ ,-orthogonal, for A; € q, By € m, 
o; © m* and v; € V,i = 1, 2, we find 
Om(Atx + Bix + AL(01) + 01, Are + Box + 1 (02) + v2) 
= Wm (Ala, Ar) + Om (Bix + AL (01), Box + AL(G2)) + Om (U1, v2). 
By Corollary 10.1.15, 
Om (Aix, Ar) = (J*@%), (Alu, Are): 
Since jm is a symplectomorphism and since its inverse is given by (10.2.10), 
Om (Bix +AL(01), Box + Ax (02)) = (02, Bi) — (01, B) 
= ((Bi, 01), (Bz, 02)) 
= j,o™™ (By +1 (01), Box + Ax (02)). 


°Note that this decomposition need not be orthogonal with respect to the G,,-invariant scalar prod- 
uct chosen above. If one wants to have an orthogonal Witt-Artin decomposition, one has to redefine 
the original scalar product by choosing a scalar product on each component and taking the orthog- 
onal direct sum. 
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Remark 10.2.6 


1. Let mo € M. The symplectic vector space Ving is called a linear symplectic slice 
for W at mo. Since q is invariant, along the orbit G - mo, the symplectic slices 
can be chosen as Vw, (mo) = w Vig Then, the union over all m € G- mo of 
the subspaces V,, forms a vertical vector subbundle of (TM) }G.mno, called the 
symplectic normal bundle over G - mo. It is isomorphic to the vector bundle 
G XG,, Vino associated with the principal G,,,.-bundle G > G/Ging. 

2. The Witt-Artin decomposition will be used for the proof of the Symplectic Tubu- 
lar Neighbourhood Theorem in Sect. 10.4. In particular, we will use that, accord- 
ing to (10.2.11), the injective linear mapping 


A:=AL ®lm: m* © Vn > TnM 
sends m* @ V,, onto a vector space complement of T,,(G -m) in T,,M. This 
implies 

TM =Tn(G-m) Om OV,,, (10.2.13) 


which may be interpreted as the infinitesimal version of the Symplectic Tubular 
Neighbourhood Theorem. According to Proposition 10.2.5, with respect to this 
decomposition, the symplectic form @,, is given by 


Om(Atx + A(01, V1), Azx + A(O2, v2) 
= (02, A1) — (01, A2) — (wu, [A1, A2]) + wy (v1.02), (1.2.14) 


where A; € g, oj € m* and v; € Vin. 

3. If yw is regular, Corollary 10.2.2 implies that g,, = 0 and hence g,, = m. More- 
over, by the Level Set Theorem 1.8.3, M,, is an embedded submanifold of M and 
ker J}, = TmM,,. Hence, Lemma 10.2.1 yields 


Tn(G-m)°" =TnM,, (10.2.15) 
Tn(G-m)OTmMy = Tn(Gy-m), (10.2.16) 

and thus 
Tn(G +m) =(TmM,). (10.2.17) 


These facts will be used in Sect. 10.3. 
4. If 4 =0, then G,, = G and hence q = 0. If, in addition, 0 is a regular value, then 
Gm = 0 and thus m= g,, = g. Then, (10.2.17) yields 


(Tin Mo ee Ee (G-m) CTmMo, 


that is, Tj, Mo is a coisotropic subspace and the symplectic slice V,, is isomor- 
phic to the symplectic vector space obtained by linear symplectic reduction, 
cf. (7.3.3). 


!0When working with scalar products, this would require the scalar product on Ty, (mo)M to be 
defined by ae 5 ae -), which makes sense, because (-, -) is Gj) -invariant. 
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10.3 Regular Symplectic Reduction 


Now we are prepared to turn to symplectic reduction in the context of Hamiltonian 
G-manifolds. In this section, we present the reduction theorem for the regular case. 
This classical result is due to Marsden and Weinstein [195] and to Meyer [210]. 

Let (M, o, W, J) be a left Hamiltonian G-manifold with proper action and equiv- 
ariant momentum mapping and let yz be a regular value of J. Then, by the Level Set 
Theorem 1.8.3, M, = J~!(u) is an embedded submanifold of M. Since it is G,,- 
invariant, according to Proposition 6.3.4/1, W restricts to a proper action 


WY: Gy x My > My. 
According to Remark 10.2.3/1, this action is locally free. One can show that in this 
case the orbit space M,,/G, is a symplectic orbifold, see [69]. In the sequel, we 


restrict ourselves to the special case where the G,,-action is not only locally free, 
but free. 


Theorem 10.3.1 (Regular Reduction) Let (M, w, W, J) be a Hamiltonian G-mani- 
fold with proper action and equivariant momentum mapping. Let 1 € g* be a regular 
value of J and assume that the induced G,,-action is free. 


1. The topological space M,/G,, carries a unique manifold structure such that the 
natural projection ty: M, > M,,/G, is a submersion. 
2. There exists a unique symplectic form w on M,/G, such that 


rial = jXo, (10.3.1) 


where j,: My, — M denotes the natural inclusion mapping. 


In the context of Hamiltonian systems, the symplectic manifold (M,/Gy, o") 
is referred to as the reduced phase space. 


Proof Point | is due to Corollary 6.5.1. To prove point 2, we observe that since 7, 
is a surjective submersion, every tangent vector of M,,/G,, can be written in the 
form 1X for some X ¢ TM,,. Thus, we may define w” by 


OF im) (MX MY) = Om (X,Y), (10.3.2) 


where m € M, and X,Y €T,,M,,. To prove that w" is well-defined, we must show 
that the right hand side does not depend on the choice of m, X and Y. For that 
purpose, let m ¢ M, and X,Y € T;,M, such that z,,(m) = 2, (m), mx = x 
and nw = Ty Y. Then, there exists g € G such that 


m=Wiin), — (Wh)/X—XeTn(Gu-m), (WEY VY —Y €Tn(Gy-m). 


By G-invariance of w, we have 
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By (10.2.17), the right hand side equals w,,(X, Y). Hence, the definition (10.3.2) 
makes sense, indeed. 

By construction, the pointwise defined 2-form w" fulfils (10.3.1). Since z,, is 
a surjective submersion, this implies that w” is uniquely determined and smooth. 
For the same reason, dw = 0 implies dw” = 0. It remains to show that w” is non- 
degenerate. Thus, let m € M,, and X € T,,M, such that 


Of imy(%pX.T,Y) =O forall ¥Y € Tn My. (10.3.3) 


We have to show that this implies 1,X = 0. By definition of w“, (10.3.3) implies 
@m(X, Y) =0 for all Y € Tn M,, that is, X € (TnM,)®". Now, by (10.2.16) and 
(10.2.17), 


(TM ay AVTnMy =Tn(Gy-m). 
Thus, ),X = 0 holds, indeed. 


Remark 10.3.2 The assumption that uw € g* be regular can be weakened. According 
to Proposition 1.7.6, if J is a subimmersion, M,, is still a closed submanifold. In 
this case, 4 is sometimes called weakly regular. For a discussion of reduction under 
this weaker assumption, we refer to [181]. 


Now we can discuss the reduction of a G-invariant Hamiltonian system 
(M, w, H) with equivariant momentum mapping: 


Proposition 10.3.3 Let (M,w, W, J) be a Hamiltonian G-manifold with proper ac- 
tion and equivariant momentum mapping and let H € C®(M)®. Let w € g* be a 
regular value of J and assume that the induced G,-action is free. 


1. M,, is invariant under the flow of X q and X y restricts to a vector field X = on 
M, which is j,-related to X 7, 


i, °Xy =XHo jp (10.3.4) 
2. H defines a unique smooth function H,, on M,/Gy, by 
Ay, ot, =H o jy. (10.3.5) 
The corresponding Hamiltonian vector field X y,, is 1,-related to xO 
Xu, omy, =m,0Xn, (10.3.6) 


The Hamiltonian system (M,,/G,,, @", H,,) is referred to as the reduction of the 
Hamiltonian system (M, w, H) at w. 


Proof 1. Invariance of M,, follows from the Noether Theorem 10.1.9. Then, X # is 
tangent to M,, and the rest of the assertion follows from Proposition 2.7.16. 

2. Since Hf is G-invariant, a function H,, on M,,/G,, satisfying (10.3.5) exists. 
Since z,, is surjective, H,, is uniquely determined and since z,, is a submersion, 
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H,, is smooth. It remains to prove (10.3.6). According to (6.7.5), since xo is Gy- 
invariant, it projects to a vector field X r on M,,/G,, uniquely determined by 

XG Oty ano XS. (10.3.7) 


It suffices to show that xe = Xy,,. By (10.3.5), we have dH, om), = dH 0 j,,. 
Using this, as well as (10.3.4) and (10.3.7), form € M,, and Y € T,,M,, we obtain 


(dA) sryim) (7, ¥) = GH) jm (iY) 
= —0j,(m)((XH) j.0m)> jn) 
= — (75) (XB) mY) 
= (210 (Xm Y) 
— =O 0 (Xt) sain)’ I, Y). 


It follows that dH, = —Xh aw" and hence x = XH,» indeed. 


Corollary 10.3.4 The Poisson structures defined by w and w" are compatible, that 
is, one has 


{fur hpsomp ={h hho jn, (10.3.8) 
for all functions f,h € Crs )° and fy,hy, € C~(M,,/G,,) related by (10.3.5). 


Proof Using (10.3.4) and (10.3.6), we calculate 


{fur hy}(tp(m)) = Or (my X fur Xn,) 
= (71,0!) (Xp Xi) 
= Fi) (XX) 
= 0 j,(m)(X f, Xn) 
={f,h}(Gu(m)). 
Now, let us apply the Regular Reduction Theorem 10.3.1 to the action of G on 


T*G induced by left translation, cf. Example 10.1.24. Thus, we consider Q = G 
with G acting by 


WiGxQ-> Q, W(g, a) :=Lg(a) = ga, 
and with the induced action YW on T*Q given by 
VW:GxTO>-T*Q, W, (E) = (Ly-1)'"€). 
Recall from Example 10.1.24 that the corresponding equivariant momentum map- 
ping is given by 
JG) = (Rr@)' 6), 


cf. (10.1.25). The action W is obviously free. Since w is proper, Remark 6.3.9 im- 
plies that W is proper, too. Let u € g* =T?Q and denote by a, € 92'(Q) the right 
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invariant differential form on Q defined by a,,(e) = 2. Thus, a,,(a) = (R,-1 a: 
Since 

(F(cu.(a)), A) = (a,(a), (Ra), (A)) = (us, A), 
we obtain J (a@,,(a)) = yw. This implies 


(T* OQ), = J! (u) =ay,(Q), 


that is, the level set of jz is given by the image of the 1-form ,,. Thus, (T*Q),, is 
an embedded submanifold, diffeomorphic to the group manifold G. Due to 


W (or. (a)) = (Ly-1)/" 0 (Ry-1)"(u) = (Ry-1 0 Ry-1)/F 0 Ad*(g7")u,, 
the G,,-action on (T*Q),, is given by 
Wy (ap (a)) =a, (ga). (10.3.9) 
Since (Rz(¢))’ T is fibrewise bijective, J is a submersion. Hence, we can apply the 
Regular Reduction Theorem 10.3.1, which yields a symplectic manifold structure 


on the quotient (T* Q),,/G,. The following theorem yields an explicit description 
of this quotient. 


Theorem 10.3.5 Let G be a Lie group and let (T*G,w,W, J) be the Hamilto- 
nian G-manifold of Example 10.1.24. For every wu € g*, the reduced phase space 
((I*G),,/G,,, w") is isomorphic to the coadjoint orbit of 4 endowed with the posi- 
tive Kirillov structure. 


For jz = 0, both symplectic spaces obviously degenerate to the one-point-space. 


Proof Let @ be the canonical 1-form and w = dé the canonical symplectic form on 


T*G. Denote by wr the positive Kirillov form on the coadjoint orbit @,, of jz, see 
Sect. 8.4. Consider the mapping 


(T*G) > Op, ay (a) > Ad*(a~") 
Since for any g € G,,, this mapping sends Wa, (a) = a, (ga) to 
Ad*((ga)~')u = Ad*(a7!) 0 Ad*(g7!)w = Ad*(a7') yu, 
it induces a bijection 
g:(T*G) /Gu > On. (10.3.10) 
We decompose ¢ as follows: 
(T*G) ,/Gu > G/Gy > On. 


Here, the first mapping is obviously a diffeomorphism, induced by the natural pro- 
jection of T*G. The second mapping is the diffeomorphism (6.1.8) provided by the 
Orbit Theorem 6.2.8. Thus, is a diffeomorphism. We prove 


+ 
go =o". 
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Due to (10.3.1), for that purpose it is enough to show that 
1 oto =win(rG),- (10.3.11) 


Now, the tangent vectors to (T*G),, at a,,(a) can be written as al, A(a), with A 
denoting the right invariant vector field on G, generated by A € g. Thus, for the left 
hand side we get: 


a ° own (a), A(a), a’, B(a)) = oF ((y OT, © a,)' A(a), (pom,0 a,,)' B(a)). 
For any f €C™°(G@,), 
1% d 
(pom, 0a,)' A(a))(f) = a ei 007, 0d, (exp(tA)a) 
=— f(Ad*(a~! exp(—tA))u) 
0 
=— if Ae (a! exp(—tA)a) o Ad*(a~') 2) 
= —(Ad(a")ay (Aa (au), 
where a denotes the Killing vector field of the coadjoint action generated by A. 
It follows that 
(yom, 00,) A(a) = —(Ad(a~!)A)“* (Ad*(a7!) u) 
and the left hand side of (10.3.11) takes the form 
oH ((Ad(a7') 4), (Ad(a~') BY") (Ad" (a~')) = —(u4,[A, Bi). 
Here we have used (8.4.2). For the right hand side of (10.3.11) we have 
(a, A(a), a, B(a)) = (ata) (A, B) 
= (da,)a(A, B) 
= Aa((ay, B)) — Ba((op, A)) — (ot, LA, BI)(a), 


where we have used (8.3.3). By right-invariance, (a,,, B) = (4, B). Hence, the first 
two terms vanish and we obtain 


—(a,, (A, B])\(a) = (1, [A, B)). 


Thus, we have proved (10.3.11), which implies that the mapping (10.3.10) is a sym- 
plectomorphism, indeed. 


To conclude this section, we discuss an alternative reduction prescription leading 
to the same reduced phase spaces. It is usually referred to as orbit reduction. We will 
see that the choice of a concrete value jz € g* is not important and that it is in fact 
the orbit @,, through jz which is relevant. Consider the preimage 


My, = I"(64) (10.3.12) 
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of the orbit Cn under J. Since J is equivariant, we have M,, =G.-M,,. As in 
the Regular Reduction Theorem, we assume that py is a regular value and that G,, 
acts freely on J~!(j), which is equivalent to the assumption that G acts freely on 
M,,. By the Orbit Theorem 6.2.8, @,, is an initial submanifold of g*. Since ju is 
regular, J is a submersion at every point of .%, and hence the Transversal Mapping 
Theorem 1.8.2 implies that .%,, is an initial submanifold of M. Since 4, is G- 
invariant, the action W restricts to an action of G on .@,. By Proposition 6.3.4/1, 
this action is proper. Hence, Corollary 6.5.1 endows .@,,/G with a unique smooth 
structure such that the natural projection #,, : 4, > -@,/G is a submersion. Since 
%,, is initial, the natural inclusion mapping M,, > -@,, is smooth. It is easy to see 
that it descends to a bijection 


¢:My/Gy > M,/G. (10.3.13) 


Since z, and 7, are submersions, ¢ is a diffeomorphism. 


Remark 10.3.6 


1. The G-manifold .%, has the structure of a bundle with fibre M,, associated with 
the principal G,,-bundle 2, : My, — M,/G,. Indeed, the mapping 


F:GxG,My> My, — F([(g,m)]):=Ye(m), (103.14) 


is a G-equivariant diffeomorphism (Exercise 10.3.1). 

2. Alternatively, one can use the Tubular Neighbourhood Theorem for showing that 
M,, is a submanifold of M on which G acts smoothly, freely and properly (Ex- 
ercise 10.3.2). 


Proposition 10.3.7 Let (M, w, W, J) be a left Hamiltonian G-manifold with proper 
action and equivariant momentum mapping. Let j1 € g* be a regular value and let 
@,, be the coadjoint orbit of 4. Assume that the restriction of W to the submanifold 
M,, is free. Let ty: M, — M denote the natural inclusion mapping. 


1. There exists a unique symplectic form o” on M4,,/G such that 
tO" = hot Teo. (10.3.15) 
2. The symplectic manifolds (M,,/G,,@") and (%4,,/G,@") are symplectomor- 
phic. 


Proof 1. The right hand side of (10.3.15) defines a 2-form pointwise on %,,/G: for 
A,Begandm €.%,, we calculate, denoting J(m) = yl 


(17,0) (Ax, B,) + (J*o),, (Ax, Bx) 
+ 
be 


C 
=m (X74; X Jp) + oy (J, (Andes Jy. (By)m) 


'l Alternatively, one may observe that, on Killing vector fields, (U,@)m coincides with the pull-back 
of w to the orbit G - m and apply Corollary 10.1.5. 
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= Jip,ay(m) + (v, [A, B)) 
= (v,[B, A]) +(v, [A, B]) =0. 


Thus, @” is well-defined. Since z,, is a surjective submersion, @” is smooth and 
dw =0=dw%t implies da = 0. Non-degeneracy will follow from point 2. 

2. We show that the diffeomorphism g given by (10.3.13) fulfils g*@" = ow". 
This implies in particular that @ is non-degenerate and hence symplectic. Since 7, 
is a submersion, it is enough to show that Tit og at =rihot. Within: My > By 


u 
denoting the natural inclusion mapping, we have g oz, = 7, 07, and hence 


ss zw : ~ : . ot 
TOP OY = (Ty Oin)*O" = (ly city) O+ (J oi) or. 


Using ¢, oi, = jy, and (10.3.1), for the first term of this sum we get ji@ = 7/0". 


The second term vanishes, because J 0 i,, is constant. This proves y*@" = w". 


Corollary 10.3.8 The Poisson structures of (My/Gy,@") and (4,,/G,@") are 
isomorphic. 


Remark 10.3.9 Orbit reduction can be also performed using the shifting trick, 
see [232], Theorem 6.5.2: for a given Hamiltonian G-manifold (M, w, YW, J) with 
equivariant momentum mapping J and a coadjoint orbit @, one considers the G- 
manifold M x @ with symplectic form 


O=TO+M,o0 
and with equivariant momentum mapping 
K:Mx@> 9", K(m, ) := J(m) — p, 


cf. Exercise 10.1.3. Thus, instead of considering the preimage J~'(@), one can 
consider K~!(0). This way, the symplectic reduction problem is reduced to the case 
w=0. 


Exercises 

10.3.1 Show that the mapping (10.3.14) is a G-equivariant diffeomorphism. 

10.3.2 Using the Tubular Neighbourhood Theorem, show that the subset .Z, yw of M, 
defined by (10.3.12), is a submanifold on which G acts smoothly, freely and 


properly. 


10.4 The Symplectic Tubular Neighbourhood Theorem 


The results of this section will be used for the subsequent discussion of singular 
symplectic reduction. Let (M,w,W) be a Hamiltonian G-manifold with proper 
action and equivariant momentum mapping J. Let O be an orbit of © and let 
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@ = J(O) be the corresponding coadjoint orbit. Choose a point m € O and de- 
note 4 = J(m). The starting point of our analysis is the Witt-Artin decomposi- 
tion!* (10.2.13) of T,,M defined by the choice of an appropriate linear embedding 
A:m* ® V > T,,M onto a vector space complement of T,,O. This way, m* @ V 
may be viewed as a model for the normal space to the orbit at m. According to 
the Tubular Neighbourhood Theorem 6.4.3 and Remark 6.5.8 there exists a tubular 
neighbourhood U of O in M and a G-equivariant diffeomorphism 


x:U > E=G xg, (m* @V) 


onto an open neighbourhood E of the zero section of E, viewed as a vector bun- 
dle over G/G,,. Our aim is to construct a Hamiltonian G-manifold structure on E 
and to deform x in a G-equivariant way so that it becomes symplectic and thus 
an isomorphism of symplectic G-manifolds from U onto E. In the course of this, 
we will use the following two commuting left G-actions on G x g* introduced in 
Example 10.1.24: 


(a) the action induced by left translation on G, 
£:G x (Gx 9g") > (Gx 9°), 2£,(g, Vv) = (ag, v), (10.4.1) 
with equivariant momentum mapping J od (g, v) = Ad*(g)v, 
(b) the action induced by right translation with the inverse group element, 
&:G x (Gx g*) > (Gx g*), alg, v)=(ga', Ad*(a)v) (10.4.2) 
with equivariant momentum mapping J %& (g,v)=—v. 


Since E is the quotient of a free G,,-action on G x (m* @ V), it is natural to 
construct E by regular symplectic reduction of a free Hamiltonian G,,.-manifold. 
For that purpose, we add the factor g*, as a symplectic partner for the necessar- 
ily isotropic G,,-orbits and consider the following auxiliary trivial vector bundle 
over G: 


E:=G x (gf, @m* @V). (10.4.3) 
This bundle is endowed with the G,,-action 
(a, (g.n, p,v)) > (ga7!, Ad*(a)n, Ad*(a)p, (Wa)i,v) (10.4.4) 
and with the G-action induced by left translation on the first factor, 
(h, (g,7, 2, v)) (hg,n, p,v). (10.4.5) 


By means of a chosen G,,-invariant scalar product on g, we can identify g*, @ m* 
with as, and the latter with a linear subspace of g*. Accordingly, for (g, 7, 0, v) € &, 
we can decompose 


Tie,n,0,n)E = 4B Gu BG;  V, (10.4.6) 


12Th the sequel, for convenience, we write V = Vin. 
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cf. Sect. 10.2 for the notations. Here, q = g/g,, can be viewed as the tangent space 
to the coadjoint orbit @, at ~ and gy ® rs models the tangent spaces of T*G,.. 
This suggests to take the following closed 2-form as a candidate for a symplectic 
structure on €: 


wr = pro +o oF +a. (10.4.7) 


Here, wl ‘ wo! © and oY denote, respectively, the (negative) Kirillov form on @ = 
@,,, the natural symplectic form on T*G and the symplectic form on V C T,M 
induced from w,,. The mapping p, is given by 


Pup:G> Gy, Pulg) = Ad*(g)u 


and t,, : G x gi, > T*G is the embedding defined by the left trivialization of T*G, 
cf. (8.3.6). According to (10.4.3), we write tangent vectors of € at (g, 7, 0, v) in the 
form 


(L,A,é,0,u), Aéeg, €€g,,0Em*, weV. 


A straightforward calculation using (8.3.8) and (8.4.2) yields 


Oe np») (Le Al, &1,01,u1), (Li, Ao, &,02,2)) 
= (€ +01, Az) — (& +02, A1) —(u +t p, [A1, A2]) + @m (ui, U2). 
(10.4.8) 


Recall from Exercise 10.1.2 that the linear G,,-action on V induced from the 
isotropy representation is Hamiltonian with momentum mapping 


1 
IV:V > gk, (JP (v), A) = 5@m(v, Hess (Ax)v). (10.4.9) 


Lemma 10.4.1 There exists an open neighbourhood €* of the zero section in €, 
invariant under both G and Gy, such that w™ is symplectic on €* . Moreover, 


1. (€*%, w*) endowed with the action (10.4.4) is a Hamiltonian G-manifold with 
G -equivariant and G-invariant momentum mapping 


K:€% +g, K(g.n,p,v):=J"(v) =n, 


2. (€*%, w*%) endowed with the action (10.4.5) is a Hamiltonian G-manifold with 
G-equivariant and G,,-invariant momentum mapping 


JX: €*% > gf, J*(g,n, p,v) = Ad*(g)(u+7 + p). 
Denote the restriction of the G-action (10.4.5) to €* by W*. 


Proof First, we check that w”* is invariant under both the action (10.4.4) and the 
action (10.4.5). This is a consequence of the following facts: 
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(a) py (hg) = Ad*(h) p,,(g) for all h € G and Pulga!) = py (g) forallae Gy, 

(b) the left trivialization of T*G intertwines the action of G on T*G induced from 
left translation (right translation by the inverse group element) on G with the 
actions # and &, respectively, 

(c) w° is invariant under the action of G on On, ow! © is invariant under point 

transformations and w” is invariant under the isotropy representation. 


Second, from (10.4.8) we read off that under the identification (10.4.6), ©(1,0,0,0) 
coincides with the direct sum of the Kirillov form on T,, @ = q, the canonical sym- 
plectic form on g,, ® Gh and w” . Hence, it is non-degenerate. By G-invariance, then 
w”* is non-degenerate on the zero section of € and hence on some G-invariant open 
neighbourhood €* of the zero section. Since Gm is compact, €* can be chosen to 
be G,,-invariant as well. 

1. To see that K is amomentum mapping for the G,,-action (10.4.4), we interpret 
€ as the direct product of the symplectic G,,-manifolds 

(G x go, om", pron + “io! ©) 

and (V, ow’). One can check that the first one is Hamiltonian, where the momentum 
mapping is obtained from that of the action Z on (G x g", wo! G) by restriction 
to G x g*, © m* and by composition with the projection! g* — g*,. That is, the 
momentum mapping is given by (g, 7, 0) + —n (Exercise 10.4.2). Since K is the 
sum of the latter and the momentum mapping J for the G,,-action on V, it is a 
momentum mapping for the direct product. Finally, G,,-equivariance follows from 
that of J” and G-invariance is obvious. 

2. To prove that J* is a momentum mapping for the G-action W*, we show that 


w* (AX, Z) =—Z(JX) (10.4.10) 


for all A € g and all vector fields Z on €*. Here, A denotes the Killing vector field 
generated by A under W™. For (g, 7, 0, v) € E*, we find 


(AP) (e.n.p.) = (Le (Ad(s~') A), 0, 0, 0). 
Hence, writing Z(g,n,9,v) = (L,B, &,o,u) and using (10.4.8), we compute 
np.) Ae Z) = {E +0, Ad(g”') A) — (uv ++ p,[Ad(g')A, B]). 
On the other hand, 
d 
Zeno. (Ja) = Ge Ad (gexpCB))(u + +o +1 +0), A) 
0 


=(u+n+p,[Ad(g-!)A, B])+(& +0, Ad(g7!)A). 


This proves (10.4.10) and, hence, that J* is a momentum mapping for Y*. Equiv- 
ariance is obvious and G,,-invariance follows at once from Gy, C Gy. 


13 Induced by the injection g — g. 
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Obviously, K is a submersion and the action of G,, on €* is free and proper. 
Hence, we can apply regular reduction to construct the symplectic manifold 
K~'(0)/Gn. 


Lemma 10.4.2 The neighbourhood €* can be chosen so that there exist Gm- 
invariant open neighbourhoods m* of the origin in m* and V of the origin in V 
such that the mapping 


E =G XGin (m* x V) = K~!(0)/Gmn, [(g, P, v)| ad [(g. J (v), p, v)], 


is a diffeomorphism. 


Proof For given €*, there exist w* and V such that the mapping 
Gxi*xV> €%, (g,p,v)+>(g,J”(v),p,v), (10.4.1) 


is defined. By construction, this mapping is bijective onto an open subset of K~!(0). 
Differentiability (in both directions) follows from the fact that K —1(0) is an embed- 
ded submanifold. Since K~!(0) carries the relative topology induced from €*, the 
latter can be shrunk so that the mapping (10.4.11) becomes a diffeomorphism onto 
K~!(0). Since JY is Gy, -equivariant, so is this diffeomorphism. Hence, it descends 
to a bijection of the quotients. Since the natural projections involved are submer- 
sions, this bijection is in fact a diffeomorphism. 


Via the diffeomorphism of Lemma 10.4.2, the symplectic form of K —10)/Gin 
inherited from w™* induces a symplectic form @ on E. This form is uniquely deter- 
mined by the relation 

Tg @ = 19w, 
where i9 : K~!(0) > €* is the natural inclusion mapping and mo: K~ 10) > 
E is the submersion obtained by composing the natural projection K~!(0) > 
K~!(0)/Gm with the inverse of the diffeomorphism of Lemma 10.4.2. To derive 
an explicit formula for ®, we denote the natural projection G x ti* x V > E by x 


and write tangent vectors of E at [(g, e, v)] in the form z Mas o,u), where A € g, 
o €m* and u € V. Then, from (10.4.8) we read off 


O(g,p,v)(H' (LZ A1, 01, 41), 0 (Li Az, 02, u2)) 
= (01+ (J), i), Aa) — (02 + (J"), 2), Al) 
—{ut+tpt+ v),[A1, A2]) + @m (U1, U2). 4. 
( JY (v), (At, Aol) + @m (ur, 2) (10.4.12) 


Moreover, since K is G-invariant, so is K~!(0). Hence, W% restricts to an action 
W° of G on K~ 1(0). Since ¥° commutes with the G,,-action and since zo is a 
submersion, Y° descends to an action W of G on E, explicitly given by 


Wr([(g, p, v)]) = [ (hg, Pp, v)]. 
By construction, for every g € G, we have 


WU, 0o7 = 10 W,, yo) =e 0 lo. (10.4.13) 
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Finally, since J” is Gm -invariant, and again since zg is a submersion, J* induces 
a smooth mapping J : E > g* by 
Jom =J* ow. (10.4.14) 
Explicitly, one finds 
J ([(g, p. v)]) = Ad*(g)(u+ p+ J" (v)). (10.4.15) 


Lemma 10.4.3 (E ,O, W) is a Hamiltonian G-manifold with equivariant momen- 
tum mapping J. 


Proof In the proof, we use the relations (10.4.12), (10.4.13) and (10.4.14) without 
further notice. First, we show that the action W is symplectic: for g € G, we find 
m9 (Wa) = Wp (119.0) = we ((5@) = 9 (W*@) = Qo = 190. 


Since zo is a submersion, it follows that WO = @. Next, we show that J is a mo- 


mentum mapping for : for A € -g, denote aie Killing vector fields generated by A 
under the actions ¥*, W° and W by, respectively, Ax, A° and A,. According to 
Proposition 6.2.4/2, we have 


A,om=m0 A’, AX oig =H 0 A®. 
Using this, we calculate 
71} (Ay 1@) = Adi m§@ = AQ sihw™ = 8 (AX 10”). 
Since J* is a momentum mapping for ¥%, the right hand side equals —1jdJx° and 


hence —ngdJ, ‘A. Since 7g is a surjective submersion, we conclude Ax o = —dJ. A: 
Finally, we show that J is equivariant: for g € G we find 


JoW,om=JomoW, =J* ooW, =J* oW% om 
Since J* is equivariant, the right hand side equals 


Ad*(g) 0 J* 019 = Ad*(g) 0 J omg. 


Since sg is surjective, this yields equivariance. 


Theorem 10.4.4 (Symplectic Tubular Neighbourhood Theorem) Let (M,a,W) 

be a symplectic G-manifold with proper action and equivariant momentum map- 

ping J. 

1. For every m € M, there exists a G-invariant open neighbourhood U of G-m 
in M and a G-equivariant symplectomorphism x :U — E such that x(m) = 


[(1, 0, 0)]. 
2. Jo x isamomentum mapping for W. If G is connected, then J o x = Jiu. 


As a result, (E ,O, W, J ) yields a local normal form for the Hamiltonian G- 
manifold (M, w, W, J) near the orbit G - m. It will be referred to as a symplectic 
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tubular neighbourhood of G - m. Accordingly, x will be referred to as a tube sym- 
plectomorphism. The representation (10.4.15) of the momentum mapping J so ob- 
tained is called the Marle-Guillemin-Sternberg normal form of J, see [190-192] 
and [117]. 


Proof 1. By the Witt-Artin decomposition (10.2.13), there exists a G,,-equivariant 
injective linear mapping 4: m* @ V —> T,,M onto a vector space complement of 
Tm(G-m) such that @,, is given by (10.2.14). Given A, Theorem 6.4.3 and Remark 
6.5.8 yield a G-equivariant diffeomorphism 


x:U > ECG xg, (m* @V) (10.4.16) 
satisfying x (m) = [(1, 0, 0)] and (6.5.3), which in the present situation reads 
Cale er (1'(A, 0, u)) = (Ax) + AG, u), (10.4.17) 


where  : G x (in* x V) > E is the natural projection. Using (10.4.17), (10.2.14) 
and (10.4.12), as well as Gg); = 0, we calculate 
(Cage) ee (x'(A1, 01, ut), m'(A2, 02, u2)) 
Om ((Ate)m + A(61, U1), (A2x)m + (62, 42) 
= (01, Az) — (02, A1) — (4, [A1, Bi]) + 0" (uy, v2) 


= 11,0,0)) (7 (A1, 01, 41), 1’ (A2, 02, U2), 


where jz = J(m). By invariance, then w and x*@ coincide on G - m and the Equiv- 
ariant Darboux Theorem 8.6.3 yields that U can be shrunk and x can be modified 
so that it becomes symplectic. Finally, we adjust U, m* and V in sucha way that x 
becomes surjective and hence a symplectomorphism. 

2. For A € g, let A, and A, denote the Killing vector fields generated by A under 
the actions W and W, respectively. Using Proposition 6.2.4/2, we calculate 


Agi@ = Ag i(X*0) = X* ((XeAw) 40) = X*(Ay sO) = —x*d 4 = —d(J 0 x) a. 


Since J o x(m) = J ({(1, 0, 0)]) = 4 = J(m) and since the difference of two mo- 
mentum mappings is locally constant, we conclude that J o x = J provided U is 
connected, which can always be achieved if G is connected. 


Exercises 


10.4.1 Write down the proof of Lemma 10.4.1 for the case u = 0. 

10.4.2 Complete the proof of Lemma 10.4.1/1 by showing that the symplectic 
action of G,, on (G x g*, x m*, pion + fo?) is Hamiltonian with 

momentum mapping induced from that of the symplectic action # on 

(G x g*, wo" G) in the way described there. 
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10.5 Singular Symplectic Reduction 


In this section, we generalize the Regular Reduction Theorem 10.3.1 to the case of 
a non-free group action. Here, as we know, several orbit types labelled by conju- 
gacy classes of stabilizers can occur and, consequently, the quotient M,,/G,, is a 
union of strata. In what follows, we will describe the symplectic structure of these 
strata using the method of point reduction. The main tool is the Symplectic Tubular 
Neighbourhood Theorem 10.4.4. We will close this section with a brief comment on 
how the strata fit together to form a stratified symplectic space. Pioneering work in 
this field was done by Arms, Cushman and Gotay, see [10], and by Sjamaar and Ler- 
man, see [275], where the case of the zero-level set was worked out. For a detailed 
discussion of all aspects of the general case, which would go beyond the scope of 
this book, we refer to the book of Ortega and Ratiu, see [232]. There, the reader can 
also find an exhaustive list of references. We also refer to Huebschmann, see [142], 
who has worked out singular reduction for the case of Kahler manifolds. 

Let (M,o,W, J) be a Hamiltonian G-manifold with proper action and equiv- 
ariant momentum mapping. Below, we use the notation introduced in Sect. 6.1. In 
particular, let M=M /G and leta:M—> M denote the natural projection. For a 
given coadjoint orbit G C g*, denote 


Me i= m(J~'(6)). 
For a given orbit type [H], the connected components of the subset 
Mtn OMe 
of orbits of orbit type [H] and momentum type @ will be referred to as the orbit- 
momentum type strata of M or the reduced phase spaces. They will be denoted 


by M,, where the set of labels t is denoted by T. For given t € T and yp being an 
element of the underlying coadjoint orbit, we define 


Mz y= a '(M;z) AM, 


where M,, = J(u) as before. Let t7,, : Mr, > M and m7, : Mr, > M, denote 
the natural inclusion mapping and the natural projection induced by 77, respectively. 
Note that, by equivariance of J, two points in M,,,, are conjugate under G iff they 
are conjugate under G,,. Consequently, zr,,,, means factorization with respect to G,,. 
We also note that G,, need not be connected and, thus, M;,,, need not be connected 
as well. It can be obtained via the G,,-action from one of its connected components 
though. 


Lemma 10.5.1 Let (M,w,W, J) be a Hamiltonian G-manifold with proper ac- 
tion and equivariant momentum mapping, let m € M and denote 4 = J(m). Let 
(E,0,W, J) bea symplectic tubular neighbourhood of the orbit G -m at m. There 
exists a G,,-invariant open neighbourhood E of the orbit G,, - [(1, 0, 0)] such that 


JW NE={[(g,0,v)]€£:g€G,, J”(v) =0}. (10.5.1) 
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Proof By (10.4.15), a point [(g, p, v)] € E belongs to J~! (w) iff 
pt+J¥(v) =Ad*(g7')u— pw. (10.5.2) 


We choose a G,,-invariant scalar product in g* and use this to embed m*, g*, and 
ri into g* as the orthogonal complements of the annihilators m°, 0, and a. Then, 
g;, = m* ® g,, and hence the left hand side of (10.5.2) belongs to the subspace g/,. 
Thus, in order to prove that E exists, it suffices to show that there exists an open 
neighbourhood W of G,, in G, invariant under left translation by G, and under 
right translation by G,», such that for any g € W, the condition Ad*(g7!)u—p € e,, 
implies g € G,,. Indeed, then 


E:=W xg,, x(t* x V) 


is G,,-invariant and satisfies (10.5.1). To prove the existence of W, let pr?: g* > gp, 
denote orthogonal projection with respect to the G,,-invariant scalar product and 
consider the mapping 


f:O.79., f@):=pM—p). 
Using Formula (6.2.3) and ad* (A) € ie for A € g we calculate 


fi, (Anu) = pt°(Asy) = pr°(ad*(A)t) = ad*(A)u = Aap, 
where A, denotes the Killing vector field of the coadjoint action. Hence, Iie is in- 
jective. For dimensional reasons, it is bijective then and the Inverse Mapping Theo- 
rem 1.5.7 yields an open neighbourhood W of jz in @,, where f is injective, that is, 


where f (7) =0 implies n = p for all n € W. Since pr° is G,,-equivariant, W can 
be chosen to be G,,,-invariant. Then, 
W :={aeG:Ad*(a')we W} 


(the preimage of W under the coadjoint orbit mapping of jz) has the desired proper- 
ties. 


Remark 10.5.2 If we extend the twisted product to the case of topological spaces, 
we can rewrite os 5.1) in the form 


J(u) NE = Gy XG, ({0} x (VA (JY)'))). 


Here, (J”)~!(0) is not necessarily a manifold. We note that restriction to E is im- 
portant, because the intersection of the coadjoint orbit of jz with the affine subspace 
p+ (m* ® g*,) may contain points different from jz, so that (10.5.2) may have so- 
lutions with g lying outside the neighbourhood W of G,, constructed in the above 
proof. 


Lemma 10.5.3 Under the assumptions of Lemma 10.5.1 we have 
(J(u) NE), =No,(H) xx ({0} x V"), (10.5.3) 
(F171) 9 BY = Gu xu ({0} x V"), (10.5.4) 


with H = Gy, and V™ denoting the H -invariant elements of V. 
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Proof First, we show that 


((J”) '@)" =v". (10.5.5) 
For that purpose, it suffices to show that J’ (v) = 0 for all elements v € V”. Indeed, 
for any element A of the Lie algebra of H, we have 


1 
(J"(v), A)= 500, Hess,,(Ax)v) = w(v, (Wexpia) nv) = 0, 


ld 
2 dt 
for any v € V". To prove (10.5.3), let [(g, 0, v)] € J~!(u) NE. If Grvg.o,vy) = A, 
then [(ag, 0, v)] = [(g, 0, v)] for all a € H. This means that there exists an element 
b € H such that 


(ag, 0, v) = (gb~', 0, (%),,,v). 


We read off that b = g~'a7'g, that is, g~'Hg C H and hence g € NG,, (7). More- 
over, by compactness of H, then g~!'Hg = H, so that b ranges through all of H 
and hence v is H-invariant. Conversely, if g € NG, (A) and ve Vi, then (10.5.5) 
implies J’ (v) = 0 and hence [(g,0, v)] € J~! 1 E. The inclusion H C Gi(g,0,v)] 
is obvious. If, conversely, a € Gycg.o,v)}, then ag = gb—! for some b € H. Since 
8 €Na, (A), thenae H. 

Equation (10.5.4) follows by a similar argument: if Gi(g,o,v)) = aH a~' for some 
a €G, then for every h € H there exists b € H such that 


(aha~'g, 0,v) = (gb"', 0, (%)), (v)). 


Then, b = g~!ah~'a7'g, so that g~'aHa~'g C H and compactness of H im- 
plies that b ranges through all of H. Hence, uv € Ve, Conversely, by (10.5.5), 
every peut [(g,0, v)] of the right hand side belongs to J— '(w) NE. Moreover, 
gHe'c Gi(g,0,v)|- If, on the other hand, a € Giig,0,v)}, then ag = gb~ ' for some 
b €H and hence a € gHg™! 


Let (E,0,W, J) bea symplectic tubular neighbourhood of G -m with tube sym- 
plectomorphism x : U — E. Denoting U= x (EB), from (10.5.3) and (10.5.4), 
we read off 


My OM, OU =x~'(No, (A) Xx ({0} x V")) = NG, (E)/H x V", 
(10.5.6) 
May OM, OU =x-"(Gy xu ((0} x V4)) = Gy/H x V4. (10.5.7) 
By Lemma 7.4.6, V” C V is a symplectic vector space with symplectic form given 
by the restriction of w” to V". By Formula (10.5.7), V” is a model space for the 


strata of the G-action on E of isotropy type H. This is the key observation for the 
proof of the following theorem. 


Theorem 10.5.4 (Singular Reduction Theorem) Let (M, w, WY, J) be a Hamiltonian 
G-manifold with proper action and equivariant momentum mapping. Let t € T and 
let C be the corresponding coadjoint orbit. For every 1 € CO, 
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1. the subset M;,,, is an embedded submanifold of M, 

2. there exists a unique smooth manifold structure on M, such that the natural 
projection 17, : Mz, — M, is a submersion, 

3. there exists a unique symplectic form @* on M; such that 


n* a" = tt ,@. (10.5.8) 


The smooth structure and the symplectic structure so induced on M, do not de- 
pend on the choice of ww. 


Proof 1. Let w € @ be given. By the Symplectic Tubular Neighbourhood Theo- 
rem 10.4.4, for every m € M,,,,, there exists a symplectic tubular neighbourhood 
(E,@,, J) of G-m and a tube symplectomorphism x : U > E. Moreover, by 
Lemma 10.5.1, there exists an open neighbourhood E of the orbit Gy, - [(4, 0, 0)] 
in E satisfying (10.5.1). Then, U0 := xB) is an open neighbourhood of G, -m 
in M. We show that!* 


Mey NU =x-|(Gy xn ({0} x V")). (10.5.9) 


Consider the set W := x dia, O,v)J:veE Vii, Since W is a connected subset 
of Mty,0M,, which intersects M;,_,, and since the latter consists of connected com- 
ponents of Mry]M M,, we have WC Mz, U. Then, Gy - WC Mr yO U. By 
(10.5.4), Gy» W = Min M,,.00 and hence M,,, U0 = Min) M,, OU. Then, 
(10.5.9) follows from Eq. (10.5.4). Since Gy, x 7 ({0} x Vit) is an embedded sub- 
manifold of E , it follows that Mz, U is an embedded submanifold of M. Finally, 
since M;,,, is obtained via the G,,-action from one of its connected components and 
since along a connected component the dimensions of V and V” cannot change, 
for all m € M;,,,,, the subspaces V" have the same dimension. Then, the assertion 
follows from Remark 1.7.4. 

2. For m € M;,,,, by identifying V with the subset H x ({0} x V) of E, we 
obtain a continuous mapping 

OniV +E; <x ((0} x 0#) 2S Me, SS me. 

Since J is equivariant, G-orbits in M intersect M;,,, in G,,-orbits. Hence, g» is 
injective. Since for an open subset W of V4, the subset x NG, xH ({O} x W)) 
is open in M;,,, and since m;,, is open, @» is also open and hence a homeomor- 
phism onto its image. By composing its inverse with a chosen linear isomorphism 
V4 _, R’, where r = dim(V”), we obtain a local chart kK» on M,. Up to the dif- 
feomorphism 


Gu xx ({0} x V")=G,/Hx Vv", (10.5.10) 


given by (10.5.7), the transition mapping between the charts kj, and Km, is a com- 
position of linear transformations with the restriction of the natural projection to the 


14 As before, for simplicity we denote H = Gyn. 
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second factor of G,,/H x V# toa certain submanifold. Hence, the local charts Km> 
m € M,,,,, define a smooth structure on M,. Since via x and (10.5.10), the natural 
projection 77, : Mz 42> M, corresponds locally to a natural projection in a direct 
product, it is a submersion. 

3. Since 7;,,, is a surjective submersion, any two local 2-forms on M, satisfying 
(10.5.8) must coincide on their common domain. Therefore, it suffices to prove that 
w* exists locally. Thus, let m € M;,,, and consider a symplectic tubular ee 
hood of G - m. In the notation introduced under point 1, we will construct w* on the 
open neighbourhood 717, (Mz. U ) of t7,,,(m) in Mz. Via x and the isomorphism 
(10.5.10), this neighbourhood is identified with vil , the subset Mz, U is iden- 
tified with G,,/H x V" the projection M,, 1 Us Trp (Mr 9 U) corresponds 
to the natural projection pr, : G,/H x V4“ _, V" and the restriction of ur yo to 


May U corresponds to j*@, where 
j:Gy/HxV"—>E, — j([gl,v):=[(g,0, v)]. 


Letwy:G,— G,/H anda :Gxm* x V => E be the natural projections. Writing 
tangent vectors of G,,/H x V# at ([g], v) in the form (1h, © LA, u) with A € gu 
andué€ V" and using (10.4.12), we calculate 


(7"®) Gp, y (TH OL, AI, 1), (7 OL, Ap, u2)) 
= G(g,0,v)) (#' (Ly A1, 0, u1), #/(Li, A, 0, u2)) 
= On (U1, U2) 


= (pr5 Om) cg.) (FH fe) L,Al, u1), (TH ° L,A2, u2)). 


This shows that w* exists on the open subset zt7,,(Mr,y9 U ) of M,, where under 
the above identification of this subset with V” it is given by the restriction of @,, to 
the subspace V” of V. Since, by Lemma 7.4.6, this subspace is symplectic, w* is 
symplectic. 

It remains to show that the smooth structure on M, and the form @* so con- 
structed do not depend on the choice of jz. For every jz € @, there exists g € G such 
that 4. = Ad*(g)j. By restriction, %, induces a diffeomorphism g : M,,; > Mz, 
which projects to the identical mapping id M, of M,. Since Tr, and 7, ;, are sub- 


mersions with respect to the smooth structure on M, induced from M,,, and M, j., 

respectively, it follows that id, is a diffeomorphism. Hence, these smooth struc- 
t 

tures coincide. Moreover, using 77,09 = 7,4 and t7,,09 = Wy ol, 7, we obtain 


Te iO" = UFO: 


It follows that @* coincides with the 2-form induced from M, ;. 


Remark 10.5.5 


1. Let us note the following consequences of the proof of Theorem 10.5.4. 
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(a) Via a symplectic tubular neighbourhood, every m € M,,, induces a local 
symplectomorphism from an open neighbourhood of z(m) in (M. 1, @") onto 
ve", @m). 7 

(b) The natural projection 7,,, : Mz, — Mr, is a locally trivial fibre bundle 
with typical fibre G,,/H, where m € M;,,, is arbitrary but fixed. 

2. The argument of the proof of Theorem 10.5.4 shows that My 1 M,, is a union 
of embedded submanifolds of M and that My HIN Mo is a union of symplectic 
manifolds. In both cases, these manifolds may have different dimensions, cf. 
Remark 6.6.2/1. Since, in practice, the orbit-momentum type strata are hard to 
find directly, the method of choice is to analyse the subsets My, % M,, for several 
combinations of H and yw in order to determine M rH M o and to read off the 
corresponding orbit-momentum type strata afterwards. In this context, let us note 
that, for given H and yz, the subset 7(My M M,,) of M is a union of connected 
components of My HO Mo but need not coincide with the latter. It coincides 
under the condition that every subgroup of G,, which is conjugate to H in G is 
also conjugate to H in G,. 

3. We encourage the reader to write down the above proofs for the special case 
jt = 0. This leads to some structural simplification. 


Now, we are able to discuss singular reduction of a G-invariant Hamiltonian 
system (M, w, h) with equivariant momentum mapping. 


Proposition 10.5.6 Let (M, aw, WY, J) be a Hamiltonian G-manifold with proper ac- 
tion and equivariant momentum mapping and let h € C®(M)®. Let t €T, let G be 
the corresponding coadjoint orbit and let we C. 


1. M,,,, is invariant under the flow of Xj, and Xj restricts to a vector field x,n on 
M,,,, which is tz,,-related to Xp, 


leo ° x," = Xp Olzp- 
2. The Hamiltonian h defines a smooth function h, on M, by 
hy ON py =hotzy,. 
The corresponding Hamiltonian vector field Xj, is ™_,,-related to dae 
To, ° ee = Xp, OM 7, p- 
As a consequence, an invariant Hamiltonian system (M, w, h) induces a uniquely 


determined reduced Hamiltonian system (M_,,w‘,h") for each orbit-momentum 
type stratum t of M. 


Proof By (6.7.3), the flow of X;, leaves invariant the stabilizers. Together with the 
Noether Theorem 10.1.9, this yields invariance of M;,,,. The rest of the proof is 
completely analogous to that of the regular case, cf. Proposition 10.3.3, and is there- 
fore left to the reader. 
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In the remainder of this section we will prove that the singular strata M, can be 
obtained via regular reduction.!° For that purpose, let H be a stabilizer, let Sy be 
a connected component of My and letis,, : &'7 — M denote the natural inclusion 
mapping. Recall from Remark 6.6.2/6 that X’y is an embedded submanifold of M 
acted upon properly and freely by 

Ds, =N*"/H, 
where 
N# := Ngsy (A), G*" :={geG: (LH) C Ly}, 


and that the quotient manifold X77 /I’y,, may be naturally identified with a certain 
orbit type stratum Mz. Let n denote the Lie algebra of NG(H). Since Ng(H)o C 
N*# CNG(A), this is also the Lie algebra of N~“. Recall that the dual mapping 
of the natural projection from n onto the Lie algebra n/b of I’y,, yields a natural 
identification of the dual vector space (n/h)* with the annihilator h° of 6 in aad 


Lemma 10.5.7 Let (M,w, WY, J) be a Hamiltonian G-manifold with proper action 
and equivariant momentum mapping, let H be a stabilizer and let X'y be a con- 
nected component of My. Let j : 1—> g denote the natural inclusion mapping. 


1. (“y, Sy @) is a symplectic submanifold of (M, w). 
2. The action of 'y,, on X’y is symplectic. For every u € J(X’7), the mapping 


JE. Dy on, J##(m):= j! o (J ots, (m) — 1), 
takes values in h° = (n/h)* and defines a momentum mapping'" for this action. 
Proof 1. According to Remark 6.6.2/6, Proposition 6.6.1/1 yields T,X'q = 
(T,M jH for all m € X’'y. Hence, the assertion follows from Proposition 7.4.6. 
2. That the action is symplectic is obvious. For the proof of the assertions about 


J4-“, we ignore the natural inclusion mapping h — n. To see that the mapping 
J takes values in °, let A € 6. Using the obvious identity 


(FP eT) Ts (10.5.11) 
and the fact that j(A), vanishes on 277, we obtain 
H, . 
dJ, BM = d(Jj(a) ° lon) = —U5,, (j(A)x 40) = 0. 
Hence, rie is constant on X77. Since uw € J(2'y), there exists mo € X'y such that 
J 05, (mo) =u. Then, J4”“ (mo) = 0 and hence 


(J#(m), A) = J" (Gn) =0 


‘The authors of [232] call this statement Sjamaar’s principle, because it first appeared in the thesis 
of Sjamaar [274] in the context of compact group actions and zero level reduction. 


!©Note that h° coincides with the subspace of H-invariant elements of the annihilator of h in g. 


'7Which need not be equivariant, see Remark 10.5.8/2. 
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for all m € X’y. Thus, J takes values in ie indeed. To see that J“ is a mo- 
mentum mapping, let A € n. First, we observe that the Killing vector field on Yq 
generated by [A] € n/b coincides with the Killing vector field Ax ” generated by A 
under the action of NX” on yy. Second, we observe that j is the Lie algebra ho- 
momorphism induced by the natural inclusion mapping N*” —> G. Since the latter, 
together with : 5,,, establishes a morphism of Lie group actions, Proposition 6.2.4/1 
implies 
ts,, OAZ# = j(A)eo j. 
Using this and (10.5.11), we obtain 


[Ale + (¢%,,0) = AZ# 3 ('$,,0) =Ch,, (i (A)s 40) = —dJj(a) ots) = dg. 


Remark 10.5.8 
1. By definition of J“, w € J(Z 77), we have 
(J) 10) = SHA My. (10.5.12) 


2. The momentum mapping J is not equivariant. Using the equivariance of J 
and j, we calculate the 1-cocycle defined by (10.1.6). Omitting the natural in- 
clusion mapping ty, : “y — M, for g€ N*# and m € Dy we obtain 


o([g]) = 7%" (W,(m)) — Ad*([g]) (J“(m)) 
= i (J(%e(m)) — w) — Ad*(Lg]) (77 (Fm) = 2) 
= j'(Ad*(g)u — p). 


By Lemma 10.5.7 and Remark 10.5.8/2, if one modifies the coadjoint action of 
Ty, on (n/b)* = h° in the sense of (10.1.11), the momentum mapping J” will 
be equivariant with respect to this modified action. Note that the stabilizer of the 
origin of (n/h)* under this modified action coincides with 


(Pry)u = {[gl€ Us, :Ad*(g)u =u} =N2"/H, | NY := Gy ON*4, 
Then, the assumptions entering the Regular Reduction Theorem 10.3.1 are fulfilled. 


In particular, (J 4.4)~1(Q) is an embedded submanifold of Sy and the reduced 
phase space 


(JH) (0)/(Loy)u = (2H M,,)/Nx# (10.5.13) 


catries a unique symplectic structure. In the remainder of this section, we will relate 
this symplectic manifold to the symplectic manifolds M, constructed before. 

Let X’7,, be a connected component of Xy M M,, let BS, H,w denote the corre- 
sponding connected component of the reduced phase space (X77 9 M,,)/ Nz and 
let TH yy? Yap > Lu, yu denote the natural projection. Since 2'y is a connected 
component of My, 2'y,, is also a connected component of My M M,,. There- 
fore, X'y,, C Mz, for some orbit-momentum type stratum t € T. Since, by con- 
struction, two points of X'y,,, are conjugate under G iff they are conjugate under 
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Nz , the natural inclusion mapping 4: X’y7,,, — M;,,, induces an injective mapping 
gy: XH, — M, by the following commutative diagram: 


DH nel Mr 
“1 fre (10.5.14) 


Dep — Me. 
Proposition 10.5.9 The mapping ¢ defined by (10.5.14) is a symplectomorphism. 
For the proof we need 


Lemma 10.5.10 Let m ¢ My M,, and let (E, o,W, J) be a symplectic tubular 
neighbourhood of the orbit G-m at m with tube symplectomorphism x :U > E. 
Let E be the open G,-invariant neighbourhood of G,, - [((1,0,0)] provided by 


Lemma 10.5.1 and denote U := yal (E). Assume that V™ is connected. 


1. Ifme Xp, then Lay U= xl ONg# x H {0} x V")). 
2. If Lap AU FS, then (Gy - La p)AU =Mz NU. 


Proof of Lemma 10.5.10 1. Consider the set W := x~!({[(4,0, v)] : v € V™}). 
Since W is a connected subset of MyM M,, which intersects X'y,,, and since the 
latter is a connected component of My M M,, we have WC YA A U. Since 
Yn. NU is invariant under Nv , then 

NEP Wax (NY? xe (Ox V")) crn’. (10.5.15) 
By (10.5.6), and since two points of 2'7,,, are conjugate under G iff they are conju- 
gate under N aH , we have in fact equality in (10.5.15). 

2. By point 1, if Yq, U # @, then it contains a point x~!({(g, 0, v)]) with 
ge Nz and v € V". Let W denote the subset defined under point |. By the 
same argument, applied to W,(W), we find that ¥%(W) C Yay U. Then, since 
Nz# CGy, 


Gu-WC(Gyu- Day). 


By (10.5.7), G,,- W coincides with M;,,,U. Since (Gy. - 54,4) NU is contained 
in M70 U , the assertion follows. 


Proof of Proposition 10.5.9 For every m € X'y,,, we find a symplectic tubular 
neighbourhood (E, @,W, J) of the orbit G - m at m with tube symplectomorphism 
X:U > E. Using point | of Lemma 10.5.10 and the argument of the proof of 
point 3 of the Singular Reduction Theorem 10.5.4, one can show that x induces 
a local symplectomorphism from an open neighbourhood of z7,,,(m) onto VA. 
Combining this with the corresponding result for M;, stated in Remark 10.5.5/1, 
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we find that ¢ is given locally by the identical mapping of V". Hence, it is a sym- 
plectomorphism from 2'y,,, onto an open subset of M,. It remains to show that 
y is surjective. For that purpose, for every m € M,,,, we choose a tube diffeo- 


morphism Xm: Um > Em and an open subset Ey according to Lemma 10.5.1 and 
denote i= = Xn (Em): Let B denote the subset of Mr. ,u Of points which are not 


G,,-conjugate to a point in My M M,,. We can cover M, by the (possibly empty) 


open subsets ext M,,,) with m € (My M,,,,) UB. Since the fibres of 77, 
are the G,,-orbits in M;,,, we have 


Ir (Um, OMe.) V2, p(Om, VMz,u) = 
whenever m, € My 1 M,,,, and m2 € B. In particular, since Yy,, C My 1 Mz yp, 
we have o(ZHw) a) fag (Us 1 M;,,) = © for all m2 € B. Combining this with 
Lemma 10.5.10/2, we obtain that M, \ (> H,u) coincides with the union of the 


open subsets zr, Prise 1 M,,,,) over all points m in ((My 1 M, mw) \ SH) UB. 


This shows that the image of @ is also closed in M,. Since M, is connected, it 
follows that @ is surjective. 


Remark 10.5.11 


1. Proposition 10.5.9 implies that for every connected component 2'y7,, of 
MyM, there exists a unique orbit-momentum type stratum t € T, with un- 
derlying coadjoint orbit @ = @,,, such that 


My = Gu : DH: (10.5.16) 


This observation clarifies the relation between our presentation of singular point 
reduction and that in [232]. There, the reduction procedure is applied to the sub- 
sets 


(Gu XH)OM, =Gy-(2HOM,), 


where, as before, »'y7 is a connected component of M7. In view of (10.5.16), 
Proposition 10.5.9 implies that the reduced phase space obtained in [232] is sym- 
plectomorphic to a union of certain orbit-momentum type strata M, contained in 
Mi HIN Mo. Itis symplectomorphic to M fay Mo under the condition that ev- 
ery subgroup of G,, which is conjugate to H in G is also conjugate to H in G,, 
cf. Remark 10.5.5/2. 

2. As in the regular case, see Proposition 10.3.7, one can perform singular orbit 
reduction, see [33] for first steps in this direction and [232], Sect. 8.4, for an ex- 
haustive discussion. Ortega and Ratiu show that, provided one endows J~!(@, nw) 
with the appropriate topology,!® singular orbit reduction yields a stratified sym- 
plectic space and that the latter is homeomorphic to the one obtained by point 
reduction. 


'8Here, a subtlety occurs: whereas J—! (2) can be simply viewed as a topological subspace of M, 
Jo! (G,,) has to be endowed with the initial topology induced by the mapping J. If yw is regular 
and G acts freely on M, this is consistent with endowing J~! (@,,) with the initial submanifold 
structure provided by the Transversal Mapping Theorem 1.8.2. 
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3. We comment on the structure of the reduced phase space as a whole. For details 

we refer to [232] and [238], see also [275]. 

For a given coadjoint orbit @, the subset M 6 of M can be identified with the 
topological quotient M,,/G,, for every 4 € @. It decomposes as 

Mo = U M;, 
T 

where the union is over all orbit-momentum type strata whose underlying coad- 
joint orbit coincides with @. This decomposition has certain topological and ge- 
ometric properties which are summarized in the notion of Whitney stratification. 
In particular, M o has the local structure of a cone space. Accordingly, the sym- 
plectic manifolds M, are usually referred to as the symplectic strata of Me CO. 

Due to the G-invariance of w, the subset C°(M)° is a Poisson subalgebra of 
C™(M). As a consequence of Noether’s Theorem, the subset 


1={f<eC™(M)°: fim, =0} 


is a Poisson ideal of C°(M)°. Thus, we can form the quotient to obtain a Pois- 
son algebra of continuous functions on Mg = M,,/G,: 


C°(Mg) := C®(M)E /T. 
This way, M o is equipped with the structure of a Poisson space.!? By construc- 


tion, the inclusion mappings M, — Mg are Poisson space morphisms, that is, 
the natural mappings 


C°(Mg) > C* (Mz) 
induced by pull-back are Poisson algebra homomorphisms. The whole struc- 


ture is summarized in the statement that Mg, together with the Poisson algebra 
C™(M@) and the family of strata {M_}, constitutes a stratified symplectic space. 
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In this section, we perform symplectic reduction for the following examples: the 
geodesic flow on S°, the Kepler problem, the Euler top and the spherical pendulum. 
Apart from the geodesic flow, these examples will be taken up again in Sect. 10.8 
and in Chap. 11. At this point, we would like to draw the attention of the reader to 
the book of Cushman and Bates [69], where the main focus is on examples of the 
above type, with the theoretical foundations being given in appendices. 

Recall that the Euclidean scalar product of two vectors x, y € R” is denoted by 
x- y and that the corresponding norm is denoted by ||x||. In this notation, the canon- 
ical 1-form 6 on TR” = T*R"” takes the form 


O(x, y) =y- dx. (10.6.1) 


!9 4 Poisson space is a topological space together with a Poisson algebra of continuous functions. 
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Example 10.6.1 (Geodesic flow on S*) Let M = TR* = R* x R*. The natural action 
w of SO(4) on R4 and the lift Y of this action to TR* are given by 

wv :SO(4) x R* > R*, w(a, X) = ax. 

W:SO(4) x TR* > TR’, W (a, (x, y)) = (ax, ay). 


By Proposition 8.3.6, the canonical 1-form @ is invariant under W. The Killing vec- 
tor field generated by A € so(4) under this action is given by 


d 
Ax(%, Y) = ary Wexp(rA) (X, y) = (Ax, Ay). (10.6.2) 
0 
Hence, for the equivariant momentum mapping (10.1.23), we obtain 
J:TR* > s0(4), (Je, y), A) = (Ax) -y, (10.6.3) 


that is, J4(x, y) = (Ax) - y. Thus, the tuple (TR*, dé, W, J) defines a Hamiltonian 
SO(4)-manifold. Let us consider the invariant Hamiltonian 


eT ee 2 
A(x, y) = 5 llxll"lyll” — 5@- yy’. (10.6.4) 
A brief computation (Exercise 10.6.1) yields the Hamilton equations 
X= [IxI’y — (&-y)x, (10.6.5) 
y=—llylPx+ (x yy, (10.6.6) 


from which we read off that the projection to the configuration space R* of the 
integral curve with initial condition xo, yo lies on the sphere of radius ||xo||. Thus, 
the dynamics can be restricted to the submanifold 


TS? = {(x, y) € TR*: IIx? =1, x-y =O}, (10.6.7) 


of TIR*, cf. Remark 2.1.4/2. One can check that via the natural inclusion mapping, 
0 pulls back to the canonical 1-form 6 on TS?, which is inherited from T*S? via 
the isomorphism defined by the Euclidean metric. Thus, by restriction, the Hamil- 
tonian system (TR*, dé, H) induces a Hamiltonian system (TS?, dé, H ), where 
H= Ayrs3- From (10.6.4)-(10.6.6) we read off 


= i 6 
(xy) = 5llyll 
and the Hamilton equations 
k=y, 9 y=-llyll’x. (10.6.8) 


Since ||y||? = 2A is a constant of motion, this system of equations can be trivially 
integrated: 


EA z cos(t./2h) Th | fa 
y(t) | —/2h sin(t/2h) cos(t/2h) Yo ]’ 


where h = H (Xo, yo). Every integral curve with h > 0 is periodic and its projection 
to the configuration space S~ is a great circle whose position is defined by the initial 


534 10 Symmetries 


conditions x9, yo. The Hamiltonian vector field Xj, is called the geodesic vector 
field on S? and its flow is called the geodesic flow on S?. We leave it to the reader 
to check that this is indeed the geodesic flow on S? with respect to the Riemannian 
structure induced from R* as discussed in Example 9.2.1. 

Since the submanifold TS? given by (10.6.7) is invariant under the action Y, the 
latter restricts to an action 


W: SO(4) x TS? = TS?, W(a, (x, y)) = (ax, ay), 
and the equivariant momentum mapping (10.6.3) restricts to an equivariant mo- 
mentum mapping J for this action. Thus, (TS?, dé, %, J) is a Hamiltonian SO(4)- 


manifold. Let us study its structure. For that purpose, from Remark 5.4.11/2 we 
recall that the Killing form of s0(4) induces an equivariant vector space isomor- 


~ 


phism so0(4)* = so(4). In turn, we can identify the vector space underlying so(4) 
with /\? R* via the mapping 


AP Atle; A@j 
(summation convention), where e; denote the standard basis vectors of R*. Since 
ee a 
(J(% y), A) = Aijx’y! = 5 (2/9! — xy) Aaj, 


under these two identifications we find 


J(x,y)=XAy. (10.6.9) 
It follows that J satisfies the Pliicker equation 
JAJ=0. (10.6.10) 
In coordinates, it reads 
Ji2J34 + S13 J42 + J14J23 = 0. (10.6.11) 


In what follows, we use the natural scalar product 
(UAV, xX Ay) := (u-x)(V-y) — U-y)(V- x) 
in ce R*. Denoting the corresponding norm by || - ||, we obtain 
| 7x, y) |? = xi llyll? — (xy)? 
and hence 
| ie, |? = ly? =24y). (10.6.12) 


Let us determine the level set J~! (A) for A € so(4). In the case A = 0, according 
to (10.6.12), , a (0) coincides with the zero section so of TS?. From the Hamilton 
equations (10.6.8) we read off that in this case the integral curves are points, that is, 
the dynamics is trivial. In the case A 4 0, we can restrict J to the subbundle 


Tt S? = TS? \ so(S?). 


The restriction will be denoted by J*. By the Noether Theorem 10.1.9, the level set 
(J tyr! (A) is a union of integral curves of the geodesic vector field X . We show 
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that it consists in fact of a single integral curve of X ;,. This implies, in particular, 
that the image of J* may be interpreted as the space of integral curves. According 
to (10.6.10), 


imJ+ ={A€s0(4):AAA=0, A#0} (10.6.13) 

(Exercise 10.6.2). This is a 5-dimensional submanifold of s0(4). We show that J+ 
is a submersion. Let (x, y) € TtS>. Since 

dim Tx, y)(T*S*) = dim ker(J*) 


/ 


x.y) + dimim(J*) 


f 

(x,y)’ 

it is enough to show dimker(J*)(, y =i. Let X= (&%& y) € Tay) (T*S3) and let 
t +> (x(t), y(t)) be a curve through (x, y) representing X. Then, 


d . ; 
(Fe) — Gu A y(t) =XAY+XAY. 


The condition (JT) (X) = 0is fulfilled for multiples of the geodesic vector field, 


/ 
(x,y) 
X =s(y, —llyll’x), seR, 
and one can show that these are the only solutions (Exercise 10.6.3). Thus, J+ is 
a submersion, indeed. Now, the Level Set Theorem 1.8.3 yields that for every A € 
im J*, the level set (J+)~!(A) is a one-dimensional submanifold of TS? satisfying 
-1 
Texyy(J*) (A) =ker(J*) 


fs 

(x,y) " 

Since ker(J Ye y) is spanned by X,(x,y), every connected component of 

(J+)—!(A) coincides with an integral curve of X 4. Below, we will see that the 

level sets of J* are in fact 1-spheres, cf. Remark 10.6.2. This yields the assertion. 
To summarize, due to the high degree of symmetry in this example, the preim- 

ages of the momentum mapping already yield the integral curves, so that a further 

symplectic reduction is not necessary. 


Remark 10.6.2 We show that im J* is foliated by the generic orbits of the coadjoint 
representation. This way, we will be able to describe the geometry of the energy 
surfaces. We use the isomorphism 


50(4) = su(2) @ su(2) 


of Example 5.2.15/5 to represent J* by a pair (K, /) of su(2)-valued mappings on 
TS?. According to (5.2.11), under the identification R*+ = H of Example 5.1.11 and 
su(2) = sp(1) of Example 5.2.8, J", K and J are related by 

J™q=Kq-ql (10.6.14) 


for all q ¢ H. Decomposing K =)"; Kj i andi =)°; LAr in the basis of Exam- 
ple 5.2.8, we read off 


1 1 1 

Ki=5 (x3 + Jia): Kr = 5 (J3i + J), K3=5(in + Jaq), 
1 1 1 

N= 5 (4x3 — Jia), I= 5(J3i — Joa)» I= 5(Ji5 — Jza)- 
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Using 
|KIP=K74+K34+K2, t= +b4+R 
one finds that in terms of K and J, the Pliicker equation (10.6.10) takes the form 
I? = (ZIP (10.6.15) 
and that the identity (10.6.12) reads 


1 
| Kel" = Gliyl? = 7, |? (10.6.16) 


(Exercise 10.6.4). Thus, the image im J* can be characterized as follows: 

imJ* = {(A, B) €su(2) x su(2): ||Al] = || Bl 40}. 
According to (8.4.8), for every (A, B) € im Jt the coadjoint orbit O(4, 8) is diffeo- 
morphic to Si All X Si Al: From (10.6.16) we read off that the preimages of these 


orbits under J+ have the topology of S* x S* and that they coincide with the energy 
level sets in T+S*. For every fixed energy level 


1 
h= silyl? =21Al? =211BIl" 40, 


J* yields a locally trivial fibre bundle 
+. -lypy ~ By 2 2 2 
J, i: H (hy=S° x SAI AI — Syay X Syay- (10.6.17) 


The connected components of the fibres of this bundle are the integral curves of 
the geodesic vector field corresponding to the energy level 4 and the base manifold 
Si All Si All is the space of integral curves of energy h. 

To show that the fibres are in fact 1-spheres, and thus coincide with the integral 
curves of Xj, we study this bundle using the original parameterization by pairs 
(x, y) of orthogonal vectors. For fixed y and for given x A y, the vector x runs 
exactly once through the great circle obtained by the intersection of S* with the 
2-dimensional surface defined by the 2-vector x A y. We conclude that, as a fibre 
bundle, (10.6.17) is isomorphic to the Stiefel bundle Sp(2, 4) > Gr (2, 4), where 


Sr(2, 4) = SO(4)/SO(2) = S? x S? 
is the space of orthonormal 2-frames in IR4 (the Stiefel manifold) and 
Gr (2, 4) = SO(4)/(SO(2) x SO(2)) = S? x $? 


denotes the space of oriented 2-dimensional surfaces in R* (the 2-fold covering 
space of the Graf{mann manifold Gp(2, 4)), cf. Examples 5.7.5 and 5.7.6. For the 
proof, see Exercise 10.6.5. 


Example 10.6.3 (Kepler problem) To start with, we summarize some well-known 
facts from classical mechanics. The phase space of the Kepler problem is 


M = T*(R° \ {0}) = (R? \ {0}) x R?. 
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In the standard bundle coordinates (q, p), the symplectic form is given by 
wo =dp; Adq', 
and the Hamiltonian is 


pi 
HGP) =~ 9 q= ll. 


Here, the reduced mass and the coupling constant have been put equal to 1. With 
this choice, we consider the case of an attractive force. The Hamiltonian vector field 
generated by H is 


i 


q 
X= Pidgi a qari 


The Hamiltonian H is invariant under the lift to M of the natural action of SO(3) 
on R?. From Example 10.1.23 we know that under the identifications s0(3)* = s0(3) 
induced by the Killing form and s0(3) = R? given by (5.2.6), the associated equiv- 
ariant momentum mapping J: M — so0(3)* coincides with angular momentum, 


J(q.p)=q x p=L. 


Thus, we have a left Hamiltonian SO(3)-manifold with invariant Hamiltonian H. 
Let us discuss symplectic reduction for this theory. Note that the coadjoint orbits of 
SO(3) are labelled by / = ||Lo||. We have to distinguish between two cases. 


(a) The case / > 0. Here, Lo is a regular value of J and the level set 
J~'(Lo) = {(4, p) € M:q x p=Lo} 


is an embedded submanifold of M, acted upon properly and freely by the stabi- 
lizer SO(3),, = SO(2). Thus, the Regular Reduction Theorem applies, yielding 
the reduced phase space 


M, = J~'(Lo)/SOQB)15- 


To describe M explicitly, we note that the constant of motion Lo reduces the dy- 
namics to the plane orthogonal to Lo. Without loss of generality, we can assume 
Lo = (0, 0,1). Then, this plane coincides with the q'-q?-plane and J~!(Lo) is 
contained in the phase space 


M = T*(R? \ {0}) = (Ry x SO(2)) x R’. 
Since q # 0, we can use polar coordinates (q, @) in the g!-q?-plane. We denote 


Pq = Picosg + p2sing, Pe = —49Pi sing + gp2 cos 


and observe that (g!, q7, pi, p2) (4.9, Pq» Po) 18 acanonical transformation 
of M. In terms of the new coordinates, we have 


J~! (Lo) = {(4. 9, Pq, Po) € M: pp =1} = (Ry x SO(2)) xR 
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and 
ey Les. on ae. 
Nhe) = 5) Pa T a2) 


Moreover, the action of SO(3)z, is given by +> @ +a and the functions q, 
Pq, Po and Hy ;-1”y,,) are invariant under this action. Thus, the reduced phase 
space M is isomorphic to IR; x IR endowed with the symplectic form dp, A dq 
and the reduced Hamiltonian is given by 


H= =(03 + ) Zs (10.6.18) 
oN ed, ad 
As a result, we arrive at a one-dimensional problem with effective potential 
U(r) = cia _ = 
2q° 


From classical mechanics we know that the orbits in configuration space are 
ellipses for h < 0, hyperbolas for h > 0 and parabolas for h = 0. Below, we will 
use the hidden symmetry provided by the Lenz-Runge vector to prove these 
facts by purely algebraic arguments. 

The case / = 0, that is, Lo = 0. This is not a regular value of J. The stabilizer is 
SO(3)9 = SO(3). We find 

J-'(0) = {@.p) € M:p=aq, a € R} = (R*\ (0}) xR, 

that is, J~'(0) is a trivial line bundle over the configuration space R3 \ {0}. 
Let mo = (qo, @qo) € J~!(0) be achosen point. Its stabilizer SO(3)ing = SO(2) 


consists of the rotations about the axis defined by qo. The connected component 
» of M08) ng containing mg is 


X= {(Aqo, Aago)€ M:a,A€R, A>0}=R, xR. (10.6.19) 


We observe that SO(3) - © = J~!(0). Hence, we obtain © MN J~!(0) = , that 
is, the intersection YN J —1() consists of a single connected component. The 
subset M{s0)ng] 1 Mo, of the orbit space M = M/SO(3) consists of a sin- 


gle orbit-momentum type stratum Mo. The corresponding submanifold M;,,, is 
given by SO(3) - © = J~!(0). Thus, 

Mo = J~'(0)/SO@). 
which may be identified with 2’. Under this identification, the reduction of the 
canonical |-form yields 


i Pe a) 
Pilg Gg Ng gd. Paw, 
and the reduced Hamiltonian reads 
2 
P 1 
H=——-. (10.6.20) 
2 4 
Obviously, the orbits are located on rays in R?. For h <0 they are bounded 
from above by the value g = — i: 
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Now, let us recall that there are three additional constants of motion in the Kepler 
problem, given by the Lenz-Runge vector 


R=pxL—4, (10.6.21) 
q 


which fulfils 
L-R=0. (10.6.22) 


This fact enhances the symmetry of the problem. We provide the discussion for the 
case h <Q. Thus, in what follows, let &’_ C M be the open subset of points with 
negative energy. We define 


1 1 1 1 
=-3(L- =a"). I= -3(L+ 5%): (10.6.23) 


For the Poisson brackets, we find 


{Ki,Kj}=eij*Kr, (h.dj=sey*hk, {Ki 1j}=0. (10.6.24) 


We read off that {K;} and {J;} constitute bases in the Lie algebra su(2). Therefore, 
via the isomorphism s0(4) = su(2) x su(2) used before, together they constitute a 
basis 


ej = Kj, e4i:=];, i=1,2,3, 
of the Lie algebra s0(4).”° It is easy to see that the linear mapping 
w:so0(4) > X(M), we) = Xe, (10.6.25) 


defines a right symplectic action of so0(4) on M. Since, by construction, every vector 
field w(A) with A € so(4) is Hamiltonian, Proposition 10.1.3 implies that y is 
Hamiltonian with momentum mapping 


J:M-— s0(4)*, J(m) := (mje, 


where e*! denotes the dual basis in s0(4)*. Thus, (M,@, w, J) is a right Hamilto- 
nian s0(4)-manifold. If we identify s0(4)* = s0(4) via e; + e*, we obtain 


JoSe,. Rp Shae, (10.6.26) 


and, consequently, 


k k 
{Je; Je;} = {ei, e;} = Ci jek = Cj Sex = Toke = Jtej,e;}> 


k 


where c; are the structure constants of s0(4) with respect to the basis {e;}. Thus, 


the system is strongly Hamiltonian. By (10.6.22) and (10.6.23), 
WIP = IP. (10.6.27) 


20More precisely, we have faithful representations of these Lie algebras in the Poisson algebra of 
smooth functions on phase space. 
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An explicit calculation (Exercise 10.6.6) yields 


1K =—— =P (10.6.28) 
OH ‘ 6. 
Thus, as for the geodesic flow on S°, the image of J is given by the 5-dimensional 
submanifold 


im J = {(A, B) € su(2) x su(2): |All = ||Bll £0} 


of s0(4), the coadjoint orbits fulfil O43) = Stal x Stal and their preimages under 
J coincide with the energy level sets defined by (10.6.28), cf. Remark 10.6.2. As in 
that example, one can show that J: X’_ — im J is a submersion and that the level 
sets coincide with the integral curves of the Hamiltonian vector field. This means 
that the values of the constants of motion L and R uniquely determine the integral 
curves. As before, one has to distinguish between the following two cases. 


(a) The case / 4 0. Here, the integral curve in »'_ projects to an oriented ellipse in 
the configuration space R? \ {0}. To see this, recall that the motion takes place in 
the plane orthogonal to the angular momentum vector L. Due to (10.6.22), the 
Lenz-Runge vector R defines an axis in this plane. Using (10.6.21) and denoting 
€ := ||R||, one finds 


6? =2hAl? +1, Reger =<: (10.6.29) 
Let ¢ be the angle between R and q. Then, (10.6.29) implies 
R-q=eqcos¢=l’ —q. 
It follows that 


P 


q (10.6.30) 


~ ecos¢+1 


with ¢ being given by (10.6.29). This is the polar equation of an ellipse in the 
plane orthogonal to L. For the semimajor and semiminor axes we obtain 


_ oF ll oe dl 

~ [—e2 —2h’ Mae wah 
respectively. The direction of the semimajor axis is determined by R. 
The case / = 0. Here, the orbits in the configuration space R? \ {0} are half-open 


intervals 
1 1 
{(se ct 2(n+ Je) iS (0 |}: 
Ss h 


where e is an arbitrary vector of unit length in R*. Note that for the initial 
conditions (— ie, 0), the integral curve falls into the origin after the finite time 


a (10.6.31) 


(b 


wm 


jee 
=5- 


Niw 


This makes explicit that the Hamiltonian vector field of the Kepler problem is 
not complete. 
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Remark 10.6.4 


1. For h > 0, one can solve the problem by deriving commutation relations analo- 
gous to (10.6.24). This leads to the Lie algebra s0(3, 1). For h = 0, the expression 
rT does not make sense. In this case, one has to use L; and R; as generators, 


which leads to the Lie algebra of the group of Euclidean motions SO(3) x R? 
(Exercise 10.6.7). 

For the history of the Lenz-Runge vector and the resulting symmetries, see 
[108, 109]. 

2. In 1970, Moser found a deep relation between the Kepler problem and the 
geodesic flow on S?. The latter provides a certain regularization of the Kepler 
problem [222]. 

In the above notation, we consider the following extension of the stereo- 
graphic projection to the tangent bundle of S*: 


®: TTS) — TR3, (x, y) > (u,v), (10.6.32) 


defined by 


Xx 
ue = —_, vg = (1 — xa) yg + yaxg, k= 1,2,3. 
1—x4 


Here, S? := S* \ (0,0, 0, 1) denotes the 3-sphere with the north pole removed 
(the punctured sphere). One can check that ® is a symplectomorphism and that 
the pull-back F = (@~')*H of the Hamiltonian by &~! is given by 


1 2 
F(u, v) = g liv (ilu? + 1) 


(Exercise 10.6.8). The corresponding Hamilton equations read 


du dv 
—=V,F, —=-V,F. 
ds 
In what follows, we restrict ourselves to the level set defined by A= 5. Then, 
every transformation F + G(F) of the Hamiltonian with the property G’ ) =1 
yields a flow whose restriction to the level set coincides with the flow generated 
by F’. We choose 


1 
G(u, v) := /2F(u, v) —1= 5livii(iiuil? +1)-1. 


For the Hamiltonian G, the Hamilton equations, restricted to the level set, have 
the same form as the equations defined by F and the level set is now given by 
G = 0. Let us define a new flow parameter, s +> f(s), by 


— =v]. 
1s IVI 


This yields a scaling of the Hamiltonian vector field XG by ||v||~! and induces 
the following transformation of the Hamilton equations: 


du -1 dv =] 
io Weg tes VuGs 
Talvin'wG,  S=-vie'Vu 
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We set 
-1 ae 2 -1 
K(u, v) := ||vl|- Gu, v) — = = s|lull~ — |Ilvll. 
2 2 
On the level set, we have K = —5 and the Hamilton equations for K have the 


canonical form. If we now perform the canonical transformation 


(u, v) + (q(u, v), p(u, v)) := (—v, u), 


we obtain the Hamiltonian of the Kepler problem: 
1, 1 

K(q,P) = =p — -. 

2 q 


Finally, let us study the behaviour of the flows in the neighbourhood of the north 
pole (0, 0, 0, 1): For x4 — 1, we have ||u|| — ov, ||v|| > 0 and hence 


IIpIl> oo, q—>0. 


In this limit, we obtain the singular integral curves of the Kepler problem. More 
precisely, for K = -5 and L = 0 we obtain the integral curves 


| 2; 
{(se —1I+ =e) IS E (21) 


Under the above diffeomorphism, running through the orbit of the Kepler prob- 
lem from (e, 0) to the singularity (0, oo) corresponds to running through a great 
circle from the south pole to the north pole. Moving on the same great circle 
back to the south pole corresponds to moving through the Kepler orbit back to 
the starting point. Instead of falling into the singularity, the moving particle is 
getting reflected. In this sense, dynamics has been regularized. Mathematically 
speaking, the space of momenta has been compactified. 

Let us summarize. Every energy level set corresponding to a negative value of 
h is diffeomorphically mapped by the above discussed transformation onto the 
unit sphere bundle over the 3-sphere with the north pole removed (the pointed 
3-sphere). After an appropriate reparameterization of the time variable, this dif- 
feomorphism maps the flow of the Kepler problem to the geodesic flow of the 
pointed 3-sphere. This way, the singular Kepler orbits are mapped to regular or- 
bits of the geodesic flow through the north pole. 

The above ideas have attracted much attention. There is a whole bunch of 
related regularization procedures, see e.g. [67, 175, 176, 185] and [300]. The ar- 
ticle of Vivarelli [300] contains an overview with a large number of references. 
We also refer to Cushman and Bates [69], where some regularization methods 
are contained in the form of exercises to Chap. II, and to Guillemin and Stern- 
berg [118], where in particular the relation to the conformal group is discussed 
in detail. 


Example 10.6.5 (The Euler top) We use the results of Examples 8.4.5/3, 9.2.1 
and 9.2.2. We consider a top, that is, a rigid body, which is fixed at one distin- 
guished point. For the description of the dynamics of rigid bodies one uses two 
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reference frames: a fixed inertial frame in 3-dimensional Euclidean space, given by 
an orthonormal frame {nj, nz, n3}, and a non-inertial, so called body frame, which is 
fixed to the body, described by an orthonormal frame {e1, e2, e3}. The correspond- 
ing coordinates are called space and body coordinates, respectively. If we put the 
origins of both reference frames into the distinguished point, then the position of 
the body is uniquely determined by a rotation matrix a, which transforms the frame 
{n;, M2, n3} into (e1, e2, e3), that is, 


(e1, €2, €3) = (anj, an, an3), 
or 
ei = a] inj. 


Thus, the configuration space of a top is the group manifold of the rotation group 
SO(3) and the phase space is T*SO(3) = SO(3) x 50(3)*, where we use the triv- 
ialization by left translation, cf. (8.3.6). That is, the top belongs to the class dis- 
cussed in Example 9.2.2. As in the previous example, we use the identifications 
s0(3)* = s0(3) = R? and the corresponding identifications of the adjoint and the 
coadjoint representations of SO(3), cf. (5.2.6) and Remark 5.4.11/2. For an element 
x € R? we shall write x when viewed as an element of s0(3) or of s0(3)*. 

In what follows, we will rather use the body frame. For every curve t +> a(f) in 
the configuration space and every f, the tangent vector a(t) defines an element 


w(t) =a a(t) 


of the Lie algebra. The corresponding vector @ in R? is the vector of angular veloc- 
ity. The kinetic energy of a rigid body is 


T= 5° - (Ow), 


with © denoting the inertia tensor. Here, we restrict ourselves to the Euler top, 
that is, we assume that there are no external forces acting on the body. Then, the 
generalized momentum coincides with the vector of angular momentum, 


and the Hamiltonian is obtained from T by a Legendre transformation: 
H(a,L) = (OL) vie (10.6.33) 
Under the above identifications, it is given by the function 
H:SO(3) x so(3) > R, H(a, L)= JO, L), (10.6.34) 


where ©~! is the constant, symmetric, positive-definite covariant tensor of rank 2 
defined by 


@-'(L, L):=(O"L)-L. 
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The tensor © constitutes a Riemannian metric on the configuration space SO(3) 
which is invariant under the lift Y of the action of SO(3) on itself by left translation, 
cf. Example 10.1.24. Thus, H belongs to the class of Hamiltonians discussed in 
Example 9.2.1, that is, the dynamics of the Euler top is described by the geodesic 
flow of © on the group manifold SO(3). Since H is invariant under W, the Hamilton 
equations can be read off immediately from (9.2.9):! 


a a=O lL, Le==ad(O "Li; (10.6.35) 


The first equation restates that L corresponds to angular momentum. It yields 
©-!'L =o. Then, the second equation takes the form? L = [L, w]. Applying the 
isomorphism (5.2.6), we obtain 


L4+o*%L=0. (10.6.36) 


These are the Euler equations for the force-free top. Equation (10.6.36) yields an- 
gular momentum conservation in the body frame. 

Let us discuss symplectic reduction for this model. From Example 10.1.24 we 
know that the Hamiltonian system under consideration possesses a natural momen- 
tum mapping. Under the above identifications, it is given by 


J:SO(3) x s0(3) > sa(3), J(a, L) = Ad(a)L, (10.6.37) 


cf. (10.1.27). Let SO(3)z; denote the stabilizer of L under the adjoint representa- 
tion. According to Example 5.4.7, the adjoint representation of SO(3) is isomorphic 
to the fundamental representation of SO(3) on R?. Thus, for every L £0, SO(3)z7 
is isomorphic to the subgroup SO(2) Cc SO(3) of rotations about the axis defined 
by L. For L = 0, we have SO(3);, = SO(3). Theorem 10.3.5 yields that TD) 
is diffeomorphic to the group manifold SO(3) and that the symplectic manifold 
(J~!(L)/SO()z, wl) provided by the Regular Reduction Theorem 10.3.1 is iso- 
morphic to the adjoint orbit @;, through L, endowed with the negative Kirillov form. 
Thus, 


J~(0)/SO@B)o = Oo = {0} 
and 
J7"(L)/SOQ), = 6, =Sjy), LO. (10.6.38) 
Moreover, from (10.6.37) we read off that the bundle 
my: J~'(L) > J*(L)/SO(@)z 


coincides with the bundle of oriented 2-frames in R?. 
Now, for a fixed nonzero element Lo of s0(3), let us describe the reduced Hamil- 
tonian system 


(J7'(Lo)/SO@B) 19, 0”, Hr) 


21 We use the simplified notation ©! L for the element of s0(3) corresponding to the vector @~!L. 


2.4 pair (L, w), fulfilling this differential equation, is called a Lax pair and the equation is called 
Lax equation, see Sect. 11.2. 
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provided by Proposition 10.3.3. For that purpose, we identify J~!(Lo)/SO(3) Lo 
with the adjoint orbit @,, through Lo, that is, with the sphere Si Lol of radius || Zo]| 
in s0(3). Elements of J~!(Lo) are pairs (a, L) such that L = Ad(a~!)Lo. Conse- 
quently, tangent vectors of J —!(Lo) at (a, L) are given by pairs (L/, A, ad(L) A) with 
A € $0(3). Since the stabilizer SO(3)z, acts by (A, (a, L)) t (ha, L), the natural 
projection zy, : JI“'(L9) > Stoll and its tangent mapping are given by 


(a LVS Ls (19)(a,t) (La A, ad(L)A) = ad(L)A, 
respectively. Thus, from (10.6.35) we read off the reduced Hamiltonian and its 
Hamiltonian vector field: 

1 _ 
H,,)(L) = 50 ht), (Xn, =ad(L)(O 'L), Le€Sjz,y- (0.6.39) 
The corresponding Hamilton equations are 
; -1 2 
L=[L,OVL], LeSjz,): (10.6.40) 
or, in vector notation, 
L=Lx (Q"'L), Ll] = [Lol 


Up to left translation by the stabilizer SO(3)7,, every solution t r+ L(t) of the 
reduced system (10.6.40) defines a solution t + a(t) such that tr (a(t), L(t)) isa 
curve in J~!(Lo). 


Remark 10.6.6 Using the above results, together with the fact that the Hamiltonian 
is a constant of motion, one can discuss the qualitative behaviour of integral curves. 
Assume that the body frame is chosen in such a way that the basis vectors e; coincide 
with the principal axes of inertia. Then, O = diag(O,;, ©, O3) and the constants of 
motion L? and H take the form 


a 

20; 202 20; 

Consequently, the integral curves t +> L(t) of the reduced system are given by the 
lines of intersection of the sphere with radius ||L|| with the ellipsoid defined by H. 
For an exhaustive discussion we refer to [69]. There, the reader will also find a 
solution of the reduced Euler equations in terms of Jacobi’s elliptic functions. 


VaL+E3+L3, H= 


Example 10.6.7 (The spherical pendulum) Let M = T*R? = TR? = R? x R? en- 
dowed with the standard symplectic structure. Consider the restriction of the natural 
action of SO(3) on R? to the subgroup SO(2) given by rotations about the axis 
of e3. In Example 10.1.23 we have seen that the Killing vector field generated by 
A € 50(2) under the lift W of this action to TR? is given by 


Ax(X, y) = (Ax, Ay) 
and that the mapping 
J:TR? > s0(2), (I(x, y), A) =y-(Ax)=(xxy)-A, (10.641) 
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is an equivariant momentum mapping. As before, the Lie algebra element A is iden- 
tified with the vector A = ||A||e3 in R? via the isomorphism (5.2.6). Thus, according 
to (10.6.41), the momentum mapping is given by the projection of angular momen- 
tum to the e3-axis. To summarize, the tuple (TR?, do, W, J) is a Hamiltonian SO(2)- 
manifold. Now, consider the SO(2)-invariant Hamiltonian 
1 
H(x,y)=5 sii? +x-e3+ 5 5 (iyi? — + es)/(I[xl|° 1) — 5x-y. 

The corresponding — equations read 

x=y+y/(IIxil? — 1) — (x-y)x, (10.6.42) 

. 1 

¥ = —e3 + 5€3/(IIxll* — 1) — (lly? —x-e3)x-+(&-y)y.  (10.6.43) 


Since x - x = 0, the projection of the integral curve with initial conditions x9, yo to 
the configuration space R? is located on the sphere of radius ||xo||. Thus, the Hamil- 
tonian system (TR?, w, H) restricts to a Hamiltonian system on the submanifold 


S? = {(x,y) € TR?: |[x||?=1, x-y =O}, (10.6.44) 


cf. Remark 2.1.4/2. As in Example 10.1.23, one can check that the canonical 1-form 
6 of TR? pulls back to the canonical 1-form 6 on TS?. The restriction of H to TS” 
is given by 


: 1 
A(x, y) = 5liyil? +X- 3 (10.6.45) 


and the Hamilton equations (10.6.42) and (10.6.43) become 
x=y, y =—e3 — (ly? —x-e3)x. (10.6.46) 


We note that TS? is the phase space and H is the Hamiltonian of the spherical 
pendulum.”* Since W leaves the submanifold TS? invariant, it induces a symplectic 
action 
W : SO(2) x TS* > TS?, W(x, y) =(a-x,a-y), 
and the restriction J = J )Ts2 yields an equivariant momentum mapping. We find 
J (x,y) =x, y2 — x21. (10.6.47) 


Thus, we obtain a Hamiltonian SO(2)-manifold cS", dé, W, J ) with Hamilto- 
nian H. The Noether Theorem 10.1.9 implies that J is a constant of motion. 

Let us discuss symplectic reduction for this system. First, observe that W is not 
free, because it has the fixed points 


mo = (+e3, 0). 


Thus, we have to perform singular reduction. Since the adjoint action of the Abelian 
group SO(2) is trivial, for all values j € s0(2) we have @; = {j} and G; = SO(2). 


?3With the gravitational acceleration set equal to 1. 
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The points mo have the stabilizer SO(2), whereas all other point of TS* have the 
trivial stabilizer {1}. Consequently, the submanifolds of the above isotropy types are 
given by 


Mia =TS?\ {my Umg}, — Msoay = {mg } U {mg J. 
For the intersections with M; = J =1y J) we obtain 
MsoQ) 1M; = 2, May M; ={(%, y) € TS*: (xx y)-e3 = j} 
for j #0 and 
Mgo2) 0 Mo = {mg} U {mG}, Mia} Mo = {(x, 0) € TS? :: x 4 tes} 


for j = 0. Hence, in the first case, M 6, consists of the single orbit-momentum type 
stratum 


M; = {(x, y) € TS”: (x x y)-e3 = j} /SO(2) 
and in the second case it consists of the three orbit-momentum type strata 
Mo = {(x,0) € TS?:x#+e3}/SO2), Mor={mp},  Mo-={mg}, 


forming the principal stratum and the two secondary strata, respectively. 


Remark 10.6.8 


1. Let us analyse the structure of the strata found above by means of classical invari- 
ant theory, see [247] and [312]. The algebra R[x, ypo of invariant polynomials 
in the variables x, y € R? is generated by the invariants 


T = X3, T2 = Y3, B=yit+y5, 
T= X1 Yl + x2Y2, 1 =x +X5, T= X12 — X2Y1 
which fulfil the relations 
+t =13t5, 1320, t5>0. (10.6.48) 
They define the so-called Hilbert mapping 
tT: TR? > R®, T(x, Y) := (u1(x, Y),--+, T6(X, y)). 


In invariant theory it is shown that the topological quotient TR? /SO(2) is home- 
omorphic to the image of 1, that is, to the subset of IR° defined by the relations 
(10.6.48).*4 In the present case, this can be checked by direct inspection, see 
[69]. Let us pass to new invariants o; defined by 


o3:= 34+, Oj := 7, 13. 
Restriction to TS? yields the additional relations 


osto7=1, o4+0102=0, 


24 \ subset of R” defined by equations and inequalities is said to be semialgebraic. 
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which allow for elimination of o4 and o5. Therefore, TS?/ SO(2) can be identi- 
fied with the image of the mapping 


o:TS’>R*, a(x, y) = (a1(%, y), 02%, y), 03(%, y), 06(%, y)), 
which is the subset of IR* defined by the relations 
03(1—07) — a3 —o7 =0, 03-05 >0, loi| <1. 


According to (10.6.47), we have o6 = J. Therefore, Mo, = J~'(j)/SO(2) can 
be identified with the subset of R? defined by the relations 


o3(1—07) —o7 —j? =0, 03 — 03 >0, loi| <1. (10.6.49) 
In case j 4 0, this subset coincides with the graph of the function 
“2 2 
JO +9 
03(01, 02) = i >, loi| <1. 


1 
Hence, in this case we obtain a 2-dimensional smooth manifold diffeomorphic 
to R*, representing the orbit-momentum type stratum Mj. In case j = 0, the sub- 
set (10.6.49) cannot be the graph of a function, because it contains the vertical 
lines {(+1, 0, 03) € R?, 03 > O}. The endpoints (+1, 0,0) of these lines corre- 
spond to the secondary strata Mox of M Op and all the rest corresponds to the 


principal stratum Mo. Topologically, this subset is also isomorphic to R?. 
. From the above construction we can read off the preimages of the mapping 


JI1(j) > Mj. (10.6.50) 


For the point (£1, 0, 0), the preimage consists of the single point mo . For all the 
other points, it is a 1-sphere. 
. Using the invariants, one can compute the Poisson brackets and, thus, determine 
the symplectic structure on the strata of the reduced phase space, see [69] and 
Exercise 10.6.9. One obtains 


oF 
{o;, 0%}; = 2 fit 55 
where 


Fj (01, 02,03) = 03(1 —o?) -op—j. 
The reduced Hamiltonian can be read off from (10.6.45), 


1 
Hj = rim (10.6.51) 
and the Hamilton equations are given by 
Gj = {Hj, oi} ;. 


All_these equation have to be viewed as equations on J 1 J)/SOQ) = 
o(J—1( j)). In particular, the relation Fj (01, 02,03) = 0 has to be taken into 
account. 
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Exercises 

10.6.1 Prove the formulae (10.6.5) and (10.6.6). Show that the projections of in- 
tegral curves of this system to the configuration space R* are located on 
3-spheres. 

10.6.2 Verify Eq. (10.6.13). 

10.6.3 In Example 10.6.1, show that the kernel of Jee is spanned by the geodesic 
vector field X j;. 

10.6.4 Prove the formulae (10.6.15) and (10.6.16). 

10.6.5 Prove that the fibre bundles given by (10.6.17) are isomorphic to the Stiefel 
bundle Sp(2, 4) > Gr(2, 4). 

Hint. Find the preimages under the covering homomorphism (5.1.10) of the 
SO(2)-subgroups of SO(4) occurring in Sp(2, 4) and Gr (2,4). 

10.6.6 Prove Formula (10.6.28). 

10.6.7 Show that for positive values of the energy, the hidden symmetry of the 
Kepler problem is given by the Lie algebra of SO(3, 1) and that for vanish- 
ing energy it is given by the Lie algebra of the group of Euclidean motions 
SO(3) « R?, cf. Remark 10.6.4. Find the corresponding orbits. 

10.6.8 Prove that the mapping (10.6.32) defines a symplectomorphism. Compute 
the pull-back of the Hamiltonian under this mapping. 

10.6.9 Compute the Poisson brackets of Remark 10.6.8/2. Show that the symplectic 
form on the reduced phase space is given by 


o= do, A do. 


a, —1 
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In this section, we discuss a model arising from lattice approximation of SU(3)- 
gauge theory in the Hamiltonian approach. For a detailed presentation we refer to 
[92] and further references therein.” 

Let A be a finite regular cubic lattice in R?. Let us denote the sets of oriented 
i-dimensional elements of A (sites, links, plaquettes and cubes) by A’. The gauge 
group is G = SU(3) and its Lie algebra is g = su(3). The g-valued gauge potential 
will be approximated on links by its G-valued parallel transporter: 


Alas G, (x, y) > ay). 
Thus, the configuration space and the phase space of the system are given by 


O=G", mM=TG", 


>This is part of a research program which aims at developing a non-perturbative approach to 
quantum gauge theory in the Hamiltonian framework, with special attention paid to the role of 
non-generic gauge orbit strata [144, 161-164], see also [254-256]. 
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respectively. Using the natural identification T*G4' = (T*G)4", the left trivial- 
ization T*G = G x g* given by (8.3.6) and the isomorphism g* = g induced by 
the Ad-invariant scalar product (A, B) = —tr(AB) on g, we may identify M with 
G"' x g4'. Under this identification, the momentum canonically conjugate to the 
gauge potential (the colour-electric field) is given by a mapping 

A g, (x,y) Awy). 
Local gauge transformations are approximated by mappings 

ws G, XF> gx, 

acting on the parallel transporters by 


x,y) = 8xAxy)8y (10.7.1) 


This defines an action of G4” on Q. Under the above identification, the lift of this 
action to M = T*Q = G" x gt is given by (10.7.1) and by 


Algy) = Ad(gx) Acx,y)- 


In lattice gauge theory, one considers the following gauge invariant Hamiltonian: 


1 
=-5 3 (Aga) + 5235 Y > (6—tr(ap +ap)), (10.7.2) 
(x,y)e Al peA2 


see [167] for the original source. Here, 5 and a denote the lattice spacing and the 
coupling constant, respectively, and ap is the parallel transporter around the plaquet- 
te p= (x,y,z, U), 


Ap = Ax, y) Uy,z)4(z,u) 4 (u,x) - 


In what follows, we restrict ourselves to the simplest non-trivial case, where A con- 
sists of a single plaquette. Using a lattice tree, one can carry out an intermediate 
reduction which leaves one with a configuration space Q given by the group man- 
ifold G and with a phase space M given by the cotangent bundle T*G = G x g. 
Let us denote the elements of G x g by (a, A) and let us write tangent vectors of 
G x g at (a, A) in the form (L/,B, C) with B,C € g. The group of local gauge 
transformations boils down to G itself and its action on G x g is given by 


W(g, (a, A)) = (gag, Ad(g)A). (10.7.3) 
According to (8.3.7), the canonical 1-form 6 on G x g is given by 
O(a,a)(L,B, C) = (A, B) (10.7.4) 
and for the equivariant momentum mapping (10.1.23), Example 10.1.25 yields 
J:Gxg>g, J(a, A) =aAa7!— A. (10.7.5) 


Thus, (M, dé, W, J) is a Hamiltonian G-manifold. Finally, the Hamiltonian (10.7.2) 
reduces to 


3 


6 1 
H= a: tr(A*) + F356 —tr(a+a')). (10.7.6) 
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In what follows, we shall discuss symplectic reduction for this system. From the 
point of view of physics, this corresponds to removing the (unphysical) gauge 
degrees of freedom. Therefore, the level set Mo = J -1(0) is relevant, because 
J(a, A) = 0 is equivalent to the Gauf law.”° Thus, we consider the reduced phase 
space 


P := Mo, = Mo/G. (10.7.7) 
From (10.7.5) we read off 
Mo = {(a, A) EG x g:aA= Aa}. 


In particular, Mo contains the subset T x t, where T C SU(3) denotes the subgroup 
of diagonal matrices (maximal torus) and t denotes its Lie algebra. By restriction, 
the natural projection Mp — P induces a mapping 


A:T xt P. (10.7.8) 


Let (a, A) € Mo. Since a and A commute, they possess a common eigenbasis. Since 
a is unitary and A is anti-Hermitian, the eigenbasis can be chosen to be orthonormal. 
Hence, (a, A) is conjugate to an element of T x t under W, that is, every G-orbit in 
Mpo intersects the submanifold T x t. Hence, A is surjective. Since two elements of 
T x t are conjugate under G iff they differ by a simultaneous permutation of their 
entries, A descends to a bijection 


(T x t)/S3 > P, 


where S3 denotes the symmetric group on 3 symbols. Standard arguments ensure 
that this is in fact a homeomorphism.”’ Thus, we can use A to describe P. 

Next, we determine the stabilizers and the orbit types of the elements of T x t 
under the actions of S3 and of SU(3). The basic observation is that in both cases the 
stabilizer of (a, A) depends on the number of entries which simultaneously coincide 
for both a and A. This number can be 0, 2 or 3. Denote the corresponding subsets 
of T x t by, respectively, (T x t)2, (T x t); and (T x t)o and define 


P:=A(T xj), i=2,1,0. 
By restriction, 2 induces mappings 
ith xd BB, 7951.0, (10.7.9) 


which descend to homeomorphisms from (T x t);/S3 onto P,, i= 2,1,0. To de- 
termine the subsets (7 x t); explicitly, as in Example 6.6.6, let Z = Z3 denote the 
centre of G = SU(3), let TS, j =1,2,3, denote the subset of T consisting of the 
elements whose entries other than the jth one coincide and define t/ analogously. 
For j = 1, 2,3 let 


(T xt) = (T! x t/)\ (Z x {0}). 


26See [161]. 


27Tn particular, P is an orbifold. 
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We find 
(T x Ho = Z x {0}, 
3 
Txt=Uoxo!, 
j=) 
TxO=7xOH\(( x UM x Hp): (10.7.10) 


Obviously, all the subsets (TJ x t); are embedded submanifolds of T x t. Moreover, 
all the connected components (T x t)/ of (T x ty project to the same subset of 
(T x t)/S3. Under the action of $3, the elements of (T x t)2 have the trivial stabilizer 
{1} and hence the orbit type [{1}], the elements of (T x t)g have the stabilizer S3 and 
hence the orbit type [S3] and the elements of (T x t)/ have the subgroup of order 2 
generated by the transposition of the two coinciding entries as their stabilizer. Since 
the latter subgroups are conjugate in S3, all the elements of (T x t); have the same 
orbit type [S2]. To summarize, we have found that, under the action of S3, 


(T x to =(T x yay, (Tx ti =( x t)syy, (T x Ho = (T x t)[sy]- 


On the other hand, under the action of SU(3), the elements of (T x t)2 have stabilizer 
T and hence orbit type [T], the elements of (T x t)g have stabilizer SU(3) and 
hence orbit type [SU(3)] and the elements of (T x t)/ have stabilizer U(2)/, where 
the subgroups U(2)/ are given by (6.6.3). Since the latter subgroups are conjugate 
in SU(3), all elements of (T x t); have the same orbit type [U(2)]. Thus, 


Po=Pir}, Pi=Puey, Po= Fue 

Recall that P = Mop so that with P, we have determined the orbit-momentum 
type subsets Mun al Moe with coadjoint orbit @ = Oo. From (10.7.10) we read 
off that P) and P; are connected and that Py consists of three points labelled by 
the elements of Z. Thus, altogether there are five orbit-momentum type strata. By 
Theorem 10.5.4, each of these strata inherits the structure of a smooth symplectic 
manifold from G x g. Taking into account that S3 is finite, from (10.7.10) we can 
also read off the dimensions: Py has dimension 4 and P, has dimension 2. 

One can use the mappings A; to describe the symplectic structure of the orbit- 
momentum type strata as follows. Since t is the Lie subalgebra of su(3) associated 
with the Lie subgroup 7, the submanifold T x t is a symplectic submanifold of 
G x g. Analogously, so are (T x o/, j =1,2, 3. It follows that (T x t)2 and (T x €); 
are symplectic manifolds. For convenience, in the following we will view (T x to 
as a (trivial) symplectic manifold, too. 


Proposition 10.7.1. The mapping d is Poisson. The mappings i are local symplec- 
tomorphisms. 


Proof By definition, C°(P) is a quotient of C°(G x g)@, see Remark 10.5.11/3. 
Hence, the first assertion is a direct consequence of the fact that T x t is a symplectic 
submanifold of G x g. The second assertion is trivial for i = 0. For i = 1,2 it 
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? 8 e 8 e e 


aeT® aeT! aEeZ 


Fig. 10.1. The fibres #7! ([a]) of the projection # : P > O 


follows by observing that any point of P, has a representative (a, A) in (T x t); and 
that a sufficiently small Gia, 4)-invariant neighbourhood W of (a, A) in (T x t); 
generates a tube for the action of G = Go on a!(B) 1 Mo, modelling the open 
neighbourhood G - W of the orbit G - (a, A) onto G xG4.4) W = G/Gia,a) x W. 


Remark 10.7.2 


1. Since the submanifolds (T x t); are symplectic and since S;3 is finite, the quotient 
(T x t)/S3 naturally carries the structure of a stratified symplectic space, cf. 
Remark 10.5.11/3. In view of this, Proposition 10.7.1 states that A descends to 
an isomorphism of stratified symplectic spaces from (T x t)/S3 onto P. 

2. Dynamics on P with respect to an SU(3)-invariant Hamiltonian like (10.7.2) 
is thus given by the dynamics on T x t with respect to the corresponding S3- 
invariant Hamiltonian and the symplectic form dé, where @ is given by (10.7.4) 
witha eT and A, B,C et. 


Now, let us analyse the projection 
7:P+0 
induced by the cotangent bundle projection T*Q — Q. We have the commutative 
diagram 


Peta 


P 
pn | [ 
| a O 
where the lower horizontal arrow is defined by restriction of the natural projection 
Q — Q. Consequently, the fibre over [a] € Q is given by 

#—"([a]) =t/(S3)a, 


where the representative a is chosen in T and where (S3), denotes the stabilizer of 
a under the action of S3. There are 3 cases, illustrated in Fig. 10.1. As in Exam- 
ple 6.6.6, let T° denote the subset of T of elements with pairwise distinct entries. 
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(a) Ifa € T°, (S3)q is trivial, hence #~! ({a]) = t. That is, the fibre is a full 2-plane 
and belongs to Pp. 

(b) IfaeT/ \Z, 7 =1, 2,3, then (S3)¢ = So, acting by permutation of the coincid- 
ing entries. Hence, ~! ([{a]) = t/So, acting by reflection about the subspace t/. 
Therefore, the fibre may be identified with one of the two closed half-planes of t 
cut out by t/. Its interior belongs to P>, whereas the boundary t/ belongs to Py. 
If a € Z, then (S3)q = S3. The action of S3 on t is generated by the reflections 
about the 3 subspaces t/, j =1,2,3. Hence, | ({a]) may be identified with 
one of the six closed subsets of t (the Weyl chambers) cut out by t/, j = 1,2,3 
(the walls of the Weyl chambers). The interior of the Weyl chamber chosen 
belongs to Py, the walls minus the origin belong to P, and the origin belongs 
to Po. 


(c 


wm 


Remark 10.7.3 


1. Recall from Example 6.6.6 that the reduced configuration space O coincides 
with the orbit space of the action of $3 on the subgroup T C SU(3) of diagonal 
matrices by permuting the entries, 


O=T/S3. 


From the above discussion it is obvious that the projection 7 : p> O does not 
preserve the stratification, because the fibres over points in the secondary”® orbit 
type strata of O, corresponding to the edges and the vertices of the 2-simplex 
structure of 0, intersect more than one orbit-momentum type stratum of P. 

2. The description of the reduced data given here generalizes to an arbitrary com- 
pact semisimple Lie group in an obvious way: T and t are replaced by a maximal 
torus in G and the corresponding Cartan subalgebra of g. O is replaced by a Weyl 
alcove in T and S3 is replaced by the Weyl group of G. 

In the simpler case G = SU(2), on the other hand, we obtain Pe (S! x R)/S2, 
where the generator of Sz acts by complex conjugation in the first component and 
by multiplication by —1 in the second one. This way, the reduced phase space is 
getting identified with an orbifold usually referred to as the canoe, see Fig. 10.2. 
It consists of three orbit-momentum type strata: the two points [(+1, 0)] and 
the rest, which is a 2-dimensional symplectic manifold diffeomorphic to the 
2-punctured 2-plane. For a detailed discussion of its symplectic structure, see 
[142]. From Example 10.6.7 it is clear that P coincides with the reduced phase 
space of the spherical pendulum with zero angular momentum. 

3. As we know from Proposition 10.5.9, the orbit-momentum type strata of P can 
also be obtained via regular reduction of the connected components 2'7,,,, of the 
subsets My 1 M,, by the quotient Lie groups (I’y,,),,. For 4 = 0 and for the 
choice of the subgroups T and U(2)! as representatives of the corresponding 
orbit types, we find that 


28Cf. Remark 6.6.2/5. 
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configuration space Q and 
reduced phase space P for ee awe = (+1,0) (-1,0) 
G=SU(2) 
T&S! Txt2S'xR 
O=S'/S, P~ (Ss! x R)/S2 


(a) the subsets My M M,, are given by 
MrOMo=(T xt)2, Mya AMo=(T x 8), 
Msug) 1 Mo = Z x {0}, 


where the last one decomposes into the connected components {(a, 0)}, 
aeéZ; 
(b) the groups (’y,,)o = I'y,, are given by 


I (rx), = S83, Trxey! = {1}, Tao ={1}, aeZ. 


. For a detailed description of O and P in terms of invariants, including the strat- 
ification, we refer to [92]. Using standard results of the invariant theory of com- 
plex matrices one can show that the algebra of invariant real polynomials on Mo 
is generated by 


ce = Re(tr(a(—iA)*)), k=0, 1,2, 
dy = Im(tr(a(—iA)*)),  k=0,1,2, 
tk = tr((—iA)*), k=2,3. 


By setting A = 0 one obtains a set of generators for the invariant real polynomials 
on Q=G: 


co = Retr(a), do = Imtr(a). 
The associated Hilbert mappings 


pp = (Co, do, C1, di, C2, d2, fp, 13): Mo > R¥, P6 = (co, do): Q> R? 


induce homeomorphisms from P and O onto the respective images, which are 
semialgebraic subsets of IR° and R?, respectively. Consider the following poly- 
nomials: 


Ri = (34.05 +6) — 2(c} + df) — 4(coc2 + dod2), 
1 
R= (3 - 3 (0 + a) ) — 2(c1c2 + dyad), 


R3 = coc2 — dod2 — 2cotr — a + d; + 3c2, 
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Fig. 10.3. The image of PO (— 3 3 V3) 
in R? “0 


R4 = cod2 + docz + 2dotr — 2e1d\ — 3d, 
R5= 5 (co —Dd,+ doc1)tz + (Feot + ay) — cyjdz —d\c2, 
P, = 27 — cj — 2chde — dj + 8c, — 24cod? — 18c2 — 18d2, 

Py = 243 — 312, 


2 
— 42 2 2 
P3 = ty — cy — dz. 


The image of p, is given by the inequality P, > 0. It forms a region in R? 
bounded by a hypocycloid, see Fig. 10.3. As is shown in [92], the defining equa- 
tions and inequalities for the image of ps are given by 


R;=0, i=1,...,5, P,>0, i=1,2,3. 


In a similar way, one can derive the defining relations for the images under p, 
of the orbit-momentum type strata. 

5. It turns out that the reduced configuration space is a deformation retract of the 
reduced phase space, see [95]. Therefore, to study the topology of the reduced 
phase space amounts to studying the topology of the reduced configuration space. 
For case studies in the context of lattice gauge theory, see [61, 62]. 
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Let (M,w, WY, J) be a Hamiltonian G-manifold with G-invariant Hamiltonian H. 
As we know from the Noether Theorem 10.1.9, the functions J, are constants of 
motion for all A € g, that is, the level sets of H and of J are invariant under the flow 
of the Hamiltonian vector field X 7. Thus, the dynamics reduces to the level sets of 
the combined mapping 


&€:M—>Rxg*, &(m):=(H(m), J(m)). (10.8.1) 
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Studying the partition of M into these level sets is at the heart of a general pro- 
gramme formulated by Smale, see [277, 278], as well as [1] and [69]. In this context, 
the notion of the bifurcation set of a mapping plays a key role. 


Definition 10.8.1 (Bifurcation set) A smooth mapping yg: M —> N is called locally 
trivial at po € N if there exists a neighbourhood U C N of po, such that the follow- 
ing conditions are fulfilled: 


1. The level set g~!(p) is an embedded submanifold of M for all p € U. 
2. There exists a smooth mapping ®: g~!(U) — g7!(po), such that 


yx :9-'(U)>U xg '(po) 
is a diffeomorphism. 


The bifurcation set By of ¢ is the set of all points in NV, for which g is not locally 
trivial. 


Point 2 means that 9;4-1(y): yg '(U) > U is a trivial fibre bundle over U with 
typical fibre y~!(po). In particular, then (py x P)hy-1(p) Yields a diffeomorphism 


between the level sets g~!(po) and g~!(p). As p runs through the bifurcation set, 
the topology of the level set g~!(p) may change. The programme of Smale can be 
summarized as follows: 


1. Determine the topological type of the level sets of & and its bifurcation set. 

2. Determine the flow of Xq on each level set. 

3. Investigate how the fibres & lh, [L) fit together as (A, jz) runs through the values 
of &. 


The following two observations are helpful for working out this programme in 
concrete cases. The first one is of general nature and concerns the bifurcation set. 
Let My be the set of critical points of g. Then, g(M,) is the set of its critical values. 
The proof of the following proposition is left to the reader (Exercise 10.8.1). 


Proposition 10.8.2 For a smooth mapping g: M — N, one has (Mg) C By. If g 
is proper, then p(Myg) = Bo. 


The second observation is a consequence of the specific structure of the energy- 
momentum mapping. By the Regular Reduction Theorem 10.3.3 for invariant 
Hamiltonian systems, the reduced Hamiltonian is given by 


Ay 0M, = Ho ju, 
see (10.3.5). Thus, 
(His) =m! oH, (h) 
and, therefore, 


"(hw = A") NIT (W) = (yi) |@ =2,' (Az). (10.8.2) 
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Fig. 10.4 The effective Ui(q) 
potential (10.8.4) 


This identity reduces the study of the topology of the fibres &~! (h, j2) to computing 
the preimages of the level sets of the reduced Hamiltonian and taking the preimages 
under zr,,. An analogous statement holds for the singular case. 

In what follows we discuss essential aspects of the programme of Smale for the 
Kepler problem, the Euler top and the spherical pendulum. 


Example 10.8.3 (Kepler problem) According to Example 10.6.3, we have 
M =T(R° \ {0}) = (R° \ {0}) x B® 
and under the identification of s0(3) with R? via the isomorphism (5.2.6), the 
energy-momentum mapping is given by 
&€ =(H,J):T(R* \ {0}) > Rx R? 
with 
pl 
AgGp= >--, J@pP)=axp=L. 
2 4 
Let (h, L) € R x R? and denote / = ||L||. By (10.8.2), we have 
6h, L) =2;,,'(H; '@), 
with the reduced Hamiltonian H and the natural projection 
my: J~'(L) > J~!(L)/SO@G)z. (10.8.3) 
As in Example 10.6.3, we have to distinguish the following two cases. 


(a) In case / > 0, the reduced Hamiltonian H is given by (10.6.18) and we have a 
one-dimensional motion with effective potential 
Cee (10.8.4) 
I q — 2q2 q ’ Oo. 


see Fig. 10.4. The reduced Hamiltonian H; possesses the single critical point 
(q°, P)) = (I*, 0) and hence the single critical value h° = . Using 


~ 2P 
Pq = +y/2(h — Ui(q)), 


from the shape of the effective potential we can read off the topology of the 
level sets H, '(h), see Fig. 10.5: For h° < h <0 we get a sphere S!, which in 
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Pq Pq 


Fig. 10.5 Level sets Hy (h) in the Kepler problem for / > 0 (left) and 1 = 0 (right) 


the limit h > h° descends to the 1-point space. For h > 0 one obtains the real 
line R. Since for / > 0, the preimages of the projection (10.8.3) are isomorphic 
to S!, we obtain 


s! forh =h°, 
&—'(h,L) = 3 T? for h® <h <0, (10.8.5) 
RxS! forh>0. 


(b 


we 


In case / = 0, under the identification of the reduced phase space with the sub- 
manifold © C M given by (10.6.19) with some chosen nonzero qo € R?, the 
integral curves and hence the level sets of the reduced Hamiltonian H) are of 


the form 
1 
{ (ze + 2(h + ~ \e) :0<q< ix] 
q 
qo 


with e = Taon S&° Fig. 10.5. For h < 0, we have gmax = -; and hence 


H, '(h) >R. For h > 0 we find qmax = oo and hence |: ie (h) < S° x R. Since 
for / = 0 the preimages of the projection (10.8.3) are isomorphic to S*, we ob- 
tain 


Rx S? forh <0 


S°xRxS? forh>0. Cnn 8 0) 


&"(h,0) = 


From (10.8.5) and (10.8.6) we read off that the bifurcation set is the preimage of 
the set 


1 
{(-32-) il> of U {(0, 1) lS 0} U { (h, 0) the R} CR’, (10.8.7) 
under the mapping 


Rx s03)>R*, — (h, L) (h, ILI), (10.8.8) 


see Fig. 10.6. We encourage the reader to study the Hamiltonian flow on the level 
sets given by (10.8.5) and (10.8.6) and to compare these results with what he knows 
from classical mechanics, see Exercise 10.8.2. In the next chapter, we will return to 
the case h? < h <0, L¥0, in the context of action and angle variables. 
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T2 
s! 


Rx S? 


Fig. 10.6 Image of the energy-momentum mapping & of the Kepler problem, projected to the 
[-h-plane. The (projected) bifurcation set consists of the critical values of &, represented by the 
hyperbola, and of the two coordinate axes. The regions are labelled by the topological type of the 
level sets of & according to (10.8.5) and (10.8.6) 


Example 10.8.4 (Euler top) According to Example 10.6.5, we have 
M =T*SO(3) = SO(3) x 50(3) 
and the energy-momentum mapping is given by 
6 =(H, J):SO(3) x s0(3) > R x 50(3) 
with 
Laat 
H(a,L)= - (L,L), J(a, L) = Ad(a)L. 


Let (h, Lo) € R x s0(3) be a fixed value of & and denote / := || Lo||. Recall that the 
reduced phase space may be identified with the adjoint orbit @;,, endowed with the 
negative Kirillov structure and that the latter coincides with the origin for / = 0 and 
with the sphere S? of radius / in s0(3) otherwise. Moreover, in the latter case, the 
reduced Hamiltonian reads 


1 
Hy (L) = 501, DD), Les. 


To find its level sets, we first have to find the critical points. For that purpose, we 
pass to vectors L € S: C R? and choose the body frame in such a way that 


0, O 
O=|0 © 0 
0 O 63 
For simplicity we assume that 
0<A3=0,' <’2=0,' <4, =07' (10.8.9) 


and leave the discussion of the other cases to the reader (Exercise 10.8.3). The crit- 
ical points of H) can be determined from 


1 
(Hi +af")\L)=0, f= 5 (ILI? -1), 
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with L € R? and w denoting a Lagrange multiplier. We obtain the equations 
(a1L1, A2L2,A3L3)—a(L1,L2,L3)=0, 9 LT +L3+L3=7, 


that is, L is located on the intersection of the eigenspaces of the matrix ©~! with 
SF. This yields the critical points 


L; =+le,, i=1,2,3. (10.8.10) 


Remark 10.8.5 Since the critical points of H) coincide with the critical points of the 
corresponding Hamiltonian vector field X y,, they can be read off from (10.6.39), 
too. Thus, on the level of s0(3), they are given by the solutions of the equation 
[O-'L, L] =0 and hence by the eigenvectors of O~!, indeed. 


Next, we show that H) is a Morse function. For that purpose, we observe that for 
every i, the tangent space 


Ty +87 = ker f’(L;*) = {Xe R*?:X-L =0} 
is spanned by the two standard basis vectors e; and ex, with j,k #i. We compute 


2 —% 0 
Hess; + (Hi) = 0 : , 


A3—A1 
Ay—A 0 
Hess; + (H)) — : 0 2 eee ia ; 
Ay—A 0 
Hessy + (H)) — : 0 2 ho— ml : 


Since 4] — Az > 0, A, —A3 > O and Az — A3 > O, the critical points Ly, Lz, Lz are 
non-degenerate and have the Morse indices, respectively, 2, | and 0. 


Remark 10.8.6 Using Proposition 9.7.1, from the Hessians we read off that the crit- 
ical point L> of the Hamiltonian vector field X 7, on Ss; is stable for 1 = 1, 3 and un- 
stable for i = 2. It follows that these points are also stable or unstable with respect to 
the projected flow on T*SO(3)/SO(3) = s0(3) as a whole, see [181, Thm. III.12.4]. 
Thus, the corresponding stationary motions in T*SO(3), given by rotation about 
the corresponding principal axes of inertia, are relatively SO(3)-stable or SO(3)- 
unstable, cf. Remark 6.8.5/4. 


Proposition 10.8.7 The level sets of Hj have the following topological structure. 


. For 1 =0, one has h=0 and H* (h) consists of a single point. 

. Forl>Oandh= sl hi, i= 1,3, the level set Ay (h) consists of two points. 

. For 1 > Oandh € (517A3, $1?A2) U (GP Aa, $1724), one has H,'(h) =S'us!. 

. Forl>Oandh= 517d2, the level set H, |(h) is the union of two distinct great 
circles on S?. 


BRWN eR 
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Proof 1. As noted above, in this case the reduced phase space consists of a single 
point. 
2. For h= 517A, we have 


ML} +A2Le +a3L2=Pa3, LI+L34+ L352. 
If we multiply the second equation by 43 and subtract it from the first one, we obtain 


(Ar —As)LT + 2 —A3)L3 =0. 


Under the assumption (10.8.9), this implies L; = Lz = 0 and hence L= L. 
3. The critical points Lz are non-degenerate minima with Morse index 0. Hence, 


according to the Morse Lemma 8.9.3, for values of h between 5I7A3 and 51713 +6 
with ¢ small enough, the level set H, (nh) is the union of two 1-spheres in the 
neighbourhood of the points Lt and L, . An analogous statement is true for values 
of h between 5A 1 — € and 517A 1. Then, the assertion follows from the Morse 


Isotopy Lemma 8.9.6. 
4. For h = 5l *2, an analogous calculation as under point 2 yields the equation 


(Az — A3)L3 — (Ay — Aa) LZ =0. 
This defines two planes P+ intersecting in the subspace spanned by e2. Hence, 


H,'(h) = (PU P_) NS}. 


This set consists of two circles which intersect in the points L;. 


In the following, we write L; for the elements of s0(3) corresponding to the 
critical points L;-. 


Proposition 10.8.8 For Lo 4 0, the function Hy) j-1(,,) is an SO(3)z,)-invariant 
Morse-Bott function with 6 non-degenerate critical SO(3),,-orbits 


a0, (oy. #1 


0 

having Morse indices, respectively, 2, 1 and 0. 

Proof Due to H 0 jr, = Ho 719, for any p € J—!(Lo), we have 
(Ah y-1(19))p = (AD (p) ° Gio) y: 


Since zy, 1s a submersion, this implies that p is a critical point of H} j-1(,,) iff 
Ly (p) is a critical point of H;,. The subset of critical points of Ar y-1 (19) is thus 


the union of the subsets y;~, i = 1, 2,3. Obviously, every y;~ is an SO(3)7)-orbit. 
Since SO(3);, = SO(2), it is therefore diffeomorphic to a 1-sphere. 
It remains to show that H; ;-1(;,) is a Morse function and to determine its Morse 


index at y;~. For that purpose, we choose an open neighbourhood U;~ of the critical 


point L> in S and a local trivialization 


xf oz) (UP) > UF x SOB), 
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Using x;~, the SO(3),,-orbit y;~ can be parameterized by 


ce ee 
the (x; ) (is ,exp(tA)), 
where A € $0(3)7, =50(2), A 40. For every fixed value fo, the submanifold 


S* = (x*) | (UF x exp(toA)) (10.8.11) 


intersects y;~ transversally in the point (x ra ae (L~, exp(t9A)). This shows that H; 
is a Morse-Bott function. Since (To) gti %- — U;" is a diffeomorphism, the 


Morse index of H ty! (Lo) at Y; (to) coincides with the Morse index of Hj at L>-. 


Proposition 10.8.9 The level sets of the energy-momentum mapping €& for the Euler 
top have the following topological structure. 


. For Lo = 0, one has h =0 and &~'(h, Lo) = SO(3). 

. For Lo #0 and h = 517), or h = 5173, one has &-!(h, Lo) =S! us}. 

. For Lo #0 and he (5173, 5172) U (SP? A2, 5171), E-'(h, Lo) = TUT’. 

. For Lo £0 andh= 5!7do, the level set &—! (h, Lo) is a union of two tori inter- 
secting along two circles. 


RWNrR 


Proof 1. This is obvious. 

3. Proposition 10.8.7 implies a, (h) = S'US!. Each of these two 1-spheres is 
the boundary of a 2-disc De; i = 1, 2, which is contractible in Sr It follows that the 
bundle zz: J~!(Lo) > S? is trivial over Dp. Thus, ia (D?) = D? x S! and hence 

E-'(h, Lo) =m; (Hy, '(h)) = (aD{ x S') u (aD3 x S'). 

2. This is a limiting case of point 3. According to Proposition 10.8.7, here 
i (h) consists of two points. Over each of them we have a fibre si. 

4. According to Proposition 10.8.7, in this case, H, |(h) is the union of two 
circles S on ce which intersect in the points L*. As under point 3, these circles 


are the boundaries of two 2-discs and we obtain 2-tori, which intersect along the 
critical curves y;~ of Proposition 10.8.8. 


According to Proposition 10.8.9, the bifurcation set of the energy-momentum 
mapping & of the Euler top is the union of three paraboloids, 


3 
Be= Ufan ER x §0(3): srilei? =a}, 


i=1 


see Fig. 10.7. We encourage the reader to find out how the level sets of & fit together 
to build J~! (Lo) for a fixed Lo, see Exercise 10.8.4. 


Example 10.8.10 (Spherical pendulum) According to Example 10.6.7, we have 
M =TS* = {(x,y) € TR’: ||xll =1, x-y=0} (10.8.12) 
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Fig. 10.7 Image of the 2 4 
energy-momentum mapping h 
& of the Euler top, projected 

to the /—h-plane via the 

mapping (10.8.8) (shaded 3 
region). The (projected) 
bifurcation set consists of the 
three parabolas corresponding B 
to A, (narrow), Az (medium) 
and 43 (wide). The numbers 2 
refer to the cases 1-4 of 
Proposition 10.8.9 


and the energy momentum mapping is given by 
& =(H, J): TS? > s0(2)=R 
with”? 
1 
A(x, y)= silyl? +x: e3, J (x,y) =X1y2 — x21. (10.8.13) 


To find the critical points of &, we compute the Hamiltonian vector fields (Exer- 
cise 10.8.5): 


Xu (x,y) = (y, es — (llyll? — x-e3)x), 
X7(x, y) = ((—72, *1, 0), (—yz, 1,0)). (10.8.14) 


The critical points of X q are 


m. = (0, 0, +1, 0, 0, 0). (10.8.15) 


At these points, we have 


Himi)j=+1, F(mh)=0,  X7(nl)=0: 


The points where X 7 and X y are parallel are determined by the equations 


—x2=hy1, y2= —A([ly ||? — x3)x1, 
xj~=Ay2, yi = —A(lly |? — x3)x2, 
O=Ay3, 0=—1— (Ilyl|? — x3) x3, 


with A € R. First, we conclude that A? = —x3, that is, every A € [—1, 1] defines a 
horizontal plane intersecting S? in the critical point m°. for A = +1 and ina 1-sphere 
otherwise. In spherical coordinates 


x; =cos¢sinv, x2 =singsin?v, x3 =cosv, 


Resetting H = Hand J=J. 
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this 1-sphere is parameterized by the angle @. In these coordinates, the solution of 
the above system of equations reads 


sind sing sin? cos@ 


= +—____— = +—_____., =0. 10.8.16 
se /— cost = V¥— cost _ 
The corresponding values of the constants of motion are 
Ha cod J(9) ga oore (10.8.17) 
= —cosv — ; = xi ——-.. 0. 
2 2cos 0 V/— cost 


This is the parameter presentation of a curve s +> y(s), s =cos%, in the space 
of values of the energy-momentum mapping &. The union of this curve with the 
isolated point (H (m4), J (m°.)) = (1,0) yields the set of critical values of &, see 
Fig. 10.8. The isolated point is obtained for the parameter value cos? = 1. The 
curve y can be parameterized by the value j of J or by the value h of H. Let 
hj and jy denote the corresponding value of the other constant of motion. While 
the function j +> hj cannot be expressed in terms of elementary functions, for the 
function h +> jp one finds 


p 
jn=45(3—h° + hvh? +3) h+vh24+3 (10.8.18) 


(Exercise 10.8.6). It remains to determine the set of regular values of &. For that 
purpose, we note that H can be written in the form 


y2 2 
H= ~ + a +cosv. 
2sin°-? 2sin* 


For given j, the critical points of H; 71(;) are determined by the equations 


y3 = 0, j- cos? +sin* o =0. 


Comparison with (10.8.16) and (10.8.17) shows that the latter equations define a 
critical point with critical value h;. From H — oo for cos} — +1 we conclude 
that this critical point is a minimum. Thus, the regular points of & are located above 
the curve y and, therefore, the image of the energy-momentum mapping is given by 


&(TS*) = {(h, j) €R?:h>hj, j eR}. 


Proposition 10.8.11 The level sets of the energy-momentum mapping & for the 
spherical pendulum have the following topological structure. 


. Forh > —-1,\|j| < |j(h)| and (h, j) € C, 0), the level set E—'(h, J) is a 2-torus. 

. Forh > —\ and j =+j(h), the level set €~|(h, j) is a 1-sphere. 

. For (h, j) = (-1, 0), the level set & "th, J) consists of a single point. 

. For (h, 7) = (1, 0), the level set E-'(h, J) is a 2-torus, in which a longitudinal 
circle has been contracted to a point. 


BRWN Fe 


The results are summarized in Fig. 10.8. 
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Fig. 10.8 Image of the 2 h 2 
energy-momentum mapping 

& of the spherical pendulum 

(shaded region). The 

bifurcation set Ag consists 1 
of the boundary and the point 

(j, 4) = (0, 1). The numbers 

refer to the cases 1-4 of 

Proposition 10.8.11 


@4 


Proof 1. By Remark 10.6.8/2, the fibres of the natural projection 77;: I Gr 
M 6, are 1-spheres and M 6, may be identified with the subset of IR? defined by the 
relations (10.6.49). Expressing the reduced Hamiltonian (10.6.51) in terms of the 
invariant polynomials 01, 03, 


1 
AH; =-—03+0}, 
i~9 3 
we find that these relations can be written in the form 


2 
2(H; — 01)(1—o7) —03 — j? =0, Hj> +o. loi] <1. — (10.8.19) 


This means that for every regular value h, the level set H; is the 1-sphere which 
forms the boundary of the 2-disc defined by the two inequalities in (10.8.19). Since 
this implies that the bundle E(h, Do He (A) is trivial, (10.8.2) yields the as- 
sertion. , 

2. By (10.8.16) and (10.8.17), the level set can be parameterized by ¢ as follows: 


sinvsing  sinvcosd ) 
coe 0). 
J/—cost? /—cosd 

3. This is obtained from point 2 by taking the limit sind — 0. 


4. According to Remark 10.6.8/2, for regular points of Mo, the preimages of the 
projection 


pr (sino cose, sin? sing, cos 0, 


mo: J~!(0) > Mo 


are |-spheres. For the two singular points p+ = (1,0, 0) of Mp, they consist of 
the corresponding single points Mo = (0,0, +1, 0, 0,0). Since hh = 1 = 503 +o}, it 
follows that only p+ lies in the image of zo. This completes the proof. 


To conclude, we stress that the occurrence of tori in the above models is no 
accident, because all of them are integrable systems. This will be the topic of the 
next chapter. 
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Exercises 

10.8.1 Prove Proposition 10.8.2. 

10.8.2 Study the properties of the Hamiltonian flow of the Kepler problem on the 
level sets given by Eqs. (10.8.5) and (10.8.6). 

10.8.3 Study the topological structure of the energy-momentum mapping of the 
Euler top for the case A3 <A2 =A}. 

10.8.4 Discuss the structure of the foliation of the level set J~!(Lo) for the Euler 


10.8.5 


10.8.6 
10.8.7 


top. Find out how the fibres fit together to constitute J~!(Lo) for a fixed 
value Lo. 

Hint. One possible strategy consists in using J~!(Lo) = SO(3), together 
with the fact that the group manifold of SO(3) can be viewed as a ball of 
radius z with identified antipodes. 

Compute the Hamiltonian vector fields generated by the Hamiltonian H and 
the momentum mapping J of Example 10.8.10. 

Verify Formula (10.8.18). 

Calculate the critical points of the Hamiltonian (10.8.13) and the correspond- 
ing Morse indices. 


Chapter 11 
Integrability 


In this chapter, we study the concept of integrability of Hamiltonian systems in a 
systematic way. We start with the very notion of an integrable system and with a 
number of examples: the two-body problem, the two-centre problem, the top, the 
spherical pendulum and the Toda lattice. In Sect. 11.2, we analyze Lax pairs in the 
context of Hamiltonian systems on coadjoint orbits. In particular, we show that the 
Toda lattice can be understood in this framework. In Sects. 11.3 and 11.4 we analyze 
the local geometric structure of integrable systems. We prove the Arnold Theorem, 
discuss the relation with symplectic reduction and present the construction of local 
action and angle variables in detail. Thereafter, we construct action and angle vari- 
ables for a number of examples. We also show that action and angle variables are 
very well adapted to the study of small perturbations of integrable systems. In this 
context, we meet another application of KAM theory. In Sect. 11.7 we give an intro- 
duction to global aspects in the spirit of Nekhoroshev and Duistermaat, with some 
emphasis on monodromy. This phenomenon will be illustrated in detail for the case 
of the spherical pendulum. Finally, we present a generalization to the concept of so- 
called non-commutative integrability in the sense of Mishchenko and Fomenko. We 
prove the Mishchenko-Fomenko Theorem and illustrate it for the case of the Euler 
top. 


11.1 Basic Notions and Examples 


Definition 11.1.1 A Hamiltonian system (M,w, H) of dimension 2n is called in- 
tegrable! if there exist m constants of motion H; = H, M2,..., Hy satisfying the 
following conditions: 


1. The functions Hj are in involution, that is, {H;, H;} =0 for alli, j=1,...,n. 
2. The subset of regular points of the mapping # = (M1,..., Hn): M > R" is 
dense in M. 


‘Or completely integrable. 
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For an integrable system we will write (M, w, Hi,..., H,) or (M, w, #). Recall 
that the set of singular points of the mapping 7 is denoted by M yw and that the 
set of regular points is denoted by M # Thus, we have the disjoint decomposition 
M=M*UM yw, where M FH is an open and dense submanifold of M. 


Remark 11.1.2 


1. The Hamiltonian vector fields Xj, span a distribution on M, which will be de- 
noted by D* . Since [Xn,,X H;] = XH, H)} = 0, this distribution is involutive. 
By Proposition 3.2.13/2, its rank is constant along the integral curves of the 
Hamiltonian vector fields X q,. Hence, Theorem 3.5.10 implies that D* is inte- 
grable and Proposition 3.5.21 yields that the integral manifolds form a foliation 
of M. Since w(Xq,, XH;) = {H;, H;} =0, the distribution D*# is isotropic and 
hence its integral manifolds are isotropic submanifolds of M. 

2. The open submanifold M # of M consists of the points where the differentials 
dj and hence the Hamiltonian vector fields X, are linearly independent, that 
is, where D™ has the maximal rank n. Since the rank of D™ is constant along 
the integral curves of the Xq,, M is invariant under their flows. In particular, 
the dynamics reduces to M # Moreover, the restriction 


JE. M* + R" Cit Alay 


of # to M* is a submersion.” 

3. The restriction D” of D* to M* isa regular distribution of rank n. Since 
it is isotropic, it is Lagrange. Moreover, due to XH; (H;) = {H;, Hi} = 0, we 
have D% C ker. %' and by counting dimensions, we find that equality holds. 
Hence, according to Example 3.5.4/4, the maximal integral manifolds of D* are 
given by the level set components of .#;. In particular, the latter are Lagrangian 
submanifolds of M”. To summarize, the level set components of .#%; form a 
Lagrangian foliation of M H which is generated by the Lagrangian distribution 
D* and which is invariant under the flows of the Hamiltonian vector fields X Hi: 
In particular, the dynamics reduces to these level set components. 


Obviously, every autonomous system with one degree of freedom is integrable. 
Moreover, by Proposition 9.1.10, in an autonomous system, every constant of mo- 
tion is in involution with H. In particular, an autonomous system with two degrees 
of freedom is integrable if only it admits a second constant of motion, functionally 
independent from H. In higher dimensions, integrability is rare. We start with a 
number of examples. 


Example 11.1.3 (Two-Body Problem) The two-body problem has n = 6 degrees of 
freedom given by the position vectors q, and qp of the two bodies, that is, the phase 


Beware that the set of regular values of .# is contained in the set of values of 7% but need not 
coincide with the latter. 
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space is M = T*(IR? x R3\ Ap3), where Ap; denotes the diagonal. For definiteness, 


we consider the Hamiltonian? 
2 2 
Pi Pp k 
H (pi, p2, qi, 42) = + ; (11.1.2) 
Dee omy” ene lai Gal 


By Galilei invariance, we have 10 constants of motion: the total momentum P, the 
total angular momentum L, the total energy F and the vector MR — tP, with M 
denoting the total mass and R being the position vector of the centre of mass. By 
separating out the motion of the centre of mass, for the relative motion we obtain 
the phase space T* (R3 \ {0}) and the constants of motion 


_p_k _ 
Aye) = — — —., Lrel = q X p. 
2 |Iqil 
Here, 
mim ; 
=" q=q1—- q, P=/q 
m, +m? 


are, respectively, the reduced mass, the relative position and the relative momentum. 
One easily checks that 


P Ga 1. “Caan (11.1.3) 


are constants of motion in involution and that their differentials are independent on 
a dense subset of M (Exercise 11.1.1). Thus, the 2-body problem is integrable. 


Example 11.1.4 (Two-Centre Problem) It turns out that already the 3-body problem 
is in general not integrable. Let us restrict our attention to the case where the mass 
of one of the three bodies is assumed to be so small that it does not influence the 
motion of the other two bodies. This approximation leads to the so-called two-centre 
problem,* where one studies the motion of a body in the field of two fixed force 
centres. The Hamiltonian of this model is given by” 


pe a, (11.1.4) 
with p denoting the momentum and 7; denoting the distances of the body from 


the two centres. For simplicity, we have set the mass of the body equal to one. 
Coulson and Joseph have shown that the two-centre problem can be treated in the 


3The Kepler potential can be replaced by any central force potential. 


4Some authors call this the Three-body problem of Euler, because Euler solved it first and pub- 
lished it in his memoirs in 1760. 


5It turns out that this is a good approximation provided the electromagnetic forces between the 
particles dominate. It is not a good approximation if the gravitational forces dominate, see Sect. 4.3 
in [286] for a discussion of this point. We also refer to §47 of [18] for an application to the motion 
of the moon in the gravitational field of the earth. 
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same way for any number n of degrees of freedom [68]. Following these authors, we 
choose Cartesian coordinates qg; such that the two centres are located at the points 
a= (0,...,a) and —a. Then, the configuration space is Q = R” \ {a, —a} and the 
distances of the body from the centres are given by 


n—1 n—1 
=\S@t+in—a, r= SY \gtqta”. G15) 


i=l 


e 


Angular momentum is represented by the skew-symmetric n-dimensional matrix 


Lij =i Pj — 4jPi- (11.1.6) 
We define 
P=ViLli, Li= Do Li, (11.1.7) 
i<j ) i<j<n , 


By direct inspection, one can check the following (Exercise 11.1.2). 


(a) The angular momentum components L;; with i, j <n are constants of motion. 
(b) The quantity 
Ae A ae, Mi & (11.1.8) 
2 if r| i) 
is a constant of motion which Poisson-commutes with all components L;;, 


i, j <n, and hence with ice The reader can find the relation of A to the Lenz- 
Runge vector of the Kepler problem by sending one of the centres to infinity. 


We conclude that the quantities H, A and oe are constants of motion in involu- 
tion. For n > 3, their differentials are linearly independent on a dense subset of T* Q. 
For n = 2, the so-called planar two-centre problem, this still holds for H and A but 
one has L; = 0. As a result, for n = 2 or 3, the two-centre problem is integrable. 


Example 11.1.5 (Top) We take up Example 10.6.5. Recall from there that the phase 
space of a top is 


T*SO(3) = SO(3) x s0(3) 


and that in the special case of the Euler top, the equations of motion have the form 
of a Lax equation for the Lax pair (L, w). We continue to identify s0(3) with R? via 
the isomorphism (5.2.6), denoting the vector corresponding to a Lie algebra element 
A by A. Here, we consider the more general case of a top in a constant® external 
force field f. The most important example of this type is, of course, the gravitational 
field f = —mgnz of the earth. Before turning to the discussion of integrability, let us 
derive the Hamilton equations and the Poisson structure. 


With respect to the inertial frame. 
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In the body frame, the external field is given by the vector 
F=a'f 
and the potential has the form 
Via)=-F-S, 
where S denotes the position vector of the centre of mass. On the level of the Lie 


algebra, the relation between F and f reads F = Ad(a~') f and the Hamiltonian 
function takes the form 


1-1 
H(a,L)= 50 (L, L) — (F, S), (11.1.9) 
where (-,-) denotes the Ad-invariant scalar product defined by 
1 
(F,S) = 5 (FS) =F-S. 


The Hamilton equations for this system are given by (9.2.8). To find their explicit 
form, under the identification of s0(3)* with so0(3) induced by (-,-) we compute 


d 
((Li,)' ((H1)),). ASH A) = anh (Ad(exp(—rA)) F, S) = (ad(F)S, A). 


0 
Hence, 
T 
(Li) ((Hz)),) =ad(F)S 
and the Hamilton equations are given by 


a 'a=O7!L, L=~—ad(@7'L)L —ad(F)S. (11.1.10) 


The first equation is called the Poisson equation and the second one is called the 
Euler equation. In vector notation it is given by 


L=Lx@+S~xF. (11.1.11) 


Since S x F = N is the torque acting on the system, the Euler equation is equivalent 
to the angular momentum balance equation in the body frame, 


L+@xL=N. (11.1.12) 


To find constants of motion in involution, we determine the Poisson structure. For 
that purpose, we view L and F as mappings SO(3) x s0(3) > so(3), given by 


L(a, A)=A, F(a, A) = Ad(a“') f. 
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Let L; and F; denote the corresponding coefficient functions with respect to the 
basis {e; = IR of Example 5.2.8. Using the method of Example 9.2.2, we compute 
the Hamiltonian vector fields 


X 1; (a, A) = (Li,e:, —ad(e;) A), XFrj(a, A)= (0, ad(F)e;) (11.1.13) 
(Exercise 11.1.3) and read off the Poisson brackets from (9.2.10), 
{L;, Lj} = e;;*Le, {F;, Fj} =0, {Li, Fj} = €ij* Fe. (11.1.14) 


Remark 11.1.6 Due to (11.1.14), the subspace of C° (SO(3) x s0(3)) spanned by 
the functions L; and F; forms a Lie subalgebra. This Lie subalgebra is isomorphic 
to the Lie algebra of the Euclidean group E(3) = SO(3) « R°?, see Exercise 8.4.2. 
One can check that the Casimir functions’ || F'||* and (F, L) Poisson-commute with 
all the generators L; and F; and that they label the coadjoint orbits of E(3) (Exer- 
cise 11.1.4). 


Now, we turn to the discussion of integrability. Since the system is autonomous, 
the Hamiltonian 


h:i=dH 


is a constant of motion. Besides that, the projection 
h:=L-e, (11.1.15) 


of angular momentum to the direction e := ||F||~'F of the external field F in the 
body frame should be a constant of motion, because it corresponds to the symmetry 
of this model under rotations about the axis defined by F. Indeed, since N - F = 0, 
the angular momentum balance equation (11.1.12) yields 


, ; d 
Da NseS teres ale bee (lise), 


A third constant of motion which is in involution with J» has been found in the 
following special cases. For simplicity, we assume that the body frame is chosen in 
such a way that the tensor of inertia O is diagonal, 


Oo 0 0 
O=|0 ® Oo 
0 0 © 


This is only relevant in the cases (b) and (c). 


7In Lie algebra theoretic terms, the quantities || F \|? and (F, L) are the quadratic Casimir operators 
of the Lie algebra of E(3) = SO(3) x R? in the representation defined by L;, Fj. 
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(a) The Euler top: this is the situation discussed in Example 10.6.5. In this case, 
either S = 0, that is, the fixed point coincides with the centre of mass, or F = 
0. By scalar multiplication of Eq. (10.6.36) with L we obtain the additional 
constant of motion 


h=L?. 


According to (11.1.14), J) and J; are in involution. Note that in the case F = 0, 
for e one can choose an arbitrary unit vector in R?, 

The Lagrange top: here one assumes ©; = ©, and S = (0, 0, S3). By the ax- 
ial symmetry, the additional constant of motion in involution is given by the 
projection of angular momentum to the symmetry axis of the inertia tensor: 


(b 


wm 


h=L-e. 


(c 


Nee 


Kovalevskaya top: here one puts O; = O2 = 203 and S = (Sj, S2, 0). Without 
loss of generality, we may assume that the centre of mass vector is parallel to 
e;. Then, the additional constant of motion in involution is given by 


= (L}-134+28@,F\)” + 4(LiL2 + SQ, Fa)’, 


where S is the distance from the fixed point to the centre of mass. 


We leave it to the reader to check that in each case, J; is a constant of motion 
and that the differentials of the functions in involution are linearly independent on a 
dense subset (Exercise 11.1.5). Let us add that in a completely analogous way, one 
can discuss the top in an ideal fluid. Here, a number of integrable situations exists 
as well, see Sect. 2.2. in [237]. 


Remark 11.1.7 Since the model is symmetric under SO(2)-rotations about the axis 
given by the external field F, the configuration space reduces to the two-sphere S” 
and since Jy = L- e is aconstant of motion, the dynamics in phase space reduces to 
the 4-dimensional level sets 


Z) = {(e,L) €R°: lle =1,L-¢ =I}. (11.1.16) 


Via the mapping 
®: > TS*, (e,L) (e,L—le), 


each of the level sets 5) is diffeomorphic to TS. This way, these level sets become 
symplectic manifolds. As a consequence of the fact that the Casimir functions || F || 
and (L, F) label the coadjoint orbits of the Euclidean group E(3), cf. Remark 11.1.6, 
2 is symplectomorphic to such a coadjoint orbit. Moreover, one can show that 27 
corresponds to the reduced phase space at momentum level / obtained by symplectic 
reduction of the Hamiltonian SO(2)-manifold which arises from that of the Euler 
top, discussed in Example 10.6.5, by restricting the group action to the subgroup 
SO(2) of rotations about the axis defined by F. 
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Example 11.1.8 (Spherical pendulum) Recall from Example 10.6.7 that the spheri- 
cal pendulum has the phase space 


TS* = {(x, y) € TR?: |x| =1,x-y=0} 
and that the Hamiltonian function is 
1 2 
A(x,y) = 5lyll +X-e3. 


A further constant of motion is given by the momentum mapping of the SO(2)- 
symmetry of this system: 


J (x,y) =x1y2 — x2y1. 


The subset of TS? where dH and dJ are linearly independent coincides with the 
complement of the set of critical points of the energy momentum mapping. In Ex- 
ample 10.8.10, we have seen that this set is dense. Thus, the spherical pendulum is 
integrable. 


Example 11.1.9 (Toda Lattice) The non-periodic Toda lattice is the Hamiltonian 
system on T*R" defined by 


ix 9 ae a(gk—gkt! 
Ae (ge-g") 
ae Sag a ; (11.1.17) 


where q', p; denote the standard bundle coordinates. It describes a linear molecule 
consisting of n atoms with exponential nearest neighbour interaction. The Hamilton 
equations are 


q* = Pk. ES Meds 
De = 2021-9") — 292a* a"), Ba 3.1, 11018) 
pi= —2¢24'-@°) Pn= Ie2(q""!-4"). 


Since )~;_, Px = 0, we can separate out the motion of the centre of mass by passing 
to the rest frame. Then, a pi = O and the relative motion can be described in terms 
of the new variables 


k_k+1 
aac T. be = Pk 
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fulfilling }“7_, bg = 0. Define 


b} a O 0 
a, bo 
ba\g: %. %, % oe Is 
: bn—-1 An-1 
O --++ O ay_-1 bn 
0 a O 0 
-—a, 0 
M=) § *, *%, Pa 0 
: . 0 An-1 
O ++ O -an-1 0 


We show that the Hamilton equations (11.1.18) are equivalent to the Lax equation 


L=[L, M}. (11.1.19) 
For that purpose, we calculate 
—2at aj(b} —bo) 0 1 0 
aj(b} — by) 2a? — 2a¥ 
[L, M]= 0 oh "e 7 0 


2a —2a?_, an (bp—1 — bn) 


0 aoe O  ay—1(bn—1 — bn) +2a?_, 


and read off that (11.1.19) corresponds to the following system of equations: 


a = ag (be — beat), k=1,...,n—1, 
by = 2a?_, — 2a?, k=2,...,.2-2, (11.1.20) 
by = —2a?, bn = 2a? 5: 


By direct inspection, one can check that this system is equivalent to (11.1.18). Due 
to 


“ tr(L*) = ktr(LL‘') = ktr([L, MJL*“') = tr([M, L*]) =0, 
the quantities 


i 
k= zp u(L') (1.1.21) 
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are constants of motion for every k. Since the coefficients of the characteristic poly- 
nomial of L can be expressed in terms of the traces tr(L*), the eigenvalues of L 
are constants of motion, too. In the next section, we will show that the /; are in 
involution. Moreover, one can check that the differentials d/o, ...,d/, are linearly 
independent on a dense subset of T*IR” (Exercise 11.1.6). We conclude that the Toda 
lattice is integrable. Finally, let us calculate 7; and Jo: 


n n 
h= ~ b= ~ Pr =O0 (total momentum), 
k=1 k=1 


n-1 


1 n 4 
h= 5 db + dai =H (energy). 


Exercises 

11.1.1 Show that the six constants of motion of the two-body problem given by 
(11.1.3) are in involution and that the differentials of these functions are 
linearly independent on a dense subset. 

11.1.2 Prove the statements (a) and (b) in Example 11.1.4. 

11.1.3 Prove Formulae (11.1.13) and (11.1.14). 

11.1.4 Verify that the Casimir functions || F||? and (F, S) of Example 11.1.5 com- 
mute with all generators of the Euclidean group E(3) and prove that they 
label the coadjoint orbits of E(3). 

11.1.5 Show that both for the Lagrange and for the Kovalevskaya top, the quantity 
Iz is a constant of motion in involution with J) and check the linear inde- 
pendence of the differentials of H, I> and Jz. 

11.1.6 Determine the subset of T*IR” where the differentials of the constants of 
motion J, ..., J, defined by (11.1.21) are linearly independent. 


11.2 Lax Pairs and Coadjoint Orbits 


As we have seen, sometimes the Hamilton equations can be written in the form of a 
Lax equation 


L=[L,M] (11.2.1) 
with L and M being matrices whose entries are functions of positions and momenta. 
As noted above, the quantities J, defined by (11.1.21) are constants of motion, and 
so are the eigenvalues of L. One says that the motion is isospectral. Thus, if one can 
find a sufficiently large number of invariants of this type which are in involution and 
functionally independent, the system is integrable. 


Remark 11.2.1 Assume that t > L(t) is a solution of the Lax equation (11.2.1) with 
initial value L(0). Then, there exists a smooth matrix-valued function tf +> g(t) such 
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Li)=g()LOgit)', Mi=-s@gt)', gO)=1. (11.2.2) 


Indeed, the function g is found by solving the initial value problem for the second 
equation.® Then, by direct inspection, one checks that the matrix-valued function 
g(t)L(0) g(t)! solves the Lax equation for the initial value L(0). Now, by unique- 
ness, the first equation in (11.2.2) follows. Thus, one can always seek the solution 
of the Lax equation by making the ansatz L(t) = g(t) L (0) g(t)! . This procedure is 
called the method of isospectral deformation. Finally, we note that (11.2.2) imme- 
diately implies 


1 
i, = — tr(L(0)*). 
k 
This is an alternative argument showing that the J; are constants of motion. 


In this section, we discuss aspects of integrability for Hamiltonian systems on 
coadjoint orbits in the dual space g* of a Lie algebra g. Recall from Sect. 8.4 that the 
coadjoint orbits in g* are the symplectic leaves of the Lie-Poisson structure (8.2.18) 
on g*, and that their symplectic form is given by the (positive) Kirillov form (8.4.2). 
In this context, there exists a variety of constructive methods [237]. Here, we discuss 
one of these methods and apply it to the study of the Toda lattice. The setup is as 
follows. Let G be a Lie group with Lie algebra g, let g be a Lie subalgebra and let G 
be the connected Lie subgroup of G associated with g. Assume that there exists a 
Lie subalgebra € of g such that 


g=geet 


(direct sum of vector spaces). Then, the dual spaces g* and €* can be naturally iden- 
tified with €° and g°, respectively, and thus give rise to the induced decomposition 


=o Oe. 


Denote the natural projections corresponding to these decompositions by zg, 7¢ 
and 7g*, 7~«. Below, for the adjoint and coadjoint representations of G and g, we 
will use the conventional notation Ad etc., whereas for the adjoint and coadjoint 
representations of G and g, we will occasionally use the notation Ad ete. Finally, 
recall from the discussion of Formula (8.2.18), that for a smooth function f on g, 
the exterior differential df can be viewed as a smooth mapping df : g* > g. 

The following result belongs to Kostant [171] and Symes [284].? 


8Note that M(t) depends on L(t). 
°The statement can be formulated entirely on the level of Lie algebras. To see this, reformulate 


the requirement of Ad’ -invariance in terms of ad’, drop G and replace G by the Lie subgroup of 
GL(g) generated by the automorphisms exp(ad(A)), A € g, defined by the exponential series. 
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Proposition 11.2.2 Under the assumptions made, the following holds. 


1. If f and g are Ad’ -invariant smooth functions on g*, their restrictions to g* are 
in involution with respect to the natural Poisson structure on g*. a 

2. For a Hamiltonian function H on g* which is obtained by restriction of an Ad’- 
invariant function H on g*, the Hamilton equations have the form 


b =ad" (1e(dA(v)))v. (11.2.3) 


According to Example 9.2.3, every coadjoint orbit of G is invariant under the 
flow defined by H and (11.2.3) is the Hamilton equation on this orbit. 


Proof 1. Let f,% € C™(§*) and let f := figs and g := g}g» be the restrictions. By 
(8.2.18), 


{f, 8) :=(v, [df da]), 
where df and dg are viewed as mappings g* — g. Writing 
dif=ngodf:§*>g, df=meodf:g>t 
and using that Ad’ -invariance implies 
(v,{df, XI) =(v,[dg, X]}=0 forall X eg, (11.2.4) 
we find 
{f, ghar(v) =(v, [df (v), dg(v)]) = (v, [di FO), di 8)]) = (v, [do FO), d28(0)]). 


Since € is a Lie subalgebra, [d2 f(v), d2g(v)] € €. Since v € g* = €°| the right hand 
side vanishes. 

2. A smooth curve t +> v(t) in g* is an integral curve of the Hamiltonian vector 
field X 7 generated by H iff 


(Xu (v(t), df(v@)) = (6, df (vO) 


for all f € C°(g*) and all t. While, originally, the pairing is that of tangent vectors 
with covectors, by the usual identifications, we can interpret it as the pairing of 
elements of g* with elements of g and rewrite the left hand side as follows, denoting 
w=v(t): 


(XH (uw), df (uw) ={H, f}(w) = (u, [dH (Hw), df (2) ]) =(u, [di 4H), df ()]). 


In the last expression, df (jz) is viewed as an element of g and the bracket is that of 
g. Using (11.2.4), we can rewrite the last expression as 


—(u, [do (u), df ()]) = (ad" (do (w)) uM, df (W))- 


This yields the assertion. 
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According to Remark 5.4.11/2, in case g admits a non-degenerate symmetric 
Ad-invariant bilinear form k, one has a natural isomorphism F : g > g* of repre- 
sentations. Then, F~! maps g* and €* to their mutual k-orthogonal complements in 
g and the Hamilton equation (11.2.3) takes the form of the Lax equation (11.2.1) 
with 


L=v, M =—n¢(dH(v)). (11.2.5) 


Example 11.2.3 (Toda Lattice) Using Proposition 11.2.2, we analyze the Toda lat- 
tice. On the way, we use a number of facts about the Lie algebra sl(n, R) of traceless 
real (n x n)-matrices. We leave it to the reader to check them (Exercise 11.2.1). 

Let G = SL(n, R), the group of real (n x n)-matrices of unit determinant, let 
g C sl(n, R) be the Lie subalgebra of traceless upper triangular (n x n)-matrices, 
and let = so(n) C sl(n, R) be the Lie subalgebra of real skew-symmetric (n x n)- 
matrices. As a vector space, 


g=g@F. 


To write down the projections zg and ze, observe that every A € g can be decom- 
posed uniquely as A = A; + Ag + A,, where Aq is diagonal and A; and A, are 
lower and upper triangular with zero diagonal, respectively. We have 


Wg(A) = Ag+ A,+A;, me(A) = Aj — A}. (11.2.6) 


The trace form (A, B) + k(A, B) := tr(AB) on g is non-degenerate, symmetric and 
Ad-invariant. The corresponding isomorphism of representations F : g > g* iden- 
tifies g* with g, g* with the subspace S,,(R) of g of real symmetric matrices and €* 
with the subspace of upper triangular matrices with zero diagonal. The correspond- 
ing projections gx and gx are given by 


Hy(v) =va ++ yp, me«(V) = Vy — v (112-7) 


The Lie subgroup G of G generated by g is the identity connected component of the 
subgroup of upper triangular matrices with unit determinant. Since the diagonal en- 
tries of the elements of this subgroup must be nonzero, G consists of those elements 
whose diagonal entries are positive. Under the identifications made, the coadjoint 
action of G on g* = S,, (R) is given by 


Ad*(g)v =ag*(gvg~'), g eG, veg". (113.8) 
As the Hamiltonian H on g* = S,,(R) we take 


H=- tr(v?). 


By letting v range through the whole of g, we can extend H to a function H on 
g. A brief computation shows dH (v) = v, where v is viewed as an element of g. 
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Hence, (11.2.5) and (11.2.6) yield that the Hamilton equation is a Lax equation with 
L=v and 


M=v} —y. (11.2.9) 


Now let us restrict this equation to the coadjoint orbit of 


0 1 O ::- O 
1 O 
* 
vo=] 0 o|eg.- 
“y O 1 
0 0 1 0 


Using (11.2.7) and (11.2.8), we find that points on this orbit have the form 


by} ay O tee 0 
a, bo : 
v = gx(Ad*(g)v9) = Oc. cet, 0 » geéG, (11.2.10) 
bn-1 An-1 

0 0 ayn-1 by, 

where 
Ga, Peat. el: 
8kk 
and 
— cs — Skk+1 _ 8k—-1k . ne 8n—-1n 
S11 Skk &k—-1k-1 §n-1n-1 

with k = 2,...,n — 1. The real numbers a, and bx can take arbitrary values such 


that a; > 0 and ae , 5 = 0. In particular, the orbit has dimension 2(n — 1). Thus, 
according to (11.2.9), the Lax pair (L, M) representing the Hamilton equation on 
the coadjoint orbit of vo is given by 


bh a, O 0 
a, bo 
L=|] 0 0 |; 
Dn-1 An=1 
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0 a O tee 0 

—a, O : 

Ma) 9. **, a 0 
: : 0 An—1 

O ++ O -an-1 0 


We conclude that the Lax pair constructed this way coincides with the Lax pair 
obtained in Example 11.1.9. Hence, the substitutions 

ag =e), by = Pk 
yield the non-periodic Toda lattice model with the motion of the centre of mass 
separated out. Now, Proposition 11.2.2/1 immediately implies that the invariants J; 
given by (11.1.21) are in involution. This proves that the non-periodic Toda lattice 
is integrable. 


Remark 11.2.4 


1. We have shown that the phase space of the Toda lattice is symplectomorphic to 
a certain coadjoint orbit of the group of upper triangular matrices with unit de- 
terminant. This observation goes back to Kostant [171] and Adler [5]. The dual 
space of the Lie algebra of this group can be realized in different ways. Above 
we used the so-called symmetric Lax representation. Another representation is 
provided by the subspace of lower triangular matrices with zero trace. This yields 
the so-called non-symmetric Lax representation. For this and a number of gen- 
eralizations of the above construction to the case of coadjoint orbits of parabolic 
subgroups of simple Lie groups which yield generalized Toda systems, we refer 
to [237]. 

2. The periodic Toda lattice is defined by the Hamiltonian 


te 2 . aAgk—gkt! 
al (q-g) i = 
H= 5 2 P+ 2 e with gn41= 1. 


This system is integrable, too. For both the periodic and the non-periodic Toda 
lattice, the integration of the equations of motion can be carried out explicitly. 
It turns out that the dynamics of the non-periodic model is asymptotically free. 
Using a method proposed by Moser [223], the solution can be constructed re- 
cursively, see [140] and [237] for details. The solution for the periodic model is 
much more complicated, see [155] and [173]. 


While solving the equations of motion of an integrable system explicitly can 
be quite involved, these systems have an interesting geometric structure. A care- 
ful analysis of this structure shows that there exists a special sort of adapted local 
Darboux coordinates, called action and angle variables. Once such coordinates have 


584 11  Integrability 


been found, the integration of the equations of motion becomes trivial. This will be 
the topic of Sect. 11.4. 


Exercises 
11.2.1 Complete the discussion of the Toda model in Example 11.2.3 by showing 
the following. 

(a) As a vector space, sl(n, R) is the direct sum of the subspace of upper 
triangular matrices with zero trace and the subspace so(n) of skew- 
symmetric matrices. 

(b) The trace form k(A, B) := tr(AB) is a non-degenerate Ad-invariant 
symmetric bilinear form on sI(n, R). 

(c) The k-orthogonal complement of so(n) in sl(n, R) is given by the sub- 
space S,, (IR) of symmetric matrices. 

(d) Points on the coadjoint orbit of vo are given by (11.2.10). 

11.2.2 In the notation of Example 11.2.3, show that in the coordinates a;, by the 

Poisson structure on the coadjoint orbit of vo is given by 


{a;, bi} = —aj, {a;, bi41} =a), 


whereas all other Poisson brackets vanish. Using these relations, confirm 
that in these coordinates the Hamilton equations read 


a;={H,aj}, — b ={H, bi}. 


11.3 The Arnold Theorem 


In this section, we start to discuss the foliation of an integrable system (M, w, 7) 
defined by the level set components of the mapping #. We confine our attention 
to the restriction #% of 7 to the subset M™ of regular points of #, cf. (11.1.1). 
Recall from Remark 11.1.2 that the level set components of .#%; coincide with the 
integral manifolds of the regular distribution D*% on M#” spanned by the Hamil- 
tonian vector fields X y,. 

Belonging to the same level set component of #%- is an equivalence relation in 
M* Let V% denote the set of equivalence classes!” (the space of leaves of D*), 
endowed with the quotient topology, and let 


FE: M® ~ FH (11.3.1) 


be the natural projection, assigning to m € M % the level set component of .#%; 
containing m. Since a submersion is open, the values of .#%; form an open subset 
V™ of R” and the mapping 4: M” + V* decomposes into #% = #0 H, 
where 


FE. VF + VF (11.3.2) 


'0Sometimes referred to as the Reeb graph of the mapping .%. 
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assigns to a level set component of .# the corresponding value. In this section 
we prove the Arnold Theorem, which states that each level set component of 
A, on which the Hamiltonian vector fields Xy, are complete is diffeomorphic to 
T* x R"~* for some 0 <k <n. Here, T* denotes the k-dimensional torus. In the 
next section, we will show that the restriction of the foliation (11.3.1) to the subset 
Ve CV of compact level set components is a locally trivial fibre bundle with 
typical fibre T”. This will be a consequence of the existence of certain adapted coor- 
dinates, called action and angle variables. In Sect. 11.7, we will discuss topological 
aspects of this bundle. 
Let us start with the following classical result. 


Theorem 11.3.1 (Liouville) Let (M,w, Hi,..., Hy) be an integrable system and 
letm € M bea regular point of H# = (M,..., Hy). There exists an open neighbour- 
hood U of m and smooth functions G!,...,G" onU complementing H\,..., Hy to 
Darboux coordinates. In these coordinates, the flow ®' of the Hamiltonian vector 
field X y, is given by 


O}(G, AVG 02, O + tye OBiges My): (11.3.3) 


Proof Since the subset of regular points M H is open, we may search for U inside 
M*. Since D* is a regular distribution of rank n and since the Xq, commute, 
Remark 3.5.11/2 yields a local chart adapted to D* on some open neighbourhood 
U of m, whose first n coordinate functions G!,..., G” satisfy Xq, = Ogi. Obvi- 
ously, we can replace the last n coordinate functions of this chart by Mj,..., An. 
By construction, the local representative of the flow ®! is given by (11.3.3) and the 
Poisson brackets are 


Hoy, {aC =x (Ge) =e, 
A brief computation shows that the latter implies 
w= dH; AdG' +hdH; A dH; 


with smooth functions h‘/, uniquely determined by the condition h!/ = —h/’. By 
(11.3.3), the 2-forms dH; A dG’ and dH; A dH; are invariant under each of the 
flows &!. Since o is invariant under these flows, too, each of the functions h'/ must 
be invariant. Hence, the second term can be written as 


hdH; \dHj = 4*(kijdx' A dx/) 


where x! are the standard coordinates on R” and kj; are smooth functions on the 
open subset #(U) CR". Since d(h'/dH; \ dH;) = d(w — dH; A dG') = 0, we 
have 


KH A(kjjdx' A dx!) =0. 
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Since .# is a submersion, this implies that kjjdx! A dx/ is closed. Hence, by the 
Poincaré Lemma,!! there exists a 1-form a;dx' on #(U) such that 


kijdx! A dx! = d(ajdx'). 


We leave it to the reader to check that after replacing the coordinate functions G' by 
G' — a! o # we have w = dH; A dG‘, whereas (11.3.3) remains unchanged. 


Remark 11.3.2 


1. The functions G' provide coordinates on the level set components of .7%.. Since 
G', H; are Darboux coordinates, their Poisson brackets are 


{Hi,Hj)}={G',G/}=0,  {Hi, G4} = 4). 


Since the Hamiltonian is given by the coordinate function H;, the Hamilton 
equations take the following simple form: 


H,=0, i=1,...,n, Gl=1, Gi=0, i=2,...,n. 


Hence, the integral curves are given by (11.3.3) with i = 1. 

2. Since in the proof of Theorem 11.3.1 we can replace a by a + dd, with A being 
an arbitrary smooth function on R”, there is the following freedom in the choice 
of the functions G': 


, OAC 
GeRG+t+ ( ) 
0H; 
Now we discuss the structure of the level set components of .%;. 


Theorem 11.3.3 (Arnold) Let (M,o, #) be an integrable system and let 7 be a 
level set component of ;.. If & is compact, it is diffeomorphic to T". If 7 is not 
compact but the restrictions of the Hamiltonian vector fields X y, to X' are complete, 
it is diffeomorphic to TE x R"~* for some 0 <k <n. 


Proof It suffices to prove that if the restrictions of the Hamiltonian vector fields X y, 
to Y are complete, the latter is diffeomorphic to T' x R’~* for some 0 < k <n. 
By assumption, the flows &’ of the restrictions of the vector fields Xy, to Y are 
complete and commute with one another. Thus, they define an action of R” on ¥ 
by 


WwW R’x DoD, W(t, m) = (@) 0---0 O")(m), (11.3.4) 


cf. Example 6.1.2/4. We show that this action is transitive. First, we observe that W/, 
is invertible for every m € 2’, because the vector fields X y, are linearly independent 


'lWe can assume .#(U) to be contractible. 
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Fig. 11.1 Transitivity of the action W in the proof of the Arnold Theorem 


on 2’. Hence, the orbit mapping W,,: R” — defined by m restricts to a diffeo- 
morphism from some open neighbourhood V,, of the origin of R” onto an open 
neighbourhood U,, of m in 2. Now, let mo and m be arbitrary points in 2’. Since ¥ 
is connected, we find a curve y : [0, 1] ~ » from mo to m. Since [0, 1] is compact, 
we find points m,,...,m, on y such that the subsets U,,,,..., Um, form a covering 
of y. By an appropriate reordering we may assume that Uj», 1 Um;,, is nonempty 
and hence contains a point p; for alli = 1,...,7 — 1. Put pp =mo and p,; =m. 
For i = 1,...,r, there exist unique s;,t; € Vn, such that Wp, (s;) = pi-1 and 
Win; (ti) = pi, see Fig. 11.1. Then, %,—s,(pi-1) = Ys; 0 Ws, (mi) = A, (mi) = pi 
and hence 


Wr, —s)4---+t, 5, (mo) =m. 


Thus, the action W is transitive, indeed. Now, the Orbit Theorem 6.2.8 yields that 
» is diffeomorphic to the homogeneous space given by the quotient of IR” by the 
common” stabilizer of the points of D. Since (Wn) ;v,, is injective, the stabilizer 
is a discrete subgroup of R” and hence isomorphic to the integer lattice generated 
by k linearly independent elements of R”, where 0 < k <n. By complementing the 
generators to a basis of R”, we finally obtain 5 = T* x R’-*. 


Motivated by the proof of the Arnold Theorem, let us discuss the special case 
where the Hamiltonian vector fields Xj, are complete on the whole of M. In this 
case, there is a deep relation between integrability and symplectic reduction. Since 
the vector fields X 7, and hence their flows &' commute, the mapping 


W:R"xM>M, — Wit,m):=(#), 0---0 87) (m) (11,2;5) 


!2Since R” is Abelian, all stabilizers along an orbit coincide. 
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is an action of the additive Lie group R” on M. In the local coordinates G', H; 
provided by Theorem 11.3.1, this action reads 


W(t,m) =(Gl(m) + t,...,G"(m) +t, Hi(m),..., Hn(m)). (11.3.6) 


Since each of the flows ®! leaves w invariant, this action is symplectic. We show 
that # can be viewed as a momentum mapping. For that purpose, recall from Ex- 
ample 5.2.10 that the Lie algebra of G = R” is given by the vector space g = R” 
endowed with the trivial Lie bracket. Thus, if we identify g* with IR” via the canon- 
ical scalar product, we can view # as a mapping 


J:M> 4g", J(m):= #(m). 


Next, we calculate the Killing vector field of x € g. On the one hand, by Exam- 
ple 5.3.15, we have exp(sx) = sx for all s € R and hence 


d n 
Helm) = Fe ox) = Be (m) = Xn x, (0): 


On the other hand, 
n 
Jg(m) =(J(m), x)=) xi Hi(m). 


Thus, x, = Xj,, and J = # is a momentum mapping for W, indeed. It is trivially 
equivariant, because Proposition 9.1.10 implies that J is constant on the orbits of 
W. Let us add that the fact that # is a momentum mapping for WY implies, in par- 
ticular, that the distribution D8 spanned by the Killing vector fields of YW coincides 
with the distribution D7” generated by Hamiltonian vector fields X ,. Hence, the 
Orbit Theorem 6.2.8 implies that the orbits of W in M % coincide with the level set 
components of # in M # This yields an alternative proof of the Arnold Theorem 
in this special situation. 

Now, consider the invariant open submanifold 4 © of the regular points of #. 
Since J is a submersion on M”, every value is regular. Since the coadjoint action 
is trivial, the stabilizers Gp of the values h € g* = R"” of J coincide with G = R”. 
While we cannot directly apply the theory of regular symplectic reduction as dis- 
cussed in Sect. 10.3, because the action W need not be proper, we can nevertheless 
form the topological quotient J~!(h)/Gh. Since the Gy-orbits in J~! (h) coincide 
with the connected components of J~!(h), this quotient is a discrete space consist- 
ing of at most countably many isolated points. Hence, it is trivially a symplectic 
manifold and can be interpreted as the reduced phase space at momentum h. Let us 
summarize. 


Proposition 11.3.4 Let (M,w, M,..., Hn) be an integrable system. If the Hamil- 
tonian vector fields X y, are complete, the RR" -action (11.3.5) endows (M, w) with 
the structure of a Hamiltonian G-manifold with equivariant momentum mapping 
given by 7 = (HM, ..., Hy). For the invariant open submanifold M#, the reduced 
phase spaces are discrete and at most countable. 
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Remark 11.3.5 Let &’ bea level set component of .#;. and assume that the restriction 
of the Hamiltonian vector fields Xq, to X' is complete. Then, the action (11.3.4) of 
R” on ¥ endows the latter with an affine connection,'* that is, with a globally 
defined prescription for the parallel transport of tangent vectors from a point mo to 
another point m, along a curve connecting them. This affine connection turns out 
to be flat, meaning that the result of the parallel transport does not depend on the 
connecting curve. By parallely transporting every tangent vector to the tangent space 
Ting & at some chosen point mo, one can represent vector fields on &' by smooth 
mappings X' — T,,,, 2’. Vector fields represented by constant mappings are called 
constant vector fields. They form a vector space which is isomorphic to T,, 2’. By 
virtue of the basis {X 4, (mo), ..., XH, (mo)} in Tm, X’, we can identify this vector 
space with R” in a natural way. As noted in the proof of the Arnold Theorem, all the 
stabilizers of the action of R” on » coincide with a certain integer lattice generated 
by k linearly independent elements of IR”. We refer to this lattice as the period lattice 
of the mapping # at the value h € R” and denote it by Pyv(2). Under the above 
identification, P v(2’) corresponds to a subgroup of the additive group of constant 
vector fields. 


11.4 Action and Angle Variables 


By the Arnold Theorem, compact level set components of .#%; are tori. There arises 
the question whether it is possible to pass to new constants of motion in involution 
which are adapted to these tori in the sense that their flow is periodic. This leads to 
the notion of action and angle variables. 


Definition 11.4.1 (Action and angle variables) A system {J,,...,Z,} of smooth 
functions on an open subset W of M H is called a system of action variables if 


1. the Hamiltonian vector fields X7, span the distribution D* over Ww, 
2. (i, 1j} =0 for all i, j, 
3. the flows of the Hamiltonian vector fields X 7, are complete and 27 -periodic. 


Any system of functions !,...,” on W such that (i, /) yield Darboux coordi- 
nates on a neighbourhood of every point of W is called a system of angle variables. 


Note that, up to the choice of the Lagrangian submanifold 3 = 0 transversal to 
the foliation, the functions 1! are given by the flow parameters of the vector fields 
X1,. Hence, they are necessarily multi-valued mod 277. 

The historical origin of action and angle variables can be traced back to Jacobi 
and Liouville. The first modern formulation belongs to Arnold and Avez [23]. Be- 
fore proving existence, let us collect the basic properties of action and angle vari- 
ables, following immediately from the definition. Obviously, both the X7, and the 


13To be defined in Sect. 11.7. 
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X y, form a local frame in D* over W. These two frames are related by 

X 1, = bi Xu, (11.4.1) 
with a unique smooth mapping b: W > GL(n, R). By taking rows of b we obtain 
mappings 


b):W > R", bj (m) := (b;'(m), ..., b? (m)). (11.4.2) 


Proposition 11.4.2 Let I be a system of action variables on WC M* 


1. One has {I;, Hj} =0 for alli, j and {H;, b;*} = 0 for all i, j,k. In particular, 
the functions I; and b;! are constant on the level set components of H;. 

2. With 6% and 6" denoting the flows of Xp, and X{,, respectively, form € W 
we have 


vf os Hy Ay 

fi (m) = OP 0-0 OL gy (m), (11.4.3) 
A; — q ore Tn 

Pi." (m) = ® b-1);1 0) O++-0 P (o-1);"(m)" (11.4.4) 


This implies the following. 

(a) The vector fields X yj, are complete on W. 

(b) The R"-actions W! and W defined by the flows ®",...,6 and 
o,..., 6% on W, respectively, are related by 


Ye (m) =P" bom): (11.4.5) 


In particular, they have the same orbits and these orbits coincide with com- 
pact level set components of H;. 
(c) For every m € W, the vectors 21b,(m),...,22b,(m) form a set of genera- 
tors for the stabilizer of m under W. 
3: If th, Dalek 1 is another system of action variables on Wc M*, then, on WM W, 


1, = AT; +Ci, bj = Ajib; 


with smooth mappings A: WW — GL(n, Z) and C: WN W > R" fulfilling 
dc =0. 


Proof 1. By (11.4.1), we have 
Ui, Hj} = X1,(Hj) = bi‘ X 4, (Hj) = b;*{ Hk, Hj} =0. 
In turn, this implies 
O=[Xy,,X1]=[Xu,.b;'Xm.] = Xu, ()) Xu, = (Hi bj" |X, 


which proves {H;j, b;*} = 0, because the Xy, are pointwise linearly independent 
on M#. Asa consequence, the functions J; and b;/ are constant on the maximal 
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integral manifolds of the distribution D**, As noted in Remark 11.1.2/3, these are 
the level set components of #%;. 

2. To prove (11.4.3), denote the level set component of .# containing m by 2. 
Since » is invariant under all the flows involved, it suffices to show that the curves 
on the two sides of (11.4.3) are integral curves through m of the same vector field 
on %. Indeed, by (11.4.1), for all m € X, 


X1,(m) = b(m);* Xn, (i) 


and by point 1, b(m) = b(m). The statements (a)—(c) now follow. In particular, the 
Arnold Theorem 11.3.3 yields that the orbits of W are level set components of 7%; 
and the periodicity of the flows &/' implies that they are compact. 

3. Since {X7,} and {X i } are frames in the same distribution on WM W, there 


exists a smooth mapping A: WM W — GL(n, R) such that 


Xj,(m) = A(m);/X7,(m). (11.4.6) 


Then, (11.4.1) implies bj (m) = A(m);/b;. By point 2(c), both {b;(m)} and {bj (m)} 
are systems of generators of the stabilizer of m under the action WY. Hence, A(m) € 
GL(n, Z). Then, dA;/ = 0 and (11.4.6) implies J; = Aj/1; +C; with dC; =0. 


Now, we show that action and angle variables exist. Given the great importance 
of these coordinates, especially in perturbation theory, various constructions can be 
found in the literature. Our proof will follow the approach presented in Libermann 
and Marle [181], which in our opinion is particularly transparent. It is based on The- 
orem 8.6.4 and the fact that every Lagrangian submanifold admits local generating 
functions. Thereafter, we will show that action and angle variables can also be rep- 
resented in terms of line integrals. This yields a relation to the more conventional 
approaches in [1], [23], [18]. Other approaches can be found in [189], [71], [80] 
[34] and [116]. Below, we will comment on the relation to the latter. 


Theorem 11.4.3 (Existence) Action and angle variables exist in a neighbourhood 
of every compact level set component of Z. 


Proof Let So be a compact level set component of .% with value h° € R”. By 
Remark 11.1.2/3, X}0 is a Lagrangian submanifold of M. Thus, Theorem 8.6.4 
implies that there exists an open neighbourhood W of Xo in M H anda symplec- 
tomorphism ®: W — V onto an open neighbourhood V of the zero section of the 
cotangent bundle T* X40. Since .% is a submersion, U := #(W) is an open neigh- 
bourhood of h® in R”. Let 2 denote the canonical projection in T* Xo. We can 
shrink W so that the mapping 


X=(106,H): W > SyoxU (11.4.7) 


becomes a diffeomorphism: since 2 and .#w are submersions, by counting dimen- 
sions, we find that x;, is bijective for all m € W. Therefore, for every m € D0 we 
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can find open neighbourhoods W,, of m in W and U,, of h° in U such that xX fe- 
stricts to a diffeomorphism from W,, onto (Win Xo) x Um. Since LX} is compact, 
the open covering of 2},0 by the subsets W,, contains a finite subcovering, labelled 
by my,...,my. If we replace W by 77! ((Vi_, Um;) O (U1 Wm; ), the mapping 
(11.4.7) becomes a diffeomorphism, indeed. As a consequence, the level sets of 
in W, given by 


Zn := x7 '(Zy0 x {h}) 


with h € U, are diffeomorphic to X40 and hence, in particular, connected. 

Let {e;} denote the standard basis of R” and let b; denote the generators of the 
stabilizer P y(X},0) under the action W of R” on Xj given by (11.3.4). For a chosen 
point mo € Xo, we define 


ee 
p:R" > Xp, p(q@) := Uma( se Soa) 
i=1 


Since p(q + 27re;) = e(q) for all 7, the mapping p defines global angle coordi- 
nates!* on the torus 2,0, denoted by q'. Denoting the corresponding fibre coordi- 
nates in T* X}0 by p;, via the symplectomorphism ® we obtain Darboux coordinates 
on W, which we also denote by q’, p;. The idea of the proof consists in constructing 
a canonical transformation from q‘, p; to the desired action and angle coordinates 
2, J; in terms of a generating function S(q, I) of the second kind. 

In the first step, we construct S$ as a function of the variables qg' and h;. Since 
2; is Lagrange for every h € U, the image ®(2) is a Lagrangian submanifold of 
T*X},0 and Proposition 8.3.10 implies that it coincides with the image of a closed 
1-form Bp on Xo. Since 


Bu(m) = ® o x~!(m,h) (11.4.8) 
for all m € Xy0 and he U, this family of 1-forms is smooth. Accordingly, 0* Bp is 
a smooth family of closed 1-forms on R”. By the Poincaré Lemma, there exists a 
smooth function S: R” x U — R such that 


e* Bn = dSh, (11.4.9) 


with Sy(q) = S(q, h). Then, 
, as ; 
P Bn(q) = agi oda ; (11.4.10) 
where aq + 27e;,h) = aq h) for all i, j. It follows that 


S(q,h) = S(q,h) + I(h)-q (11.4.1) 


'4That is, composition with the natural projection R” + R” /27Z" =T" yields a diffeomorphism. 
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with a smooth function § : R” x U > R" satisfying S(q + 27e;,h) = S(q, h) for 
all i and a smooth mapping J : U > R”. For the components of the latter, we read 
off 


1 
7;(h) = 5, (Sa + 2xe;,h) — S(q,h)). (11.4.12) 
Since {yo coincides with the zero section in T* X}0, we have mace h°) = 0 and 
hence 
1 2n as ; 
I; (h°) = — -(q, h°)dg' = 0. 11.4.13 
(0!) = 5 [Saha (1.4.13) 


Obviously, the functions J; o # on W are in involution. We take them as candidates 
for the desired action variables and S as the generating function for the desired 
canonical transformation. First, we show that the variables J; on U are in one-to- 
one relation with the variables h;. 
Lemma 11.4.4 
1. Forallm € W, p;(m) = ar (m), HE (m)). 
2. For allhe U, det( 2 (h)) £0. 

J 


Proof of the Lemma 1. Denote h:= #(m). By the definition of By, we have 
®(m) = By(z o @(m)). By the definition of the coordinates g' on W, 


pog(m)=poq(xo0®(m)) = 0 O(m). 
Thus, we obtain 
pi(m) = pj o P(m) = (Bp)i (1 0 B(m)) = (Bp)i (0 0. g(m)) = (0* Bn), (4m), 


where (Bp); and (o*fp); denote the coefficient functions of the 1-forms Bp and 
p* Bn with respect to the global frames {dq'} in T*D}0 and {o*(dq') = dq‘} in 
T*R", respectively. Hence, the assertion follows from (11.4.10). 

2. According to point 1, in the coordinates q', p; on W and q', h; on dyo X U, 
the inverse of x has the form 


as 
(q, h) be (ar= *<(a.h)). 
q 


Since x is a diffeomorphism, for every (q, h), 


aet( of m)) 40 (1.4.14) 
e : , : A. 
agiah; “4 


Assume that the assertion of point 2 does not hold. Then, 


n 
al; = 
j= (h) = 0 
D5, | ) 
j=l , 
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for some h € U and some nonzero a € R”, and (11.4.11) implies 


n as n as 
a, (q, h) = ya (q, + oa Lage ®= Lag, (q, h). 


j=l j=l al i=l J= 


Since the function q > ae aj os (q, h) is periodic, it has at least one maximum, 
. j 


say at q. There, we have 
n 2 
aS oo = 
yi 57a, =O. 
jar OE ONS 


which contradicts (11.4.14). This proves the lemma. 


Proof of Theorem 11.4.3 (continued) By Lemma 11.4.4/2, U and hence W can be 
shrunk so that the mapping 


IT:U>1(U) CR’, hr /(h), 
becomes a diffeomorphism onto an open neighbourhood of the origin in R” and the 
functions J; provide coordinates on U. In particular, the Hamiltonian vector fields 
of the functions J; o # span the distribution D* over W. That their flows are com- 


plete and 2z7-periodic will be obvious after the construction of angle coordinates. 
To obtain the latter, we view S as a function of q and I and define functions 


as 
eR" xITU)>R, HGD= GD. (11.4.15) 
j 


Equation (11.4.11) yields 
0 (q+ 27e;, 1) — 0 (gq, 1) = 275; (1.4.16) 


for all i, 7. Hence, for every I, the composition of 7 : R” — R” with the natural 
projection pry. : R” > R"/27Z" = T” induces a smooth mapping 


g: 3,0 > R"/2xZ" =T", —-(o(q)) = pr» (01). 


Since by Lemma 11.4.4/2 and Eq. (11.4.14), we have 


Pe gat p) 40. (11.4.17) 
agial; (4. s 


this mapping is a local diffeomorphism. We show that it is a global diffeomorphism, 
which implies that the functions oF induce global angle coordinates on 2,0, denoted 
by the same symbol. First, the image of g coincides with T”, because it is open and 
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by compactness of Xo it is also closed. To prove that @ is injective it suffices to 
show that it is homotopic to a diffeomorphism. Indeed, by (11.4.12) we have 


os 
ot _ I Raed, I : 
1M=¢q + aI; (q. 1 
so that 
: as 
F: [0,1] x Sy0> T", F(s, (q)) := ptm q+s@D 


is a smooth homotopy between @ and the mapping 


Sy > T", p(q) > pry (q), 


which is a diffeomorphism, because the g' are global angle coordinates. Now, the 
mappings 


W>R", mb 0(r0O(m), 10H (m)), W>I(U), mryIo#(m), 


define coordinates on W, for which we keep the notation 9, J;, respectively. By 
construction, the functions J; are constant on the level set components X}, and the 
functions 9! are global angle coordinates on D). Finally, by Lemma 11.4.4/1 and 
by the definition of the angle coordinates %’, on W the coordinates q', p; and 9, 
7; satisfy the relations 


SGD pin 0S(q,1) 
agi : Ol; ; 


L 


We conclude that the coordinate transformation (qi Di) (3, I;) is canonical and, 
therefore, the coordinates 3’, J; are Darboux. Then, we have X7, = 0y:, so that 
the flow of X7, is complete and 27:-periodic and hence the J; are action variables, 
indeed. 


Now, we restrict the foliation (11.3.1) to the subset Vee CVF consisting of the 
compact level set components of #%-. Thus, let M me denote the subset of M” of 
points whose level set component of .#%; is compact. By Theorem 11.4.3, Mx can 
be covered by Darboux charts on M built from action and angle variables. By point 
(b) of Proposition 11.4.2/2, the domains of action and angle variables are necessarily 
contained in M, ig . It follows that M — is an open submanifold of M.'> Hence, Vee 
is an open subset of V* and the projection (11.3.1) restricts to a projection 


Hoi Me ave, (11.4.18) 


'STn fact, this is immediate after one has shown that the mapping y defined by (11.4.7) can be 
made into a diffeomorphism. 
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Corollary 11.4.5 The foliation (11.4.18) is a locally trivial fibre bundle with typical 
fibre T”. 


Proof By Theorem 11.4.3, Mw can be covered by Darboux charts (3, 1) 
on W™ built from action and angle variables. Since W is a union of level set 
components of .% which are labelled by 1‘, the subset 


UM -= Fe,.(W) 


of Ve is open and J) induces a local chart 7 .Y@ _, R", Using Proposi- 
tion 11.4.2/3, it is easy to check that these charts form an atlas on V# , thus en- 
dowing ve with the structure of a smooth manifold. To prove local triviality, we 
observe that the local charts (9, 1) define diffeomorphisms 


WwW) > Tx [(w™), (11.4.19) 


Composing these with the diffeomorphisms (T' @)y-1. J @O(wo) + U™, we ob- 
tain diffeomorphisms 


x9 WO =F 1'(F) Tx", xm) := (Helm), 0 (m)), 


which are obviously local trivializations of the projection (11.4.18). 


Remark 11.4.6 Below we list different approaches to action and angle variables. 
The first three are close in spirit. They are based on the original ideas of Arnold. 


1. In the approach of Arnold and Avez [23], see also [1], one assumes that one 
has a local torus fibration and defines action and angle variables via a canonical 
transformation given by a certain generating function. The action variables and 
the generating function are defined by line integrals over a symplectic potential. 
Here, point 2 of Lemma 11.4.4 is usually imposed as an assumption. 

2. In the approach of Nekhoroshev [228], the Tubular Neighbourhood Theorem for 
embedded submanifolds is used to prove that a neighbourhood of any torus has 
the structure of a local torus fibration. On such a neighbourhood, the action vari- 
ables are defined via line integrals over a symplectic potential. Angle variables 
are constructed using the Liouville Theorem.!° 

3. In the approach of Libermann and Marle [181], the Weinstein Theorem 8.6.4 and 
the fact that every Lagrangian submanifold admits a local generating function is 
used to find a generating function S of a canonical transformation providing 
action and angle variables. Here, the action variables are obtained directly from 
the periodicity properties of S. 


'6Tn view of the classical Carathéodory-Jacobi-Lie Theorem, this construction generalizes to non- 
commutative integrable systems, cf. Theorem 11.8.3. 
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4. The approach of Markus and Meyer [189], see also Duistermaat [80], Bates and 
Sniatycki [34] and Guillemin and Sternberg [116], is different. Here, the action 
variables are derived from a local frame in the period bundle,!? made up by the 
stabilizers Py (2) (the period lattices) of the level set components of .#%; under 
the action of IR” defined by the flows of the Hamiltonian vector fields X y,, cf. 
Remark 11.3.5. Angle variables are constructed via the Liouville Theorem. 


To build the bridge to the original approach of Arnold and Avez, described in 
point 1 of Remark 11.4.6, we derive explicit formulae for the action and angle vari- 
ables constructed in the proof of Theorem 11.4.3 in terms of line integrals. We use 
the notation of that proof. Without loss of generality, as the potential S(q, h) of the 
smooth family of closed 1-form p* By, we may choose 


S(q, h) = p* Bn with Tq: [0, 1] > R”, Tq(t) := rq. (11.4.20) 
Tq 


This can be rewritten as an integral over X}, as follows. Let 6 denote the canonical 
1-form in T* X},0. By (8.3.2), 


sa.) =f p'tn= | a= | 0. 
Tq POTq Bnopocty 


Here, Bh o p © Tq iS a curve in T* 2},0 which is contained in the image of Y under 
@. Hence, there is a unique curve VG in X', such that 


Bro poty=Poyy. 


Thus, 


scab) = | P*0. (11.4.21) 
va 
To obtain from (11.4.21) an integral representation for the action variables J;, denote 
yrs Ponce? By (11.4.12) and (11.4.21), 


ri 
1 
[;(h) = =| P* 6. (11.4.22) 
20 yp 


Here, *6 is a potential for the symplectic form w and the curve yp encloses the 
i-th factor!® of , and is contractible with respect to the remaining factors. Thus, 
{y}} is a system of fundamental cycles!° for 2_. Now, since the mapping ht / (h) 


'7To be introduced in Sect. 11.7. 
'8 As defined by the diffeomorphism 50 = T” induced by the mapping p. 


'94 system of closed curves whose homotopy classes yield a minimal set of generators for the 
fundamental group of Xp. 
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satisfies the condition in Lemma 11.4.4/2, one can express h in (11.4.21) in terms 
of I. Thus, we obtain the generating function S as 


S(q,D = I, O*6, (11.4.23) 
Y 


q 


with v4 = Va | and the angle variables 1/ are given by (11.4.15). That the lat- 


ter satisfy (11.4.16) can also be read off from the integral representation (11.4.23), 
because integration over the curve Vea dee e; yields 


sqq+2r6.0= | 


VYq+2re; 


b*6 =) ’*6 +f &*6 =S(q,1I +27]j. 
vi y} 
Now, we show that Formulae (11.4.22) and (11.4.23), with ®*6@ replaced by an 
arbitrary symplectic potential for w, {fy} replaced by an arbitrary system of funda- 


mental cycles and ve replaced by some homotopic curve allow for locally defining 
action and angle variables. 


Theorem 11.4.7 (Representation by line integrals) Let mo € M, let h° = #(mo) 
be a regular value of # and assume that the connected component Xo of mo of 
the level set #—'(h°) is compact. Let the following data be given: 


1. a connected open neighbourhood W of Xo in M XH such that 
Zh = WG" |(h) 


is a compact level set component of A; for allhe U = 7/(W), 

2. a potential t for w on W, 

3. smooth families fark baa {y} of curves in W such that (yh, aie yy is a system 
of fundamental cycles in Xy for allhe U, 

4. Darboux coordinates q', p; on some open neighbourhood V of mo in W such 
that t = p;dq' and such that, for all h € U, the subset VN SD}, is simply con- 
nected and can be coordinatized by the q', 

5. a smooth mapping s:U — V such that 7 os =idy. 


Define 


1 
I; (h) =| T, S(q, h) =) T (11.4.24) 
2x ye vq 


where Ya is some curve in V 1 Sy from s(h) to the point with coordinate value q. 


Then, V and W can be shrunk so that 


1. The first equation in (11.4.24) can be resolved for h and hence S can be written 
as a function of q and I, 
2. the functions 3' on V defined by 


; as 
v(m) = a7 am: I(m)), (11.4.25) 
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where I = I 0 Hw, complement the functions I; to Darboux coordinates on V 
which have a unique extension to action and angle variables on W. 


For the proof, we need 


Lemma 11.4.8 Lethe U. 


1. [fy and yz are curves in X, which are homotopic with fixed endpoints or if they 
are closed curves in &, which are homotopic in Ly, then a hie? 
2. If t%| and T2 are potentials for w on W, then Syn ™T— Sn t, does not depend on h. 
L I 


Proof of Lemma 11.4.8 1. Let F :[0, 1] x [0, 1] ~ 2 be a smooth homotopy from 
y) to y2, viewed for simplicity as a surface in &,. By Stokes’ Theorem and the fact 


that X}, is Lagrange, 
fr-fref r= w=0. 
YI v2 oF F 


2. Since W is connected, so is U. Hence, for every h € U we find a smooth curve 
t +> h(t) such that h(0) = h® and h(1) = h. Then, the mapping F; : [0, 1] x [0, 1] > 
W defined by Fj(t,r) := “wen (r) represents a smooth surface in W and Stokes’ 
Theorem yields 


(a-n)-f @-n= | am—n=0 
yp yn Fi 


Proof of Theorem 11.4.7 The idea of the proof is to relate the functions J; and 
2! defined in the proposition with the action and angle variables constructed in the 
proof of Theorem 11.4.3 in a neighbourhood of 2,0 and with the point 9 chosen to 
define the mapping p used there. Here, the objects from that proof will be denoted by 
their original symbols endowed with a hat. For simplicity, without loss of generality 
we may assume W=WandU =U. 

In the first step, we relate 7 with i. Lethe U be given. Since both {yh} and 
{yh} are systems of fundamental cycles for Xj, there are unique integer-valued 
(n x n)-matrices A and B such that vn is homotopic to Aj/ yp and ph is homotopic 


to Bj! yy Here, by the sum of two curves we mean the composite curve. Up to 
homotopy, the order of composition is not relevant. Obviously, A and B are inverse 
to one another. Using Lemma 11.4.8 and (11.4.22), we obtain 


1 1 ok 
ny= =f ero+c=— | e+e =asim +c, 
20 yh 2m JA,iph ; 


where (Cj,..., C,) is a vector in R” which is independent of h. Since the families 
{y}} depend smoothly on h € U, so do the coefficients Aj/ and B;/. Since U is 
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connected, these coefficients are therefore constant in h. To summarize, the func- 
tions 7; and J; on W are related by an invertible affine transformation with constant 
coefficients: 


= Ail; +Ci, i, = BU; —C)). (11.4.26) 


In particular, W and hence U and V can be shrunk so that the first equation in 
(11.4.24) can be resolved for h, because this is true for the corresponding equation 
for I. 

In the second step, we show that the functions % are well defined and comple- 
ment the functions J; to Darboux coordinates on a possibly smaller V. Consider 
the integral in the definition of S. According to point | of Lemma 11.4.8, the value 
of this integral does not depend on the homotopy class with fixed endpoints of the 
curve Mee Since VM Xp is simply connected, it therefore depends on q and h only, 
and the dependence is smooth. Thus, S is a smooth function in the variables q and 
h, and by expressing h in terms of I we find that the 2! are well-defined smooth 
functions of q and I. By plugging in for q and I the mappings g and J =/ 0 Hy, 
v' and S become functions on V. Let 4: VM Xp —> V be the natural inclusion 
mapping. By construction, 


dS = yt. (11.4.27) 
Since the functions q' provide coordinates on V N S}, for all h € U, the functions 
qi and J; yield coordinates on V. In these coordinates, the left hand side reads 


Ind = Uh (Agi S)dq’ + (81, S)dl;) = ((gi S) 0 tn) d(q' o tn), 


because J; o ty is constant. On the other hand, in the coordinates qi and pj, the right 
hand side reads 


t= (peo tn)d(q' ° th). 


Since the d(q' © ly) are pointwise linearly independent, and since (11.4.27) holds 
for all he U, we conclude that 


Ogi S = Pi (11.4.28) 
and hence 
dS = pidq' + 8'dl; (11.4.29) 


on all of V. By taking the exterior differential of (11.4.29), we obtain w = dJ; A dvi. 
First, this implies that the mapping (7, /) : V > R?” is a local diffeomorphism. 
Hence, we may shrink V so that o and ], define coordinates. Second, this implies 
that these coordinates are Darboux. 

In the third step, we show that 2% and J; extend to unique action and angle co- 
ordinates on W, where W is shrunk appropriately according to V. Since the /; are 
already defined on W, it suffices to consider the 3. By (11.4.26), on V we have 


dl; A d0! == di; A dd! = d(B;/1;) Add! = dl; A d(B,/5"). 
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Thus, by Remark 8.1.6/3, Bid! = 90/ +a/ oJ and hence 
ov = BH —alol, (11.4.30) 


where a! are the coefficient functions of a closed 1-form on /(V) C R”. The func- 
tions on the right hand side are defined on W. Hence, this equation defines exten- 
sions of 9 to functions 3! on W. These functions arise from the 3! by an invertible 
linear transformation with integer coefficients, followed by a shift which depends on 
m only through /, that is, which is constant on each 2. Since the pf provide global 
angle coordinates on each L}, in W, so do the B. Thus, }! and I; are action and 
angle coordinates on W. That the # are uniquely determined by their restrictions 
2 to V follows once more from Remark 8.1.6/3. 


Remark 11.4.9 


1. According to (11.4.25) and (11.4.28), the function S defined by (11.4.24) is a 
generating function of the second kind for the canonical coordinate transforma- 
tion (g, p) + (8, I) on V. In particular, 


as 
et{| —_—— ] £0. 
This generating function is related to the generating function used in the proof 


of Theorem 11.4.3, here denoted by S, as follows. In view of (11.4.30), from 
(11.4.29) and the corresponding equation 


dS = pidg! + b'dl; 
we read off that 
d($ — $) = Wal; — dI; =I*a. 


Since the 1-form a on /(U) is closed, on every contractible subset of V, the 
generating functions § and S differ by a function of the action variables /;. 

2. For later reference we note the following representation of (11.4.24) in terms of 
local Darboux coordinates: 


1 . 
7;(h) = =| pidq', S(q, h) = pidq'. (11.4.31) 
Jyh yh 


q 


To conclude this section, we discuss dynamics in terms of action and angle vari- 
ables. Since the Hamiltonian H is constant on level set components of #, in action 
and angle variables #' and J;, it depends on the J; only. Hence, with the notation 
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the Hamilton equations take the following simple form: 


o4 : 0H 
0 (t) =a! (I(r), [;t)=-— =0. (11.4.32) 
ov 
The corresponding integral curves are given by 
v(t) =0/ (t+ 9 (0), 1;(t)=1;(). (11.4.33) 


The functions w/ (I) are called the characteristic frequencies associated with the 
action variables /;. To summarize, the action variables J; are functionally inde- 
pendent constants of motion in involution. For each value I, the dynamics of the 
system reduces to a motion on the torus Zy with constant angular velocities w/ (I). 
At this point, the reader should recall the discussion of Sect. 9.6, where we found 
this structure in the neighbourhood of a critical point of the Hamiltonian function of 
an arbitrary Hamiltonian system. As noted there, there are two qualitatively distinct 
cases: if the frequencies w (I) are rationally independent, that is, if for all nonzero 
k € Z” one has 


ko’ 40, 


each integral curve is dense in Xj. In this case, the torus Xj] is said to be non- 
resonant and the motion on it is said to be quasiperiodic. If, in contrast, the fre- 
quencies w’' (I) are rationally dependent, Xj is said to be resonant. In this case, we 
have 


Proposition 11.4.10 Jf the frequencies wh = w! (Io) are rationally dependent, there 
exists a canonical transformation (0, I) b> (, f ) to new action and angle variables 
such that a certain number | <n of the new frequencies is rationally independent 
on Xj}, whereas the remaining frequencies vanish. 


Thus, the torus Xj], is decomposed into /-dimensional invariant subtori and the 
motion on these subtori is quasiperiodic. 


Proof The frequencies ai, define a module 
n 
pa Lk; € a} 
i=1 


over the integers. Let {bo}, eas b!} be a basis in this module. Since, by assumption, 
the real numbers wy) are rationally dependent, / < n and one has n — / rationally 
independent equations 


Bi jaf =0, i=I4+1,...,0, fHl,....n, 


with Bi; being an integer-valued matrix whose rows (B',,..., Bn), i=1 41, 
...,M, are rationally independent. The basis elements b’ possess a (not necessarily 
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unique) decomposition 
b=Dijop, i=1,...,1, f=1,...5n. 
We combine the matrices B and D to an (n x n)-matrix A =[D' B"] with integer 
entries and define 
BD := A'D, T:=A7'T. (11.4.34) 


This transformation is canonical, because J; dovi= i jd . Moreover, since A is an 
integer matrix, %’ and J; are action and angle variables again. In the new variables, 
the Hamilton equations take the form 


dH = _. * 
(1) =a' (D), Ij; =—-—=- = 0, 


_—— 
ali av 


where the Hamiltonian is given by H(A/). For the new frequencies @ correspond- 
ing to Io = A7!Ip we obtain 


This proves the proposition. 


Finally, let us recall the important notion of degeneracy. If the frequencies do 
not depend on the action variables, the system is called isochronous.Otherwise, it is 
called anisochronous. In the latter case, it is said to be non-degenerate if 


a°H da! 
det = det #0. (11.4.35) 
01,01; al; 


Then, the mapping I+> (I) is a local diffeomorphism and one can use the fre- 
quencies as (non-canonical) coordinates labelling the tori. In this case, almost all 
tori are quasiperiodic. Nonetheless, there is a dense set of resonant tori as well. If 
(11.4.35) does not hold, the system is said to be degenerate. Often the occurrence of 
degeneracies is related to the fact that the system possesses more than n functionally 
independent constants of motion.”° This situation will be discussed in Sect. 11.8. 


11.5 Examples 


In this section we present the construction of action and angle variables for the 
harmonic oscillator, the Kepler problem and the symmetric Euler top. For the first 
two of these examples, we will use Theorem 11.4.7. 


20Which, of course, cannot all be in involution. 
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Example 11.5.1 (Harmonic Oscillator) We consider the Hamiltonian system 
(R*, dp A dg, H) with 
pr ka? 


H(g, p)=—+—. 
(q, P) an 5 


Being autonomous and having one degree of freedom, this system is integrable, 
where # = H. Thus, the level sets of # coincide with the energy surfaces 
2 2 
Ee= |g. per: 24 Le], 
2m 2: 

which at the same time coincide with the orbits of the system, that is, with the images 
of the maximal integral curves of the Hamiltonian vector field X77. For E > 0, Ye 
is an ellipse, whereas for E' = 0, it consists of the origin, which is the only critical 
point of X;,. In either case, ’¢ is connected and compact. Thus, we can apply 
Theorem 11.4.7. Let W = R? \ {0} and t = pdq. Since the fundamental cycle y” 
is given by 2’x itself, and since on X'¢ we have 


k 
pq, E)= 2m(E = ?). 


where q runs between — 2B and 2E for the action variable we obtain 


1 pve E 
= T= eee oo (11.5.1) 
Las DTH Ww Je 28 (2) 
with w= Re . To calculate the generating function and the angle variable, we 
choose, for example, V = {(g, p): p > 0} and obtain 


Sq@,D=I aresin( > ‘) ee 5 \2mol ~ m2o2q?, (11.5.2) 
M@ 

Seaecal sy = 11.5.3 

acsin( i ‘) ( ) 


(Exercise 11.5.1). By extending % in the obvious way to an angle coordinate on 
W, we obtain the desired action and angle variables on W = R? \ {0}. Obviously, 
these coordinates induce a symplectomorphism (W, dp Adq) = (S! x Ry, dI Adi). 
In the coordinates 3 and J, the Hamiltonian is given by H = Jw and hence the 
frequency corresponding to J is 
oH 
I) =— =o. 

w(1) aT a) 

Thus, for E > 0, we have a periodic motion on the torus 2 with frequency w: 


H(th=o-t+9(0), I(t)=1(0). 
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Example 11.5.2 (Kepler Problem) Let us consider the Hamiltonian system 
(T*(R? \ {0}), dé, H) with 


Pp? 


2 lal’ 


From the discussion in Example 11.1.3 we know that the functions H, L? and L, 
are constants of motion in involution which are functionally independent on a dense 
subset of T* (IR? \ {0}). It is clear that the level sets of 7 are compact only in the 
case 


1 
ao ee LO. (11.5.4) 


In spherical coordinates r = ||q||, , @ and the corresponding fibre coordinates (con- 
jugate momenta) p;, Py», Pg, the constants of motion read 


2 2 
ee Po ) 1 2 2 Po 
H= ++ ‘ L* =p, + ———. L,= 
5( r2 2 sin? 9 r re r2 sin? 3 ae 
(11.5.5) 


(Exercise 11.5.2). Denote the values of # = (H,L7, L,) by h= (E, 07, ®). On 
the level set X},, the momenta satisfy the relations 


29642 oe 7=0" - =@ (11.5.6) 

ie ae aad Pa sin? 9 ene ~ 
Since ®? < , the right hand side of the second equation in (11.5.6) has 
two zeros, sar 2 ph and it is non-negative in the interval between a and on, ax 


Thus, on this interval, we can define a function py (v, h) by the positive square root 
of the right hand side. In the same way, since E < 0, the right hand side of the 
first equation in (11.5.6) has two zeros, Pe <r}... and in the interval pi <r< 
7 os a function p;(r,h) can be defined by the positive square root. To Sanctnick 


fundamental cycles on the tori X,, define curves 


ynz(t) = (t,0,0; p,(t,h),0, &), Zales 
ypa(t) = ("mins t0;0,+p9(,h), 6), t €[Pmins Omax], 
Vet) := (rRins Din £3 0,0, 6), t €[0, 27]. 


Denote _ = Ve co) vee and y; = ye fo) Pers Then, {v3}, {yb} and {yp} are 
smooth families of curves and, for every h, eas ye vps is a system of fundamental 
cycles on 2}. Using these cycles and choosing the canonical 1-form @ as a potential 
for the symplectic form, we calculate the action variables: 


ha S— / jet] Giese 2) 
On yp i: rh. aad ~ «fae , 


min 
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1 1 omax 
hy = = | -- | Ps(t, hdt = 0 — @, (11.5.7) 
x Jyh Jah 


min 


1 1 20 
a bdt =o. 
2m Jyh 2 Jo 


For the inverse of the mapping (EF, 0, ®) (Nh, h, Ih) we find 


1 
ee . 2543 Dak. 11.5.8 
Ri Let ey aes ? ( ) 


Finally, to compute S and the angle variables 3!, 37, 33, we choose the mapping 
s:U—>Vas 
s(h) = (r2 


min’ 


on. ,0;0,0, &) 


min? 
and express h in terms of I and @ in terms of the variables r, 3, @ and 1), ln, 13. We 
find 


S = Sr, qh, h, Is) + Sed, I, Is) + Sod, Ts) 


with 
rp 1 1 I 13)? 
eof 5 (2+ 13) an 
rh t A4W+ht+hy t2 
v IZ 
3 
sy =f Jotnr- “3dr, 
oe sin‘ t 
So = bo 
and 


go! aS; w= O57 o8e P= O5r aSy 
dln 012 Ce 0b 


+o. 


By (11.5.8), in the action and angle variables v!, 1;, the Hamiltonian reads 


1 


H())= 
@ 2, + b+ hb) 


and the characteristic frequencies are 


o!D=aD)=07°D)= 


———_ =o (D. (11.5.9) 
h+h+h) 

Thus, the system is degenerate, and Lemma | 1.4.10 yields that the dynamics reduces 
to one-dimensional subtori, on which a periodic motion with period 


20 20 
T= — = —____ (11.5.10) 


o  /(-2E)3 
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takes place. This is Kepler’s Third Law, because the semimajor axis is equal to — se ; 


cf. Example 10.6.3. Let us apply the procedure of Lemma 1 1.4.10 to determine new 
action and angle variables (g, J) adapted to this reduction. The module generated 
over Z by the characteristic frequencies is Zw. We choose w! = w as a basis and 
ow? — w' =0 and w* — w* = 0 as the relations. This leads to the transformation 


matrix 


t 0° <0 
A={=1 1 0 
0 -1 1 
Then, (11.4.34) yields 
gi=v', ga=r—p!, pap (11.5.11) 


and 


/ 1 
JiHKhth+h= —3R’ yg=h+h=20, J3=1,=@. (11.5.12) 


The new action and angle variables g', J; are referred to as the Delaunay elements. 
In order to avoid degeneracy, in addition to (11.5.4) we have to assume 


L. <|L. (11.5.13) 


In terms of the Delaunay elements, the Hamiltonian and the frequencies are given 
by 


HD=-—>, 8O=5 #FM=HM=0. (11.5.14) 
257 i 


To clarify the geometrical meaning of the Delaunay elements, from (11.5.12) we 
read off that 


=-—, J5 -L?, J3=Lyz. (11.5.15) 


Moreover, we use (10.6.31) to express the semimajor axis a, the semiminor axis b 
and the inclination angle a (the angle between the z-axis and the direction normal 
to the plane of motion) in terms of the J;. This yields 


L J 
ae b=av1—e=Sbh, cosa = T= T 
2 


The geometrical meaning of the angle coordinates gy! and g? is illustrated in 
Fig. 11.2: g! is the angle between the x-axis of the inertial frame and the nodal line, 
that is, the line of intersection of the plane of motion with the x—y-plane. Therefore, 
it is called the longitude of the ascending node. The angle coordinate gy? is the angle 
between the perihelion and the nodal line. The angle coordinate y? coincides with 
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Fig. 11.2. Geometrical ZA 
interpretation of the Delaunay 
elements 


normal direction 


= perihelion 


nodal line 


the so-called mean anomaly. For a detailed discussion of the angle coordinates g', 
see [1] or [67]. 

To summarize, let us stress once again that the Delaunay elements are well de- 
fined under the conditions (11.5.4) and (11.5.13), that is, for elliptic orbits. Thus, 
circular orbits and orbits lying on the ecliptic plane are excluded. 


Remark 11.5.3 For given J, the invariant 3-torus 2’y is defined by the equations 


2 
m= 1 2 & 


Pe= J3. 


On the other hand, we know that the motion reduces to the level sets of the energy- 
momentum mapping & = (H, L,, Ly, L,). According to (10.8.5), for negative en- 
ergy E and L¥ 0, the level sets are 2-tori. Obviously, these invariant 2-tori foli- 
ate the 3-tori Xy provided by the Arnold Theorem. This is in accordance with the 
Nekhoroshev Theorem, to be discussed in Sect. 11.8. Indeed, the invariant 2-tori 
coincide with the level sets of the n + 1 = 4 functionally independent constants of 
motion H, L2,L y and L, which are in involution with the n — 1 = 2 constants of 
motion H and L?. 


Example 11.5.4 (Symmetric Euler Top) Consider the Hamiltonian system 
(T*SO(3) = SO(3) x s0(3), dO, H) with H given by (11.1.9). The action and angle 
coordinates we are going to construct are the so-called Andoyer variables [7, 43]. 
Unlike the preceding examples, here we will proceed by first defining the coor- 
dinates geometrically and then proving that they possess the defining properties of 
action and angle variables. For clarity, we use the vector notation, that is, we identify 
50(3) with R? via the isomorphism (5.2.6). 
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Let us assume that the tensor of inertia © has been diagonalized and let us denote 
the principal moments of inertia by O;, i = 1,2, 3. As a local chart on the config- 
uration space SO(3), we use the Euler angles”! $, 3 and w, where 0 < ¢, W <2 
and 0 < # < z. Let us denote the corresponding fibre coordinates in T*SO(3) (con- 
jugate momenta) by pg, p» and py. For later use, we introduce the notation 


Po — Py Ccosv Pw — Ppcosv 
A SS B =S 
sind sind 
To define the Andoyer variables, we start with expressing the angular momentum 
L in terms of the Euler angles and their momenta. This can be done by rewriting 
the canonical 1-form on T*SO(3) in terms of the L;, or by means of the Legendre 
transformation generated by the Lagrangian 
: 1S) : O10; + O205 + 0303 

ek @) = al 1@j + O2w5 + 303), 
where w = w(¢, 3, yr) denotes the angular velocity of the top. We shall follow the 
latter strategy. The Legendre transformation is given by 


_ar _ar _ar 


Using this and L = Ow = (O1@1, O2@2, O33), we obtain 


Po 


Lj} =pyscosw+Asiny, Lo = py sinw — Acosy, L3 = py (11.5.16) 


(Exercise 11.5.4). Now, we can construct the Andoyer variables j, g,/ and J, G, L. 
For that purpose, recall that the elements of the inertial frame and of the body frame 
are denoted by nj, no, n3 and ej, e2, e3, respectively. Define 


J:=L-m, G:=|Li,  L:=L-es. isi) 


To define the remaining Andoyer variables j, g, 1, we have to assume that L is 
neither parallel to e3 nor to nz, so that the vectors 


n :=n x L, n’ :=L xe 


do not vanish. Then, we can define j to be the angle between the n;-axis and the 
nodal line defined by the vector n’, g to be the angle between the nodal lines defined 
by n’ and n” and / to be the angle between the e -axis and the nodal line defined by 
n”. While G, L and / are defined intrinsically, J, g and j depend on the choice of 
the inertial frame. A brief calculation (Exercise 11.5.4) yields 


J=pe, G@=p3t+pyt+A, L=py (11.5.18) 


21Cf. Exercise 5.5.4. 


610 11  Integrability 


and 

ing —B 

cogs eS ee (11.5.19) 
P59 + B 

Po Py — cos? (ps + py, + A’) 

ie mee cae (1.5.20) 
(3 + 42)(p3 + B2) 
in — A 

nei eee (11.5.21) 


V Pp + A? 


We note that the coordinates G, L,/ and G, J, j parameterize the angular momen- 
tum L in the body frame and aL in the inertial frame, respectively: 


L= (VG? — L? sin(/), VG? — L? cos(I), L), 
aL = (VG? — J? sin(j), —vV G? — J? cos(j), J). 


By a lengthy but straightforward calculation (Exercise 11.5.4) one can prove that, 
in the Andoyer variables, the canonical 1-form 6 on T*SO(3) reads 


6 = Jdj + Gdg + Ldl. (11.5.22) 


Thus, these variables provide local Darboux coordinates on T*SO(3). Now, consider 
the Hamiltonian (11.1.9). In Euler coordinates, it reads 


sin? 


r (pp cosy +e oe cos? cin wy 
~ 20; 


- 5) 
Z (pp sinw — Pompe cosy)? Py 
202 203 


+V@,0,W). 


Rewriting this formula in terms of the Andoyer variables, we obtain 


L? 
\@ L*) + 50, TVG 8) (1.5.23) 


sin?! —cos?/ 
= + 
20; 202 


(Exercise 11.5.4). As we know from the discussion in Example 11.1.5, in general 
this system is not integrable. Therefore, let us assume that OQ; = QO and V =0, 
that is, let us consider a symmetric Euler top. Then, the Hamiltonian (11.5.23) boils 
down to 


_ 1 ) 5) Ors 
Wma (2 +aL ), a= o 


Since it does not depend on the Andoyer angles j, g, /, the Andoyer variables J, 
G and L are constants of motion. By (11.5.22), they are functionally independent 


(11.5.24) 
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and in involution. Hence, the system is integrable. Moreover, since J, G and L are 
canonically conjugate to angle coordinates, the flows of their Hamiltonian vector 
fields are 27-periodic. Thus, J, G and L are action variables and j, g and / can be 
taken as the corresponding angle variables. The characteristic frequencies are 


e_9H_G 19H aL 


foe = a aa ee oe 
eS ap ES ae ee Ay 


._ OH 
of a4 (1.5.25) 


Since w/ = 0, the system is degenerate.”* As in the previous example, we have 
more than three constants of motion, and their level sets decompose the 3-torus 
into 2-tori. In the inertial frame, the constant of motion L is fixed in space and the 
figure axis performs a circular motion (regular precession) around the axis of L with 
frequency w’. The frequency w! characterizes the rotation of the top around its own 
symmetry axis, given by e3. Altogether, we find a quasiperiodic motion on the 2- 
torus, parameterized by the angles g and /. This can be further illustrated by the 
following observation: since L; = O;@;, at each moment of time, the vectors L, e3 
and @ lie in a plane and we have 


1 L 
o= oF (aes +67). (11.5.26) 
The motions of rotation and precession define two cones: the first one, called the 
space cone, is defined by the rotation of @ around L. The second one, called the body 
cone, is defined by the rotation of w around the figure axis e3. The body cone rolls 
without slipping on the space cone, with the instantaneous tangent line coinciding 
with the axis defined by w. 


Remark 11.5.5 


1. If Lis parallel to e3, we have L x e3 = 0 and the Andoyer variables are not well 
defined. In this case, we have L = L3e3 and hence G? = L? = L?. Thus, 


(2 (11.5.27) 


Obviously, in this case @ and e3 coincide, that is, the body cone and the space 
cone degenerate to a ray defined by the axis of symmetry and the top rotates 
around this axis with frequency w = it = G/O3. We note that for L = 0 we get 
an analogous degeneracy. In this case we have 


Pe (11.5.28) 


Thus, w and L coincide, that is, the space cone degenerates to a ray and the 
body cone opens to build a half space. The symmetry axis rotates in the plane 


>2For the special case © = Qs there is an additional degeneracy. 
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orthogonal to L with angular velocity w* and there is no rotation of the body 
with respect to its symmetry axis.”* Finally, there is the trivial case defined by 
G =0. Then, L = 0 and the top remains at rest. 

2. The case G = L corresponds to the two critical points L = (0, 0, +/) of the re- 
duced Hamiltonian with corresponding critical value given by (11.5.27). The 
case L = 0 corresponds to the critical circle defined by L(@) = (J cosa, / sina, 0) 
with critical value given by (11.5.28). These critical subsets can be viewed as in- 
tersection sets of the 2-sphere Ss? with the ellipsoid defined by the kinetic energy. 
They belong to the bifurcation set of the energy momentum mapping, cf. Exam- 
ple 10.6.5 and Exercise 10.8.3. 

3. We note that there are further coordinate singularities, defined by the condition 
n3 x L=0. That the Andoyer variables are not defined globally suggests that 
the bundle (11.4.18) defined by # = (NK, h, 13), with the constants of motion 
of Example 11.1.5, is nontrivial. This will be explained in Example 11.8.10. 


Let us summarize the above discussion: If one finds action and angle variables 
for an integrable system, then, locally, one has an explicit description of the Hamil- 
tonian flow and of the invariant tori of this system in terms of a (quasi)periodic 
motion on tori. Action and angle variables usually cannot be extended to global co- 
ordinates on the subset of regular points of the phase space. Instead, there may exist 
topological obstructions. This will be discussed in Sect. 11.7. 


Exercises 

11.5.1 Prove the equations (11.5.2) and (11.5.3). 

11.5.2 Verify the formulae in (11.5.5). 

11.5.3 Find action and angle variables for the planar two-centre problem, cf. Ex- 
ample 11.1.4. 
Hint. Use elliptic coordinates. 

11.5.4 Prove the following formulae in Example 11.5.4: (11.5.16), (11.5.18)— 
(11.5.22) and (11.5.23). 


11.6 Small Perturbations 


In this section, we show that the description of an integrable system in terms of ac- 
tion and angle variables is well adapted to the study of small perturbations. Thus, 
let (M,w, #) be a 2n-dimensional integrable system with Hamiltonian function 
Ho, possessing some compact connected component of a level set of #. By Theo- 
rem 11.4.3, in a neighbourhood of this connected component there exist action and 
angle variables #/, J; and dynamics is given by 


I, =0, v= dHo gy =a! (1. 
al; 


?3¢y! tends to zero with L tending to zero. 
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Assume that the system is non-degenerate, 


dw! 8? Ho 
det = det #0. (11.6.1) 
al; d1,01; 


Consider a small perturbation of this system, given by 


AY’, D=AD)+eM(,D, (11.6.2) 


where € is a small parameter and Hj is a Hamiltonian function which is 27 -periodic 
in the variables 7’. The Hamilton equations for this system read 


i@Q= = (910, I)),  #@ =o! (I(r) + ow, I(t). (11.6.3) 


We aim at finding an iterative canonical transformation (#, I) > @, 1 which makes 
the full system integrable, order by order in e. Let us discuss the first step of this 
procedure in detail: we seek the canonical transformation in terms of a generating 
function S = S(#, 9) of the second kind, that is, 


= 


25 os ~ 
B, I o = —(,D. 11.6.4 
“0, ), vA (11.6.4) 


We make the ansatz S(?, 1) =v jreSi(@, 1 and require that the Hamiltonian 


function in the new variables, given by H(#,1) = H(v,D, be integrable to first 
order in €, 

H(O,D = Ao) +eM (0) +7 Ay(0,D. (11.6.5) 
By (11.6.4), 


p, ono! 2 
0D, v=v (0D. (11.6.6) 


Plugging this in into the Hamiltonian (11.6.2) and expanding by powers of ¢, we 
obtain7+ 


= os = 
H0.D=H(di+e° 0D) 


7 0S} = = 0S) = 
= Ho( I+ e«—(v, I H I+e—(,I 
of ere (B, )) +e (9. Teas (8, ) 
aS} 


= Ho(I) + 7 Paar? JD +e7Ro(0, D + eH (0,1 + 67Ri (0, D 


24Since S is a function of the variables # and I, we express H in these variables. 
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= HD +e(w! DI. + H0.D) + 6%(Ro + RIOD 
(11.6.7) 
with 


Ro. = 4 ( M(I+e 0.) — mo) — wi 
08, D= 5 0 Tea, ed 0 (1) "ap 7 gee ’ 


= 1 = = Ss 
RU,D= =( (9.1+eS0.p) = (0,0). 


Now we choose I such that w (Io) is non-resonant.”> For ||I — Io|| being of order 
€, we may replace w(I) by @o := w(Io) in (11.6.7), thus producing a correction of 
order ¢*. Then, requiring that (11.6.7) equals (11.6.5) and comparing coefficients, 
we find 

Ho) = Ho(), 

- = = j, OS} = 

Bu = OT) Oy Oe (11.6.8) 

Ao(8, 1) = (Ro + Ri)... 


To analyze the second equation in (11.6.8), we use the Fourier expansion of S; and 
H, with respect to the angle variables %, 


SOD=S° KD, HOD= >> De®”, 
keZ" keZ" 


wihk-#=)-"_, k,9'. Then, this equation reads 
Ay) = D> _,,, ((@o- WS) + HD)e*”. 


This yields H, (I) = Ho(1) for k = 0 and i(wo-k) Sq (I) + Hy) = 0 fork 4 0. Thus, 
the solution reads 


oe wee _ Ad . 
AO=h%®, so@Dd=- YO BO ews, (11.6.9) 
i@o-k 
keZ"\ {0} 
and the transformed Hamiltonian function takes the form 
H(8, 1 = Hod) + eH) +e? (0, D. (11.6.10) 


>5Since the system is assumed to be non-degenerate, the tori can be labelled by their frequencies, 
and hence such Ip are dense. 
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If @o - k becomes very small, there is no hope that the Fourier series in (11.6.9) con- 
verges. This is the famous problem of small denominators, which we met already in 
Sect. 9.6 where we had shown that a symplectomorphism can be brought to normal 
form in the neighbourhood of an elliptic 4-elementary fixed point. The same was 
true for the Hamiltonian function in the neighbourhood of an elliptic 4-elementary 
critical point. We had seen that in a neighbourhood U of such a critical point the nor- 
mal form part of the Hamiltonian yields a foliation of U into invariant tori, defined 
by a set J; of constants of motion given by (9.6.11), and that, in this approximation, 
the system becomes integrable. In this context, the choice of symplectic polar coor- 
dinates on U yields action and angle variables. We had also noted there that in the 
cases under consideration the KAM theory applies, see Remark 9.6.8. This theory 
yields that non-resonant tori persist the perturbation caused by passing to the full 
system, provided they fulfil a strong non-resonance condition of Diophantine type. 
Here, we meet another situation where KAM theory is applicable. If wo is strongly 
non-resonant, cf. (9.6.21), that is, if there exist constants tT > 0 and y > 0 such that 


|@o -k| > y|IKI|‘, (11.6.11) 


for all k € Z” \ {0}, the Fourier series in (11.6.9) is convergent.° In this case, the 
generating function S exists and for tori fulfilling |/I — Io|| < e, at first order, the 
small perturbation yields a change of the frequency by a constant: 


=e 0H - = dH = 
wo (1) aT, wo (1) am” 


Moreover, on the torus defined by I, at first order we have a quasiperiodic motion 
with frequencies @/: 


i, Be 
I; =-~ =), v/ = — =o’. 
avd ol; 


We also note that, since I is of order ¢?, the condition \|{ — Io] < remains valid 
for large times, that is, times of order et, 
Now, the above described procedure must be iterated. In the second step, one 


starts with the Hamiltonian 
A(O,D = Ap) + e* Ap (0, D. 


This step removes the dependence on 7 up to order ¢*. After n steps, one arrives 
at order 2’. This is the reason why the KAM procedure is said to be superconver- 
gent. See [26] for further comments and historical remarks. Finally, one arrives at 


6Under the assumption that H; is analytic on a certain domain the proof is not difficult, see e.g. 


(286, §3.5]. 
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the following result.*” If the unperturbed system is non-degenerate in the sense of 
(11.6.1), for sufficiently small ¢ > 0, most non-resonant invariant tori persist and 
are only slightly deformed, so that in the phase space of the perturbed system there 
exist invariant tori densely filled with quasiperiodic integral curves. These invariant 
tori form a majority in the sense that the Lebesgue measure of the complement of 
their union is small for small perturbations. 


Remark 11.6.1 


1. The KAM method works for tori satisfying the strong non-resonance condition 
(11.6.11), provided both Ho and Hj are of class C! with ] > 2t +2 > 2n. This 
weakens the regularity assumptions made in the historical papers cited before 
considerably [221, 244, 259]. 

2. Let us make precise what it means that a majority of invariant tori persists: let 
A,,r be the set of frequencies fulfilling the infinitely many conditions contained 
in (11.6.11), with y and t kept fixed. It can be shown that A,,, are Cantor 
sets in IR” and that the union A; = Leg A,,r has full Lebesgue measure for 
t >n—1, whereas A, = @ for t <n — 1. That is, almost every @ belongs to 
A, provided t > n — 1. On the other hand, the parameter y turns out to limit the 
magnitude of the perturbation parameter through the condition that ¢ < y*. This 
means that for a given perturbation one cannot vary y arbitrarily but has to keep 
it large enough. Therefore, one takes a subset (2, of A,,, of frequencies whose 
distance from the boundary of the set 2 of all frequencies is at least y. These are 
Cantor sets whose complements in {2 have Lebesgue measure of order y. For 
more details we refer to the papers of Poschel [245, 246]. In [245], it is shown 
that the persisting tori form a family over the Cantor set §2,, which is smooth in 
the sense of Whitney. Thus, one may say that the perturbed system is integrable 
over this Cantor set. 

3. The above results remain true if the assumption (11.6.1) of non-degeneracy is 
replaced by 


a? Ho dHo 

det | 297 8% | Lo, (11.6.12) 
aHo 0 
ol; 


A system fulfilling this condition is said to be isoenergetically non-degenerate. 
In this case, the persisting invariant tori form a majority, in the sense explained 
above, on each energy surface. 


27See the list of references in Remark 9.6.8. Additionally, we draw the attention of the reader 
to Sect. 3.6 of the textbook [286] by Thirring, where the proof is provided for a special class of 
Hamiltonians. This way, the complicated KAM theory analysis becomes quite transparent. 
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11.7 Global Aspects. Monodromy 


In the present section, we use action and angle variables to study the global structure 
of the submanifold M ia of points which belong to a compact level set component 
of #%,. In particular, we derive topological obstructions to the existence of global 
action variables and to the existence of global action and angle variables on M ar : 
The presentation below is in the spirit of Duistermaat [80]. 

First, we discuss the existence of global action variables. Recall from Corol- 
lary 11.4.5 that the natural projection 


HEME eV (11.7.1) 


has the structure of a locally trivial fibre bundle with typical fibre T”. Since M, we 
is made up by compact level set components of .#%;, the Hamiltonian vector fields 
X 7, restrict to complete vector fields on M. ni . Hence, according to Example 6.1.2/4, 
their flows define an action W of R” on M or . By the Arnold Theorem 11.3.3 and 
Corollary 11.4.5, Ve is the orbit manifold of this action and Ean is the correspond- 
ing natural projection. Let 7; be action variables on W C Mx and let Ww! denote 
the action of IR” on W defined by the flows of the Hamiltonian vector fields X7,. By 
(11.4.5), the actions YW and W! on W are related by 


I 
Wy (m) = Ye" :5:(my™), (11.7.2) 
where b; : W — R” are smooth mappings uniquely defined by 
Xj,= bi) Xu,. 


In particular, YW and W/ have the same orbits in W. Hence, we may view the bundle 
structure of Mx over U = H,(W) as being induced by the action W/. Since all 
points of W have stabilizer 27 Z” under wW! this action descends to a free action 
W! of T? =U(1)" on W, given by 
ee vee eiln) (m) = Wi saith) (m). 

Over U, the action W/ turns (11.7.1) into a principal bundle with structure group 
T”. Thus, the existence of global action variables is related to the existence of a 
global reduction of the action W to a free action of T”. Via the relation (11.7.2), the 
latter is equivalent to the existence of a smooth assignment to X' € ve of a set of 
generators for the stabilizer of YW on 2%’. Hence, we have to discuss the existence of 
such an assignment. For that purpose, we first construct the period bundle mentioned 
earlier. Consider the subset Py of ve x R” made up by the pairs (2’, t) such that 
t belongs to the stabilizer of W on 2. According to Proposition 11.4.2/2, a set of 
action variables J; on W defines a local frame in the vector bundle ve x R" by 


Tire {bi (Z),...,bn(Z)} 
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and hence a local chart on He(W) xR Cc ve x R” which maps Py onto 
H;-(W) x 22Z”. This shows that Py is an embedded submanifold of Ve x R” 
and, in the smooth structure induced, a locally trivial Z-module bundle over Ve : 


Definition 11.7.1 (Period bundle) The bundle Py is called the period bundle of 
the integrable system (M, w, #7). 


The bundle projection 1?” : Py > ve is induced from the natural projection 
to the first factor in V“” x R” and local trivializations are given by the frames {b;} 
induced by action variables. Thus, a smooth global assignment to »’ € Vx of a set 
of generators for the stabilizer of Y on X' corresponds to a global frame in Pw, that 
is, it exists iff Pw is trivial. A criterion for the (non-)triviality of Py is given by 
the so-called monodromy, which will be constructed now. 

Since the fibres of Py are discrete, the projection x? is a local diffeomor- 
phism. Hence, for every curve y : [0, 1] > ve and every t in the fibre of Pv over 
y (0), there exists a unique curve 7 with 1? o py =y and (0) =t. This curve is 
called the lift of y to t. Via its lifts, y induces a mapping Pyw(y(0)) > Py(y()) 
by assigning to t the endpoint of the lift of y to t. This mapping is called the par- 
allel transport along y. The parallel transport along a composite curve y2 - yj is the 
composition of the parallel transport along 7; with the parallel transport along y2. 
Moreover, curves in yer which are homotopic with fixed endpoints generate the 
same parallel transport, because every smooth homotopy in Vv can be lifted to 
Py by means of a covering by local trivializations. Thus, by choosing Xo € ve 
and by assigning to a closed curve based at 2 its parallel transport, we obtain a 
group homomorphism from the fundamental group 7, ve , 0) of vee based at 
29 to the group of transformations of the fibre of Pv over Xo. 

To compute the parallel transport along the closed curve s +> y(s) based at Xo, 
we cover y by open subsets U; which admit action variables pe, 1=0,...,r. For 
te Py), we decompose t = anki b (Xo) with k! € Z and {b} denoting the 
local frame in VF x R” induced by J. As long as y stays in Up, the lift of y 
based at t € Py(2Xo) is given by y(s) = aki by (s)). When entering U;, we 


merely have to express each vector by (s)) in terms of the frame {bY y (s))}. 
Hence, according to Proposition 11.4.2/3, over U; the lift is given by 


7 (8) = 20k! Aid (y(s)) 


with a constant invertible integer matrix A. As a result, on the level of the coef- 
ficients k' with respect to the basis {2b (Zo)} in Py(o), the parallel trans- 
port along y is given by the product of the integer (n x n)-matrices appearing in 
the change of action variables along y. In particular, it is an automorphism of the 
Abelian group Py(). 


Definition 11.7.2 (Monodromy) The group homomorphism 


m(V% , Eo) > Aut(Py(Zo)) 
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which assigns to a closed curve its parallel transport is called the monodromy based 
at Lig of the integrable system (M, ow, #). 


As noted above, after having fixed a system of generators © oe P. '¥é( 20), the mon- 
odromy at X9 may be viewed as a homomorphism from 71 (V% , X19) to GL(n, Z). 
Accordingly, the image of a closed curve y under this homomorphism is referred to 
as the monodromy matrix of this curve. Another choice of generators results in this 
homomorphism composed with an inner automorphism of GL(n, Z), as does an- 
other choice of the base point 2p in the same connected component of Ve, Thus, 
up to conjugacy in GL(n, Z), the monodromy can depend on the connected compo- 
nent of Vi only. For simplicity, in the rest of this section we adopt the assumption 
that Vv or, equivalently, M ae is connected. In the general case, the results apply 
to each connected component of the bundle HF, :M. a > Vie : 


Theorem 11.7.3 (Global action variables) For an integrable system (M,w, #), 
the following statements are equivalent. 


Mx admits global action variables. 

. The monodromy is trivial. 

. The period bundle is trivial. 

. The action VY on Mx induced by the flows of the Hamiltonian vector fields X n, 
can be globally meaeed to a free action of T", ve is, there exists a free T”- 

action W on Mx and a smooth mapping x: v* x R” = T” such that d(x, -) 


t) (m) for all 


RWNe 


is a group homomorphism for all 37 € Vx and Y(m) = 
Mme Mv ; 


Pre Hee(m), 


The action W turns (11.7.1) into a principal bundle with structure group T” = 
Ud)”. 


Proof 1 = 4: This has been discussed above. es 
4 = 3: The mapping A induces a unique mapping A: Vie x R” > R" by 


ACE, t) = (eS, eH), 
One can check that the mapping 
dh: ve xR’ => ver x R", A(Z,t) = (2, A(2,t)) 


is bijective and has a pointwise injective tangent mapping. Hence, it is a diffeo- 
ago Composing the assignment of the standard basis {27re;} of R” to every 
Le ve with the inverse of this diffeomorphism, we obtain a global frame in Py. 

3 = 2: This is obvious. 

2 = 1: Since the parallel transport along an arbitrary closed curve in Ve is 
given by the product of the transformation matrices for the changes of the action 
variables along that curve, vanishing of the monodromy aes that any such prod- 
uct yields the unit matrix. Thus, for a given covering of MZ by systems of local 
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action variables, by choosing one such system and redefining all others by succes- 
sively transforming them with the inverse transformation matrices and subtracting 
the constant shifts, we obtain a system of global action variables, cf. point 3 of 
Proposition 11.4.2. 


Next, we discuss the existence of global action and angle variables. This discus- 
sion involves the Chern class of a principal bundle with structure group T” = U(1)”, 
for which we refer to the standard literature, see e.g. [166] or [279]. a8 oe we | 
vestigate under which conditions the locally trivial fibre bundle Hoe: Mx > Ve 
is globally trivial. Recall that M ae is assumed to be connected. 


Proposition 11.7.4 The bundle Her :M ae => Ve is globally trivial iff both the 
monodromy is trivial and the first Chern class of the principal T"-bundle so induced 
vanishes. 


Proof If the monodromy is trivial and the Chern class of the induced principal T”- 
bundle vanishes, this bundle must be trivial, because principal T”-bundles are clas- 
sified up to vertical bundle isomorphisms by their first Chern class. Conversely, if 
(11.7.1) is trivial, it suffices to show that the monodromy is trivial, because then, this 
bundle is a principal T”-bundle and since it is trivial, the first Chern class vanishes. 
Thus, let x: M” > ee x T” be a trivialization of (11.7.1). Realize T” as the 
n-fold product of the complex unit circle and consider the local diffeomorphism 


WiVZ xR" oF xT, W(E,0=x0Mox|(¥, (1... 0). 


Define a mapping yg: V.” x R" > V x R” as follows. For (Zt) € V* x R", 
the curve s +> y(Z, st) in ve x T” has a unique lift to vee x R” with respect to 
the covering 


p:V% xR" +V" xT, — p(Z,t):= (2, (e",...,e%)). 


Assign to (2, t) the endpoint of the lifted curve. Then, y = p og, hence @ is a 
local diffeomorphism. Since it is bijective, it is a diffeomorphism. Hence, it is a 
vector bundle automorphism of the trivial vector bundle ve x R”. Since it maps 
the period bundle P.v onto the trivial subbundle Ve x 21Z", Theorem 11.7.3/3 
yields that the monodromy is trivial, indeed. 


Theorem 11.7.5 (Global action and angle variables) Let (M,w, #) be an inte- 
grable system. Then, Mx admits global action and angle variables iff the mon- 
odromy is trivial, the first Chern class of the principal T"-bundle Hye : mM > 


Ve so induced vanishes and w is exact. 


28 Characteristic classes will be discussed in detail in volume II of this book. 
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Proof If Mv admits global action and angle variables 9, J;, the bundle Hor : 
M = > V4 is globally trivial and the triviality of the monodromy and the Chern 
class follow from Proposition 11.7.4. Moreover, w is exact with potential J;dv'. 
Conversely, if the monodromy and the Chern class are trivial, Theorem 11.7.3 
yields the existence of Blo a variables J; and Proposition 11.7.4 ae 
that the bundle Ken: Mx > ve admits a global section s ee > Mx 

Since w is exact, so is the 2-form s*w on es . Choose a potential and expand it 
with respect to the global frame {dJ;} (with the J; viewed as functions on vee ). 
Let # be the expansion coefficients. Define a new section § : Ve > mM by 
s(X):= wt ( sy (s(2 )). A brief calculation shows that s is Lagrange. It follows that 


the functions % on M ee defined by 


0! (W (5(D))) = 


combine with the functions /; to global action and angle variables on M as. : 


Remark 11.7.6 


1. According to the Hurewicz Theorem, see e.g. [55] or [199], if the first and sec- 
ond homotopy groups of M idl are trivial, so are the first and second de Rham 
cohomology groups. This implies that the monodromy and the first Chern class 
are trivial and that the 2-form w is exact. Hence, Theorem 11.7.5 yields that triv- 
iality of the first and second homotopy groups of Mx is a sufficient condition 
for the existence of global action and angle variables, a result which belongs to 
Nekhoroshev [228]. 

2. Let {(Wa, (Ya, Iy))} be an atlas of Darboux charts on Mv built from action 
and angle variables. The equations 0, = 0 define Lasrancian sections Sy over 
Ug = Ho c(Wey) i in the bundle (11.7.1). For each pair (a, 8B) with Wg 1 Wp 4 ©, 
there exists a closed 1-form Agg on Ug M Ug such that 


Sa(X') = Prop (Z)5p(%), 


where ¢ denotes the natural fibrewise action’? of T* Ve _on the fibres of the 
bundle (11.7.1), that is, on the level set components of #--. Note that Aap 1S 
not unique, for the following reason. By virtue of the global frame eA *dx!} in 
T*V: # where x! denote the standard coordinates on R”, one can identify the 
period Saincilé Py with a Z-module subbundle of T* 5% . It turns out that this 
subbundle is the stabilizer of ¢. Thus, to make Agg unique, one has to view it as a 
section in the quotient bundle T* ve / Py, which is a locally trivial T”-bundle. 
Note that this bundle is locally isomorphic to (11.7.1). In more detail, the action 
variables J, define an isomorphism 


Wa = T* (Ua) / Pe (Ua). 


°Defined by (8.6.15). 
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One can check that the system / := {Agg} defines a 1-cocycle on ve with values 
in the bundle T* Vv /Py in the sense of Cech, that is, an element of the first 
Cech cohomology, 


[Ale HEV”, z(TV" /P)). 


This cohomology element is called the Lagrange class of the bundle (11.7.1). 
The Lagrange class is related to the Chern class mentioned above as follows. 
There exists a natural coboundary operator 


8: (VX, Z(T*V /Px)) > H?(V, Px), 


which maps the Lagrange class to a cohomology class 4({A]) € HV, Py), 
called the Chern class of the bundle (11.7.1). If the monodromy is trivial, then the 
Chern class so defined coincides with the ordinary Chern class of the correspond- 
ing principal T”-bundle. One can show that the Lagrange class characterizes the 
bundle (11.7.1) up to bundle isomorphisms which are symplectomorphisms, that 
is, for a given Chern class, there exists a family of isomorphic bundles which are 
distinguished as symplectic manifolds by their Lagrange class. For details, we 
refer to Duistermaat [80], Dazord and Delzant [71], and Zung [317, 318]. 

Let us add that, on M ag , the action @ of T* Vy can be used to replace the ac- 
tion W of R” induced by the Hamiltonian vector fields X y,. This shifts the focus 
from the constants of motion in involution H;, whose choice contains some arbi- 
trariness, to the Lagrangian torus foliation they define, which is more geometric 
in nature. The description of integrable systems in terms of Lagrangian torus 
foliations generalizes to the situation where the foliation is only locally gener- 
ated by constants of motion in involution. For an introduction to symplectic toric 
manifolds, we refer to [28]. 

3. Recently, there have been successful attempts to include the singular points of 
A, with the ultimate goal of understanding the global topological structure of 
the full system, see Zung [317, 318]. 


Example 11.7.7 (Spherical pendulum) Consider the spherical pendulum, with the 
phase space TS?, realized as the level set of the mapping 


f=.f):7TRo>R, fi&y=|xl?-1, Ay =x-y, 
and with the Hamiltonian 
1 
H(x,y) = silyl? + x3. (11.7.3) 


According to Example 11.1.8, as constants of motion in involution we can choose 
Ay = A and Hp = J, where 


J(%,y) =x1y¥2 — x2y1 (11.7.4) 
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is the momentum mapping of the SO(2)-symmetry given by rotations about the 
x3-axis. Thus, # coincides with the energy-momentum mapping & studied in Ex- 
ample 10.8.10. In what follows, we will keep the notation & and we will use results 
from that example without further notice. The critical points of & are m4 = (xi, 0) 
with xi = (0, 0, +1). Correspondingly, the critical values are (h, 7) = (+1, 0). The 
level set of (—1, 0) consists of m_ alone, whereas the level set of (1,0) consists of 
m4 and an open cylinder over S!. All the other level sets are compact and connected. 
Hence, 


M* =TS?\{m4,m_},  V*% =v = &(TS*)\ {(-1,0)} 
and 


mM —TS?\e({41,0}), Vv =e(Ts’)\ {1,0}, 


cf. Fig. 10.8. In particular, V# coincides with the set of regular values of &. We 
will show that for the spherical pendulum, both the bundle (11.7.1) and the mon- 
odromy are non-trivial and that, therefore, neither global action and angle variables 
nor global action variables exist. 

The argument for the first assertion requires knowledge about the topology of the 
level sets of H. Since H has the same critical points as &’, it is a Morse function on 
T*S*. Hence, we can apply the results of Sect. 8.9. To determine the Morse indices, 
we observe that 


Tn (TS*) = kerd f (m+) = ker é = : : 2) 


so that Tn, (TS?) is spanned by the standard basis vectors e;, e2,e4 and es of R°, 
and we calculate*” the Hessian of X at m+ in that basis: 


Fl 0 0 0 
0 1 0 0 
Hess (XH) = 0 : 1 0 
0 0 01 


Thus, the Morse index is 2 for m+ and 0 for m_, that is, m+ is a non-degenerate 
saddle point and m_ is a non-degenerate minimum. 


Proposition 11.7.8 The energy surfaces of the spherical pendulum have the follow- 
ing topological structure. 


1. Forh =—1 we have H~!(h) = {m_}. 

2. For -1 <h <1, the energy surface H~'(h) is diffeomorphic to the 3-sphere S°. 

3. For h = 1, the energy surface H~'(h) is homeomorphic to the unit tangent 1- 
sphere bundle TS? with the fibre over x4 contracted to a point. 

4. For h > 1, the energy surface H~!(h) is diffeomorphic to T;S* = SO(3) = RP?. 


30This was posed as Exercise 10.8.7. 
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Proof 1. This is obvious. 
2. The Morse Lemma 8.9.4 provides local coordinates &1, ..., &4 in a neighbour- 
hood of the critical point m_ such that 


1 
H (Ei, £2, §3, 4) =—1+ 5 (Ef +++ + &4)- 


Thus, for values h close to —1, the level set H~!(h) is diffeomorphic to the 3-sphere 
S. By the Morse-Isotopy Lemma 8.9.6, this remains true for all h < 1. 

3. and 4. In case h > 1, for all (x,y) € TS?, we have x3 < h. Consequently, y 
belongs to the 1-sphere in T,S* defined by 


1 
sli? =A — 33. 


This proves point 4. If h — 1, the 1-spheres over the points x ¢ x4 persist, whereas 
the 1-sphere over x; degenerates to a single point. 


Corollary 11.7.9 For the spherical pendulum, the bundle Hoe :M bam => Ve is 
nontrivial. In particular, global action and angle variables cannot exist. 


Proof We show?! that there exists a closed curve y in Vo such that the subbundle 
E7! (vy) > y is non-trivial. Choose y to wind once around the critical value (1, 0) 
and to touch the two boundary curves of ve in one point each, see Fig. 11.3(a). 
At the points where it touches the boundary, cut it into two closed pieces yo and y1. 
Deform these pieces diffeomorphically, and with endpoints on the boundary, into 
horizontal line segments ap and a; running through all points (ho, j) and (Ay, /) 
in Ve , respectively, where ho < 1 and hy; > | are fixed energy values. If the bun- 
dle €-'(y) > y was trivial, that is, if it was isomorphic to the product bundle 
T° x y, the manifolds &~!(yo) and &~!(y;), and hence the manifolds &~! (ag) 
and &~!(a1), would be homeomorphic. However, &~! (ag) = H~!(ho) = S* and 
&— (a1) = H7!(h1) S SO(3) are not homeomorphic. 


Next, we calculate the monodromy. The fundamental group m1 (V% ) is gener- 
ated by any closed curve y which winds once around the critical value (1,0). We 
have to find the parallel transport of a basis of the period lattice along y. For that 
purpose, we have to construct action variables. In spherical coordinates ¢ and 3 on 
the unit 2-sphere, given by 


x, =cosd¢sin#, x2 = sing sin?, x3 =cosv, 


and their canonical conjugate momenta pg and py, the momentum mapping 
(11.7.4) and the Hamiltonian (11.7.3) take the form 


ps, Pe 
J=Do¢, H=*4+ +cosv. 
ee 2 2sin? 3 


3! The argument belongs to Cushman. 
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h 
eat 


(a) 


Fig. 11.3. The curves in ad used in the prof of Corollary 11.7.9 (a) and in the computation of 
the monodromy (b) 


Thus, the Hamilton equations read 


+sind, =? b=py. (11.7.5) 
sine ? 


2 
; Pg COS 0 
= 0, => —_——_ 
m sin? 3 


For a given regular value (h, 7), we choose the fundamental cycles y; and y2 as the 
integral curve of X 7 and Xy, respectively, through a chosen point of &~'(h, j). 
According to (10.8.14), y1 is given by } = const and it is parameterized by the 
variable @. Since H is a constant of motion, along 72, we have 


p> =2(h —cosv) — 


sin? 9 


Now, the action variables are given by 


qi -| pedo = 2 j, 
Y1 


OL i 
b= [ podd =2 [ 2(h — cos?) — — 5 dt, 
Y2 oe sin 


with 3 denoting the solutions of the equation 2(h — cos #) — j 2 sin-* 9 = 0. Note 
that the integral is of elliptic type and cannot be solved in terms of elementary 
functions. For the corresponding transformation of frames 


X;, = bi" Xm, 


where H = H and Hy = J, we find b;* (h, j) = on (h, j), that is, 


bh, j)=0, 9 bi*(h, j) = 20 
and, by (11.7.5), 


. Ui : of” dv of di? i dt 
2, Ls J = = — —_—— 
: [2th — cos 8) — y 0 c 
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ols dv %+ dao 
nth, =-2) [  — =o fo 
9 sin 9 [20h — cost) — —4— o 


sine 0 
Oy 
=f dg. 


Thus, T := b2!(h, j) is the time needed for running through one period of the re- 
duced dynamics and A¢ := b27(h, j) is the corresponding increase of @. T is called 
the time of return and Ad is called the rotation number. A careful analysis, see [69], 
yields the following result: 


(a) T is a uniquely defined real analytic function on Vy. 
(b) Ad is a locally unique real analytic function on Vv fulfilling 


lim Ag = 
jn, a¢ 


ie for —1<h<1 (11.7.6) 


—2n forh>1. 


This can be understood heuristically by looking at the effective potential of the 
reduced dynamics. 


Now, we choose y to run along the edges of the square with corners 


(ho, — jo), (h1, —jo), (h1, jo): (ho, jo): 


where ho < 1, hy > 1 and jo > 0, see Fig. 11.3(b). Obviously, the parallel transport 
of bi (ho, — jo) along this curve is trivial and the component by! does not change 
as well. Let us calculate the change in the component by? (ho, — Jo) = Ad: we can 
choose jo arbitrarily small. In the limit jo — 0, using (11.7.6) and the obvious rela- 
tion Ag(h, j) = —A¢(h, —{), for the first line segment we obtain 

lim (Ag (1, — jo) — Ad (ho, —jo)) = 27 —m =n. 

jio> 
For the second and the fourth line segments there is no contribution and for the third 


segment we obtain again z. Thus, the parallel transport along y yields the following 
transformation of the frame at (ho, — jo): 


br by, by bo + dy. 


Thus, the monodromy matrix is given by 


dels a) 


In particular, the monodromy is non-trivial, so that the spherical pendulum does not 
admit global action variables either. 


Remark 11.7.10 Another example with nontrivial monodromy is provided by the 
Lagrange top, see [69]. We stress that nontrivial monodromy leads to interesting 
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quantum effects. These are studied for example in molecular physics, see [113] and 
[83] and the references therein. 


11.8 Non-commutative Integrability 


As already noted at the end of Sect. 11.4, the occurrence of degeneracies is of- 
ten related to the fact that a system possesses more than n functionally indepen- 
dent*” constants of motion.*? This situation has been considered in an early paper 
by Nekhoroshev [228]. Below we give a proof of this result. Moreover, we prove 
the Mishchenko-Fomenko Theorem which applies when the constants of motion 
close under the Poisson bracket. For both of these results, one needs the following 
classical theorem, which generalizes the Liouville Theorem 11.3.1. 


Theorem 11.8.1 (Carathéodory-Jacobi-Lie) Let (M,q@) be a symplectic manifold 


of dimension 2n and let f\,..., fi, 1<n, be independent smooth functions in invo- 
lution. Then, for every m € M, there exist (2n — 1) smooth functions fi+1,..., fon 
on an open neighbourhood of m such that f\,..., fon are Darboux coordinates, 


wo=dfi Adfn4i+---+dfn a dfon- 


Proof We show that on some neighbourhood of m in M, the family {fi,..., fi} 
may be extended to a family {f|,..., fr} of independent functions in involution. 
Then, the assertion follows from the Liouville Theorem 11.3.1. 

Since w(X f,, X fj) ={fi, fj} =O foralli, 7 =1,...,/ and since the differentials 
dfi,...,df) are linearly independent, they generate an isotropic subbundle E of 
T*M. Leta =df) A---Adf; and consider the closed (2n — /)-form 


B=aro"”, 


Obviously, all the characteristic subspaces ker 8, have the same dimension. Hence, 
ker 6 coincides with the characteristic distribution** D* of 6 which by Proposi- 
tion 4.2.20 is integrable. Thus, the Frobenius Theorem implies that there exists 
an adapted chart (U,«) such that D* is spanned over U by the / vector fields 
02n—I+1,+---, 02, and the annihilator (DF )0 is spanned over U by the 1-forms 
dx!,...,d«2"~. It is not hard to see that (DB) is IT-orthogonal to E, where IT 
denotes the Poisson bivector defined by w (Exercise 11.8.1). Thus, every function 
«',i=1,...,2n —1, Poisson-commutes with every function f;. Choosing one of 
them and denoting it by f/41, we end up with (/ + 1) independent functions in invo- 
lution. This procedure can be iterated until we obtain n functions in involution. 


32For convenience, throughout this section we will assume the constants of motion under consid- 
eration to be functionally independent on the whole of M. 


33Qne usually speaks of non-commutative (Liouville) integrability or of superintegrability. 
4Cf. Definition 4.2.18. 
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Remark 11.8.2 The Carathéodory-Jacobi-Lie Theorem can be further generalized 
as follows. Let f,,..., ff and, ..., 4% be independent smooth functions fulfilling 


(fi, fj} =9, {hy, hs} =0, (fi, hr} = dir. 


Then, locally, there exist smooth functions f741,..., fy and hp41,..., 4» comple- 
menting the above functions to Darboux coordinates. This result is usually referred 
to as the Cartan-Lie Theorem, see [181, §III.13] for a proof. 


Theorem 11.8.3 (Nekhoroshev) Let there be given (n + k) independent functions 
M,..., Hn4x¢ on a 2n-dimensional symplectic manifold (M, w). Assume that all of 
these functions are in involution with the first (n — k) functions. Let X* be a compact 
level set component of the mapping H = (M,..., Hn4“): M > R"**. Then, = 
is isotropic and diffeomorphic to an (n — k)-dimensional torus. Moreover, there 
exist coordinates 0',..., gn-k a. eee a, T,,..-,In—~ and pj,..., Pk onan open 
neighbourhood U of & such that 


(a) the symplectic form is given by w= ae dl, Advi+ ae dp; \dqi, 
(b) the flows of the Hamiltonian vector fields Xj, on U are complete and 21- 


periodic, 
(c) the coordinates I; can be written as functions of H,,..., Hy—~ and the coordi- 
nates p; and q/ can be written as functions of H,,..., Hn+k. 


Due to the periodicity of the flows of the Hamiltonian vector fields X7,, the co- 
ordinates J; and 2! are referred to as generalized action and angle variables. For 
k = 0, we obtain the Arnold Theorem 11.3.3 and the existence theorem 11.4.3 for 
action and angle variables as a special case. Thus, the proof of the Nekhoroshev 
Theorem, to be given below, yields an alternative existence proof for action and 
angle variables in the case of an ordinary integrable system, cf. Remark 11.4.6. 


Proof Since 
Xy,(H;) = (Hi, Hj} =9, i=l,....n—k, j=l,...,.n+k, 


the Hamiltonian vector fields X7,,..., XH,_, are tangent to all level set components 
of .#. For dimensional reasons, they span the tangent spaces and thus, every level 
set component is isotropic. Since X’' is compact, the restrictions of Xy,,..., XH,_; 
to © are complete. Hence, their flows define an action of R’~* on XY. Since they 
are linearly independent and since 2’ has dimension 2 — (n+ k) =n —k, the orbit 
mapping of any point is a local diffeomorphism. By the same argument as in the 
proof of the Arnold Theorem we conclude that the action is transitive. Since 2’ is 
compact, the common stabilizer of the action is an integer lattice in R”~*, generated 
by elements bj, ..., by—~ € R”~*. This implies that ¥ is diffeomorphic to T” —k and 
that the vector fields 


Xi = bi! Xu, (11.8.1) 
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on 2% have 2z-periodic flows. Moreover, since the constants of motion 
(M,,..., H,-~) commute, we have 


o(XH,,XH;) ={Hi, Hj} =9, i,j=l,...,.n—k, 


that is, X’ is isotropic. Using the Tubular Neighbourhood Theorem for the subman- 
ifold X and the Implicit Function Theorem, one can show that there exists a dif- 
feomeorphism @ from a neighbourhood W of © onto B x T’~*, where B Cc R"+* 
is some open ball, such that przg o® = #. We conclude that all level set compo- 
nents in W are diffeomorphic to Y = T”~* and by the same arguments we obtain 
27 -periodic vector fields X; on each level set component, defined by (11.8.1). We 
choose the generators b; so that, for every i, the integral curves of X; on different 
level set components are homotopic in W. 

Now, let y; be the image of the integral curve of X; through mo € » and let yj; (m) 
be a closed curve through m € W which is homotopic to y; and which is contained 
in the torus (m) through m. The curves yj(m),..., Yn—«K(m) form a system of 
fundamental cycles in this torus. Next, using the Poincaré Lemma, we want to show 
that on W there exists a potential form t of w. For that purpose, it is enough to 
note that w vanishes on every 2-cycle of W. This is the case, indeed, because via 
the diffeomorphism ® every 2-cycle on W is homotopic to a 2-cycle on 2. Then, 
the isotropy of &' implies the assertion. Thus, let us choose a potential t and let us 
define 


1 
I;(m) =s/ t. (11.8.2) 
yi(m) 


These functions are well-defined and smooth,*> because all tori foliating the neigh- 
bourhood U are isotropic and thus, the 7; depend on the homotopy class of y;(m) 
only. 

Now, we will show that the functions /; are in involution and that the Hamilto- 
nian vector fields generated by them coincide with the 27 -periodic vector fields X; 
defined by (11.8.1). Then, an application of Theorem 11.8.1 will yield the assertion 
of the theorem. For that purpose, let ®%/ be the flow of X H;- By the assumptions 
of the theorem, the foliation of W into tori is invariant with respect to 4, that is, 


0) (Z(m)) = £(,"' (m)). We calculate 


(Hy, i \(m) d I pHi ) 1d i; 1d / 
i, li}(m)=— I, 0 m) = —— t= T. 
_ dij 2m dt to Jy,(@;/ (m)) 2m dt to Jo,/ oy; (m) 


Let A(t) be the 2-dimensional surface obtained by acting with ot ’ on y;. Then, 


d 1 1 
— Ve r=tim=(f, r- | r) = tim - o=0, 
At todo, Joy(m) 9 t \JoV oy) yilm) 10 t Jar) 


35To see this, note that one can choose the family {y;(m)} to be differentiable in m. 
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because A(f) isotropic. Indeed, its tangent spaces are spanned by Xy, and a linear 
combination of the vectors (X#,,..., XH,_,) commuting with X7,. We conclude 
that 


{Hj,lj}=0, jal,....n+k, i=l,....n—k. (11.8.3) 


Since @ is non-degenerate, it follows that on W we can express the variables /; 
as smooth functions of (H,..., Hn—x). Therefore, X7, = 3H X H; and hence the 
functions J; are in involution. 

Let us calculate the Jacobi matrix a Let m € &, let 6; C » be the integral 


curve through m of the 27r-periodic vector field X; and let gy! be the flow parameter 
on ;. Consider the line segment 


I = {(Hi(m),..., Hj-1(m), Hj(m)+s, Hj+1(m),..., Hn+x(m)) :0<s <t} Cc B 


and take the 2-dimensional surface S(t) = ®—!(I, x ®(B;)) C W. Then, we have 


ol; . ol a. oll 1 2 ‘ 
(m) = lim —— Tt = lim — o=— w(d;, Xj)d¢’, 
0H; t>0 27t aS(t) t>0 27t S(t) 20 0 


where 0; denotes the j-th partial derivative in the coordinate system defined by @. 
Using (11.8.1), we conclude 


(m) =b i( ) ex (11 ) 
0H; : ; , 


and thus X;, = X;,i = 1,...,n — k. Thus, the vector fields X7, are 27-periodic. 
Since the functions /; are in involution, we can apply Theorem 11.8.1, which tells 
us that for any m € 2’, there exists a neighbourhood U in M and smooth functions 
Decapoda” and pj,..., px on U such that 


ory = dl; Add! +dp; Adq!/. 


Let 61 be the flow of X7, and let wi! ‘= @/! O-++-+0 O! be the corresponding 
action of R’~*. This action is transitive on every level set component. Since the 
functions g/ and p ; are constant along the tori, via this mapping they trivially extend 
to functions on W = Wank (U). To extend the functions 2%’, note that on U we have 
Xj, = dy: and thus 


o (Wi (m)) = 0! (m) +1 (11.8.5) 


for all m € U and t € R"~* such that wy! (m) € U. Thus, for a covering of W by 
open subsets Ux := Y (U) with appropriately chosen t, € R”~*, k € Z, one can 
define the extension of 1! to Ux by setting 


Hy, (m) = 01 (Wy (m)) + t, 
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because (11.8.5) ensures that the functions so defined coincide on any nontrivial 
intersection of the U; and thus combine to smooth functions (mod 277) on W. In 
order to show that the functions q/, pj and 2 complement the functions J; to 
Darboux coordinates on W, it suffices to prove that the mapping W — R*” defined 
by #, I, q and p is injective. The latter follows from the transitivity of W/ and 
(11.8.5). 


The Nekhoroshev Theorem immediately implies 


Corollary 11.8.4 Let (M, w, H) be a 2n-dimensional Hamiltonian system such that 
there exist (n +k) constants of motion fulfilling the assumptions of Theorem 11.8.3. 
In the Darboux coordinates #', q/ , Ij, pj; provided by this theorem, the Hamiltonian 
function H depends on I only and the Hamilton equations read 


. dH 


i; =0, bal =«/(D, pi = 0, gi =0. (11.8.6) 
ol; 


Thus, the integral curves of the system are located on (n — k)-dimensional tori and 
the motion is quasiperiodic with frequencies being functions of the constants of 
motion I, ..., In—k. 


Another important class of non-commutatively integrable systems is the fol- 
lowing, studied by Fomenko and Mishchenko [214]. Let (M,w, H) be a 2n- 
dimensional Hamiltonian system, let p be an integer between n and 2n and let 
H\,..., Hp be functionally independent constants of motion. Let g be the linear 
subspace of C° (M) spanned by the functions H;. Assume that 


(a) the Hamiltonian vector fields X y, are complete, 
(b) g is closed under the Poisson bracket, that is, {H;, Hj} = cf Ay with cf ER. 


By assumption (b), g is a Lie algebra of dimension p. It acts symplectically from 
the right on M by the homomorphism g — X(M), Ate X4. By assumption (a), 
this action of g integrates to a symplectic action W of the corresponding simply 
connected Lie group*® G on M, given by 


Pexp, A(m) = D}\(m) 


for all A € g and me M, where ®4 denotes the flow of X A- One can show that the 
adjoint action of G on g is given by 


Ad(a!)A= AoW (11.8.7) 


forall Ae g andae G (Exercise 11.8.2). Note that in the special case where p =n 
and g is Abelian, G = R” and we are in the situation of an ordinary integrable 
system whose Hamiltonian vector fields X q, are complete, cf. Sect. 11.3. 


36R very finite-dimensional abstract Lie algebra is the Lie algebra of a simply connected Lie group 
G, which is unique up to isomorphy [302, Thm. 3.28]. 
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Since, by definition, we have A, = X,4 for all A € g, the mapping J : M > g* 
defined by 


(J(m),A)=A(n), Aeg, (11.8.8) 


is a momentum mapping for the action YW. As a consequence of (11.8.7), it is equiv- 
ariant (Exercise 11.8.2). If we identify g* with R” by means of the basis dual to the 
basis {H;} in g, J coincides with the mapping #7” = (H),..., Hp»). Since the Hj are 
functionally independent, J is a submersion. This implies the following. 


(a) The level set components of J are embedded submanifolds of dimension 2n — p. 
They are the maximal integral manifolds of the regular distribution ker J’, cf. 
Example 3.5.4/4. 

(b) Corollary 10.2.2/2 implies that the stabilizer Gy is discrete, that is, gj, = 0. 
Hence, the action of G is locally free. Consequently, the kernel kerW = 
(mem Gm is discrete and the quotient group G := G /kerW has Lie algebra 
g and universal covering group G. The induced G-action on M is denoted by 
W. It is obviously effective and locally free. 

(c) The image of J is open and thus it intersects the principal stratum of the coad- 
joint action, that is, it contains an element jz such that the Lie algebra g, of 
the stabilizer G,, has minimal dimension. This implies that g,, is Abelian, see 
[77]. Hence, the identity connected component Gi, of G, is Abelian, too. Us- 
ing Proposition 6.2.2/3 and Formula (6.2.3), we determine g,, explicitly (Exer- 
cise 11.8.3): 


gy, = {A eg: {A, B}(n) =0 for all Beg, me J(u}. (11.8.9) 


Let ys € g* be a value of J. Since, by equivariance, J~!(jx) is invariant under the 
action of the stabilizer G,, of jz under the coadjoint representation, and since the 
induced action of G, on J =f (jt) is locally free, 


dimg, < dim J~'(u) = dim M — dimg, (11.8.10) 


where the equality is due to the fact that J is a submersion. In the special case 
of an ordinary integrable system, g is Abelian, and hence dimg, = dimg =n = 
dim J~!(). A reasonable generalization to the present situation is to assume that 
dim g, = dim J ~! (2), which is equivalent to 


dim M = dimg + dimg,. (11.8.11) 


Let us assume that there exist u € g* which fulfil this equality and belong to the 
principal stratum?’ of g*. Then, (11.8.10) implies that the image of J is completely 
contained in this stratum, that is, for every value yw of J condition (11.8.11) holds 
and g,, is Abelian. Under this assumption, we have the following generalization of 
the Arnold Theorem. 


37The subset of elements with minimal stabilizer under the coadjoint action. 
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Theorem 11.8.5 (Mishchenko-Fomenko) Let H),..., Hp,n < p < 2n, be indepen- 
dent functions on a 2n-dimensional symplectic manifold (M, w) whose Hamiltonian 
vector fields X 4, are complete. Assume that this system closes under the Poisson 
bracket, thus spanning a p-dimensional Lie subalgebra g of C~ (M). Let G be the 
induced effective symmetry group and let & be a level set component of the asso- 
ciated momentum mapping J with value J(2’) = jw. Assume that g, belongs to the 
principal stratum of the coadjoint action of G and satisfies (11.8.11). Then, X is 
an orbit of the identity connected component G,, of Gy and hence diffeomorphic to 
T’ x R2"-P— for some 0 <1 <2n-— p. 


Proof Let mo € XY. Since J —l(n) is invariant under the action of G,,, the level 
set component » is invariant under the action of the identity connected component 
Gi According to (11.8.11), the Killing vector fields of the action of Gt, on 2 span 
the tangent bundle of »’. Hence, for every m € 2’, the orbit mapping c. > dv, 
at» W,(m), is a local diffeomorphism. Using this, by the same argument as in the 
proof of the Arnold Theorem, one can show that G acts transitively on 2’. Then, 
the Orbit Theorem 6.2.8 implies that 2’ is diffeomorphic to the quotient of Gy with 
respect to the stabilizer G,, of some*® point m of Y. Since Gu is Abelian, so is 
C Hence, the quotient G), /Gm is an Abelian Lie group. Since G,, is discrete, 


G /Gm has dimension 2n — p and is, therefore, isomorphic to T’ x R2”~?~! for 
39 


some / between 0 and 2n — p 


Remark 11.8.6 


1. In Sect. 11.3 we observed that in the case of commutative integrability, symplec- 
tic reduction of the level sets of J yields discrete reduced phase spaces. Here, un- 
der the assumption that Eq. (11.8.11) holds, we have a certain non-commutative 
analogue of this situation: by Theorem 11.8.5, the topological quotient 


J" (w)/Gy = (J7"(u)/G))/(Gu/G)) 


is discrete and hence trivially a symplectic manifold, which may be interpreted 
as the reduced phase space at ju. 

2. Under the additional assumption that the G-action be free and proper, the theory 
of regular symplectic reduction applies: the level set M, = J ~! (2) is an embed- 
ded submanifold, G,, acts freely and properly on M,, and the discrete reduced 
phase space M,,/G,, results from the Regular Reduction Theorem 10.3.1. The 
level set 4%, = J ie (G,), with @,, C g* denoting the coadjoint orbit through 1, 
is a submanifold of M diffeomorphic to M, xg, G, see (10.3.14). Since My, /Gy, 
is discrete, .@,, is diffeomorphic to a direct sum of group manifolds G,,. Since 


38In fact, since G is Abelian, all points have the same stabilizer. 


3° Every finite-dimensional connected Abelian Lie group is of this form, see Exercise 18 in Chap. 3 
of [302]. 
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G,, acts freely on .%,,, the Killing vector fields generated by the elements of 
9, Span an integrable distribution on .4,,, which yields a foliation of .4,, into 
leaves diffeomorphic to Gi. 


Under the additional assumption that the G-action is free and proper, we have 
the following non-commutative analogue of the existence theorem 11.4.3 for action 
and angle variables. 


Theorem 11.8.7 Let (M,w) be a 2n-dimensional symplectic manifold satisfying 
the assumptions of Theorem 11.8.5. Assume, in addition, that the G-action be free 
and proper. Then, the following holds. 


1. There exists an open neighbourhood U of the origin in Sn and an anti- 
equivariant symplectomorphism ® from a G-invariant open neighbourhood W 
of &' in M onto G x U, endowed with the symplectic form 


w(g,v)((L,A1, 01), (Li, A2, 02) = (01, A2) — (02, A1) — (ue + v, [A1, Aa) 


and the action of G by left translation on the factor G. ® maps G- X' to G x {0} 
and satisfies 


Jo@ (a, v) = Ad*(a)(u + v). (11.8.12) 


2. There exist smooth functions I, ..., I2n—p on W in involution whose flows are 
complete and generate an action of R*"~? on W with the common stabilizer 
2nZ! x {0} C R*"-?. The orbits of this action are the level set components of 
J in W and the level sets of the mapping I = (,..., Iyn—p) are the G-orbits 
in W. 

3. On some neighbourhood Wo of X& in W, which can be chosen to be a union 


of level set components of J, there exist smooth functions*®® #',...,9?"~?, 
@ assy?” and pi, 2.5 Pp—n Such that 
2n—p F F 
_ i . j 
o=)), diy Add! + > dp; Adq!/. (11.8.13) 
j=l 


The functions I;, q/ and p j parameterize the level set components of J in Wo 
and the functions 3' provide coordinates on the latter. 


As in the situation of the Nekhoroshev Theorem, by a slight abuse of language, 
the coordinates provided by this theorem are referred to as generalized action and 
angle coordinates. Note that the #/+!, ..., 32"~P do not provide angle variables. 


Proof 1. First, we use the anti-isomorphism of Lie group actions given by the iden- 
tical mapping of M and the inversion mapping of G to turn the right action W on 


with o!,..., o! being multi-valued mod 27. 
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M~* into a left action W". Both actions have the same orbits and the same invariant 
subsets. For ¥', the Symplectic Tubular Neighbourhood Theorem 10.4.4 yields a 
w-equivariant symplectomorphism ©® of an invariant open neighbourhood W of 
the orbit of some m € & onto the subset 6(W) = G xq, (m* ® V) of the twisted 
product E = G xq,, (m* @ V), where m is a vector space complement of g,, in 
Gu, cf. (10.2.6), and V is some symplectic slice at m, and where m* and V are open 
neighbourhoods of the origin in m* and V, respectively. Since Gm is trivial, we have 
m = g,. Then, (10.2.11) and condition (11.8.11) imply V = 0, so that we end up 
with E = G x U, where U is some open neighbourhood of the origin in Bye Since 
» is contained in the orbit of m, it is contained in W and we have G- Y =G-m. 
Hence, it is mapped onto the zero section of E. Finally, in this special situation, 
the symplectic form on (W) is given by (10.4.12) and the momentum mapping is 
given by the normal form (10.4.15). 
2. Choose a basis {F),..., Z2n—p} in g, and define J; : W > R by 


Ii(m):= (Pigs oP(m), E;\, 


where Pgs | G x i > oi denotes the natural projection. Obviously, the level 
sets of J = (,..., lon—p) coincide with the G-orbits in W. To determine the 


Hamiltonian vector field X7,, we pass to G x U via ®. Denote x I, = ®,X1,. For 
(a,v)EeGx gi, and a tangent vector (B,,0) with B eg ando € a, we compute 


(dJi)(a,v) (Ba, ©) = (0, Ei). 
With the ansatz (X I; )(a,v) = (Aa, p), the defining equation for xX 1, Teads 
(0, B) — (0, A) — (w+ v, [A, B]} = —(o, Ei) 


for all Be gandoe ae Since g,, is the stabilizer of w, we have ad*(£;) = 0. 
Since g,, is Abelian, ad*(£;)v = 0. Hence, A = E;, o = 0 is a solution, and it is, 
therefore, the unique solution. Thus, (X I; (a,v) = (CEj)a, 0) and the flow is complete 
and given by 


(s, (a, v)) > (aexp(sEj), v). 


Since g,, is Abelian, the vector fields x 1, commute pairwise. Hence, according 
to (6.1.5), their flows define an action W/ of R2”-? on G x U. Due to Proposi- 
tion 5.3.10, this action can be written in the form 


2n—p 
WI! (a, v) = (><s0( ~~ iti). t=(t,...,fn—p): (11.8.14) 


i=1 
The corresponding orbit mapping of the point (1,0) induces a Lie group homo- 
morphism from R?”~? to Gh: Since ce is isomorphic to T! x R2”~?~!, the basis 
elements E; can be chosen so that the kernel of this homomorphism is 2nZ! x {0}. 
Then, this is also the common stabilizer of all points of G x ie 
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Next, we return to W. The Hamiltonian vector fields X;, commute and hence 
their flows define an action W! of R*”~? given by W1 = @o Ww! o @~!. Then, 
(11.8.14) reads 


2n— p 
u! 90(0.9=0(aex0( 5 vti).v) (11.8.15) 
i=l 


We show that the orbits of W/ coincide with the level set components of J in W. 
Let m = ®(a,v) € W and let »'(m) be the level set component of J containing m. 
By (11.8.12), 


J(m) = Ad*(a)(u + v). 


Since v € Sh and g, is Abelian, gy¢m) = Ad(@)gyu+v D Ad(a)gy. Since g, has 
minimal dimension, we can choose U so that g,,4, has minimal dimension for all 
v €U. First, this implies 9,4) = g, and hence gyqm) = Ad(a)g,,. Second, in view 
of the first assertion, this implies that 2’(7m) is the orbit of m under the action of the 
identity connected component Bin) of Gj(m). By the previous argument, Gy a= 
aG),a~'. Finally, since for b € G), we have W,,,-1(m) = ®(ab“', v), (11.8.15) 
implies that the orbit of aGia! through m coincides with the W/-orbit. Finally, 
the functions J; are in involution, because their Hamiltonian vector fields commute. 

3. This follows by the same arguments as in the Nekhoroshev Theorem 11.8.3. 


Remark 11.8.8 If the functions H),..., Hp are constants of motion with respect to 
a given Hamiltonian function on (M, w), we have a corollary completely analogous 
to Corollary 11.8.4. In particular, in the generalized action and angle variables pro- 
vided by the theorem, the dynamics of the Hamiltonian system is given by (11.8.6). 


Remark 11.8.9 


1. The Mishchenko-Fomenko Theorem can be generalized in various directions. In 
particular, one can weaken the assumption that the functions H; form a finite- 
dimensional Lie algebra by requiring that 


(Hj, Hj} = Pij(A), 


where Pj; is a matrix of constant rank, see e.g. [89], [91], [261] and the references 
therein. In [261] a nice discussion of non-commutative integrability of the Kepler 
problem can be found. We also recommend the survey article [154]. 

2. One can prove that non-commutative integrability implies ordinary integrability, 
see [214] for a restricted class of Lie algebras (including semisimple ones) and 
[258] for the general case. 

3. If we choose W small enough so that for every level set of J it contains at 
most one connected component, the space of level set components in W may 
be identified with J(W) Cc g* and the corresponding natural projection with 
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J: W— J(W) Cc g*. On the other hand, combining the G-equivariant sym- 
plectomorphism @ with the natural projection Pl ge : Gx oi, > os we obtain a 
smooth mapping 


FI = ptgs oP: W > Gy. 


By definition, .4% = St [, E;. Since the fibres of this mapping are foliated by 
level set components of J, there exists a smooth mapping z : J(W) > ei, such 


that the diagram 
I J 
(11.8.16) 


Tr = J(W) Cc g* 


commutes. One says that J;w and .% define a bifibration of W. The fibres of 
J;w are isotropic and the fibres of .% are coisotropic. We show that x !(v) 
coincides with the coadjoint orbit through jz + v. Indeed, using (11.8.12) and the 
equivariance of J we obtain 


a !(v)=J(4(v)) = J(G- 1 (4, v)) = Ad*(G)(J(@7"(4, v))) = Our. 


In particular, the fibres of 2 coincide with symplectic leaves of the Lie-Poisson 
structure of g*, discussed in Example 8.2.18/3. Via J, the generalized action and 
angle variables 9', q/, Ij, pj; induce coordinates qi, Ii, pj; on g* and, via -f, 
coordinates /; on Bie which in this context is often referred to as the action space. 
Let us add that the above bifibration defines a dual pair in the sense of Weinstein 
[310]. For more details we refer to [89] and Chap. 11 of [232]. 


To conclude the discussion of non-commutative integrability, let us consider a 
2n-dimensional Hamiltonian system which is both non-commutatively integrable, 
with momentum mapping J, and integrable in the ordinary sense, with the indepen- 
dent constants of motion in involution # = (H,..., H,). Let m € M be a regular 
point of both J and .# and assume that the level set components of m with respect 
to J and to # are compact. On the one hand, according to Theorem 11.4.3, m pos- 
sesses an open neighbourhood U which is foliated by n-dimensional tori which are 
invariant under the dynamics, thus giving rise to ordinary action and angle variables. 
On the other hand, according to Theorem 11.8.7, the dynamics in U further reduces 
to lower-dimensional tori. In general, the decomposition into n-tori is not compat- 
ible with the topology of the non-commutatively integrable system. The following 
example illustrates this point. 


Example 11.8.10 (Symmetric Euler top) We take up Example 11.5.4. Recall that by 
means of the left trivialization of T*SO(3) and the natural isomorphism s0(3) = R3 
of Example 5.2.8, the phase space of the symmetric Euler top can be identified with 
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M = SO(3) x R?. In these variables, the action of SO(3) on M induced by left 
translation is given by 

Wp (a, L) = (ba, L). 


In order to reserve the symbols G and J for the Andoyer variables, in this example, 
we denote the Lie group acting effectively by G and the corresponding momentum 
mapping by J. Consider the Andoyer function L on M, given by 


L(a,L):=L-e3, 
and the components of the momentum mapping of the action of SO(3) on M by left 
translation, given by 
K(a,L) =a. 
We have 
{Ki,Kj}=eij*Ke,  {L, Ki}=0, (11.8.17) 


where the last equation is due to the fact that Xx, are Killing vector fields for the 
SO(3)-action, whereas L is invariant. Thus, the functions L, K,, K2, K3 span a Lie 
subalgebra g of C°(M). It is isomorphic to R @ so0(3) via the assignment L tb 
(1,0), Kite rf, cf. Example 5.2.8. The corresponding momentum mapping is 


J =(L, Ki, K2, K3) 


and the associated simply connected Lie group is G = R x SU(2). An easy compu- 
tation (Exercise 11.8.4) shows that its action on M is given by 


You) (a,L) =(b(u)a, RoL), «eR, uweSU(2), (11.8.18) 


where ¢ : SU(2) + SO(3) is the covering homomorphism of Example 5.1.11 and 
Ra denotes rotation about e3 by the angle a. The subset of regular points of J is 
given by 


mM! = {(a, L) € SO(3) x R?: L x e3 £0} = SO(3) x (R® \ Res). 


The points of M 7 have the common stabilizer Gia) = 2rZ x {+1}, hence the 
group acting effectively is 
G =U(1) x SO(3) 


and this action is free and proper. The values of J, are given by 
w=(A,k)eRxR*\ {0}, [Al < [Ik]. 


The stabilizer subgroup G y is the direct product of U(1) with the subgroup of SO(3) 
of rotations about k, hence G _ T?. Under the identification § =R @ s0(3), the 
stabilizer subalgebra g,, is given by R@RKCR® IR3. In terms of the original 
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representation of g in terms of functions, g,, is the subalgebra spanned by the two 
functions L and k' K;. The dimension condition (11.8.11) is obviously satisfied for 
all values jz of J. Hence, Theorem 11.8.7 yields that the level set components of 
J, are 2-tori. The level set components can also be obtained directly by acting with 
G py On a point (a, L) with L- e3 = A and aL =k, which amounts to left translations 
of a by rotations about aL and to rotations of L about the e3-axis. 

Now, consider the Andoyer variable G, given by 


G(a,L) =||L]. 


We already know that X,; and XG define an action of R* on M/ with common 
stabilizer 277 Z?. Due to {L,G} = 0 and {G, J;} = 0, this action leaves the level 
set components of J, invariant and hence the latter are the orbits of this action. 
Moreover, the level sets of the mapping -% := (L, G) are the orbits of the action of 
the symmetry group U(1) x SO(3) and the image is given by 

I(M’) = {(, y) €R*:|a| < y}. 


Thus, as in the situation of Remark 11.8.9/3, we have a bifibration of the type 
(11.8.16), which here reads 


ui 
he 
J(M 
ri 


The induced projection z is given by 


I (M") /) 


(A, k) = (A, ||Kl|) 


and its fibres are {A} x Stel) where Sik is the sphere of radius ||k|| in R*, repre- 
senting the (co)adjoint orbit of k. Thus, the action variables L and G are adapted to 
the foliation of M/ induced by the symmetry associated with g: the fibre of J, over 
(A, y) is a bundle with fibre T? over {A} x Sty: For every A, this bundle is a direct 


product of a trivial S'-bundle over the point 4 with a nontrivial S'-bundle over S? 
of the type SO(3) > S?, that is, the SO(3) x S!-fibres of -Y cannot be decomposed 
globally into 3-dimensional tori. This can be further illustrated using the remaining 


Andoyer variable J: by Remark 11.5.5, J is not globally defined on M7, but only 
on the open subset 


M, = {(a,L) € SOG) x R?: Lx e3 £0, m3 x LO}. (11.8.19) 


The Andoyer variables are adapted to the corresponding bundles as follows: (g, /) 
are angle coordinates in the T?-fibres of J tm,» (G,L, J, j) are coordinates on 
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J(M,), (g,l, J, j) are coordinates on the fibres of 7, and (G, L) are coordi- 
nates on .%(M,.). By the condition n3 x L 4 0, the points (j, J) = (0, G) (north 
pole) and (j, J) = (7, —G) (south pole) are excluded, that is, (J, 7) yield local co- 
ordinates on the 2-sphere with radius G. Altogether, the action and angle variables 
define a diffeomorphism 


(G,L,J,g,1,j):Ms>NexT?, Ne={(G,L, J) eR: |L| <G,|J| < G}. 


To cover M by action and angle variables, one has to introduce a second Andoyer 
chart. This is explained in detail in a paper by Fasso [89]. From the above discussion 
we see that it is quite unnatural to decompose the phase space into 3-tori. The motion 
of the top takes place on invariant 2-tori and the additional angle variable j has no 
physical meaning. The definition of the pair (J, j) of variables is of local nature and 
depends on the choice of the inertial frame. 


Exercises 

11.8.1 Complete the proof of Theorem 11.8.1 by showing that, with respect to the 
Poisson bivector field 7, (D*)° is orthogonal to E. 

Hint. Use Proposition 5.4 in [181, $1.3]. 

11.8.2 Verify Formula (11.8.7) and use this to prove that the momentum mapping 
J defined by (11.8.8) is equivariant. 

11.8.3 Prove Formula (11.8.9). 

11.8.4 In Example 11.8.10, verify that the effective action induced by L, K,, K2, 
K3 is given by Formula (11.8.18). 

11.8.5 Show that the Kepler problem yields a non-commutatively integrable model 
both for positive and for negative values of the energy. Work out the details 
for negative energy values. 

Hint. Reconsider Example 10.6.3 carefully. 


Chapter 12 
Hamilton-Jacobi Theory 


In this chapter, we present the classical Hamilton-Jacobi theory. This theory has 
played an enormous role in the development of theoretical and mathematical 
physics. On the one hand, it builds a bridge between classical mechanics and other 
branches of physics, in particular, optics. On the other hand, it yields a link between 
classical and quantum theory. In Sect. 12.1 we start with deriving the Hamilton- 
Jacobi equation and give a proof of the classical Jacobi Theorem, which yields a 
powerful tool for solving the dynamical equations of a Hamiltonian system. We in- 
terpret the Hamilton-Jacobi equation geometrically as an equation for a Lagrangian 
submanifold of phase space! which is contained in the coisotropic submanifold 
given by a level set of the Hamiltonian. Using this geometric picture, one can extract 
a general method for solving initial value problems for arbitrary first order partial 
differential equations of the Hamilton-Jacobi type. This method is based on the fact 
that solutions are generated by the characteristics of the underlying Hamiltonian 
system. That is why this procedure is called the method of characteristics. It will be 
discussed in detail in Sect. 12.2. In Sect. 12.3 we generalize this method to the case 
of systems of partial differential equations of the Hamilton-Jacobi type. 

It turns out that one can go beyond the case where a solution is generated by a sin- 
gle function on configuration space. This is interesting both from the mathematical 
and from the physical point of view. To do so, instead of single generating functions, 
one must consider families depending on additional parameters. Such families are 
called Morse families. In Sects. 12.4 and 12.5 we develop the theory of Morse fam- 
ilies in a systematic way. In Sect. 12.6 we present the theory of critical points” of 
Lagrangian submanifolds in cotangent bundles, including a topological characteri- 
zation in terms of the Maslov class and a description of the topological data in terms 
of generating Morse families. 

In Sects. 12.7 and 12.8 we discuss applications in the spirit of geometric asymp- 
totics. First, we study the short wave asymptotics in lowest order for the Helmholtz 


‘In this chapter, the phase space will always be the cotangent bundle of some configuration space. 


?Points where the Lagrangian submanifold is not transversal to the fibres. 
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equation. This leads to the eikonal equation of geometric optics. We discuss classes 
of solutions of this equation including the formation of caustics. In Sect. 12.8, we 
study the transport equation. We present a detailed study of its geometry and, on 
this basis, derive first order short wave asymptotic solutions for a class of first- 
order partial differential equations. In this analysis a key role is played by a con- 
sistency condition of topological type, which for good reasons is called the Bohr- 
Sommerfeld quantization condition. We discuss applications to the Helmholtz and 
to the Schrddinger equations. 


12.1 The Hamilton-Jacobi Equation 


The basic idea of Hamilton-Jacobi theory consists in finding a time-dependent sym- 
plectomorphism transforming the system to equilibrium. Let (M, w, H) be a Hamil- 
tonian system and let (M, w, H) be its extension to the time-dependent phase space, 
that is, 


M=T*Rx M, O@=o-dE Ad, H=H-E, 


see Sect. 9.3. Let © = H7! (0) and let ® bea time-dependent canonical transfor- 
mation of M. By Proposition 9.3.4, @ induces a time-dependent canonical trans- 
formation of M, that is, a smooth mapping ® : M x R— M such that D(-,f) isa 
canonical transformation of M for all t. In Darboux coordinates g' and p; on M, ® 
is given by 


(q, p.t) +> ®(q, p,t) = (Gq. Pp. t), PG, p.t). £). 


It transforms the system to equilibrium iff the new variables g' and jp; are constants 
of motion, that is, 


qg'=0, = pj =0. 
In this case, the Hamilton equations imply 


0H 0H 
— =0, — =0, (12.1.1) 
aq' 0 Di 


To find a local generating function S of the first kind for ®, we must solve 


(pidg' — Hdt) — (pidq' — Hr) =—dS (12.1.2) 


on the graph Tg C (M x R) x M of @, cf. Sect. 8.8. Comparison of coefficients 
yields the equations 


H=H+— (12.1.3) 
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on Ig. By (12.1.1), H depends on t only. It can, therefore, be absorbed into S. 
Then, 
as 


H+—=0 12.1.4 
ery: f ) 


and by coordinatizing Ip by qi and q', from (12.1.3) we obtain 


OSL as 
(a 3q q. ¢), ) + —(q,q,t) =0, (12.1.5) 
q ot 


where H stands for the local representative of the Hamiltonian function in the 
Darboux coordinates q', p;. This is the time-dependent Hamilton-Jacobi equation. 
Here, q plays the role of a parameter labelling the solutions. It can, therefore, be 
omitted. If H is not explicitly time-dependent, one can separate the time variable by 
means of the ansatz S(q, q,t) = S(q,q) + T(t): 


oF gg Stacey 
dt ae q.q i 


Since the right hand side of this equation does not depend on f, both sides must be 
equal to a constant, say c. Thus, we have T(t) = —c(t — tg) and 


as 
(a ag ») =c. (12.1.6) 


This is the time-independent Hamilton-Jacobi equation. Again, q appears as a pa- 
rameter labelling the solutions. Let us summarize. If the time-dependent canonical 
transformation generated by S(q, q, 7) transforms the system to equilibrium, then 
S(q, q, t) fulfils the first order partial differential equation (12.1.5) for every q, that 
is, the g' play the role of parameters for a family of solutions. In the sequel, such 
a family will be called a complete integral for H, provided it fulfils a certain regu- 
larity condition. We will show that complete integrals of (12.1.5) are in one-to-one 
correspondence with solutions of the Hamilton equations (2n ordinary differential 
equations of first order) for H. This is the famous Jacobi Theorem, yielding a pow- 
erful solution scheme. On the other hand, forgetting about the above derivation, we 
can view (12.1.5) as an equation defined by H for the function S$ = S(q, ft). This 
type of equations occurs in various branches of physics, notably in optics. We will 
see that the initial value problem for this equation can be solved by means of the flow 
of the Hamiltonian vector field X 7. This is the method of characteristics, which will 
be discussed in detail below. 


3The same statement holds true for a generating function S(q, p, t) of the second kind with param- 
eters pj. 
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Remark 12.1.1 


1. To understand the physical meaning of S, let us consider a mechanical system 
with configuration space Q = R” and Hamiltonian function H :T*Q —> R. Let 
S = S(q,t) be a solution of the associated Hamilton-Jacobi equation. Via S, 
every curve t +> q(t) in Q generates a curve t +> (q(t), p(t)) in T*Q by 


aS 
pi(t) = agi (A): 


Using (12.1.4), for the total derivative of S along q(t) we find 


d as . as 
G7 5(4) = sa (a) a + F(a) 


aqi 
= pi(t)q'(t) — H(q(t), p(s) 
= L(q(t), (0), (12.1.7) 


with L denoting the Lagrange function of the system, cf. (9.1.15). Integrating 
both sides of (12.1.7) from fp to t > fo, we obtain 


t 


S(q(t),t) — S(a(@0). 0) = if L(q(t). 4(¢)) dt. 


10 


Thus, up to an additive constant, S(q, t) coincides with the physical action func- 
tion along t + q(t), which is the projection of the integral curve to Q. 

2. Obviously, one way to transform a Hamiltonian system (M, w, H) to equilibrium 
is given by the flow ® of the Hamiltonian vector field X 7, cf. Remark 8.2.5/1. 
In this case, the constants of motion q and p coincide with the initial values q(to) 
and p(to) for some given time f = fg. Note that in a neighbourhood of the point 


((q(to), p(to), to), @. B. to) € Te CM x M 


we have 


so that the coordinates g! and p; define a local chart on Ig. Thus, in this context 
it is reasonable to use a generating function of the second kind. 


Now, we give the geometrical interpretation of the Hamilton-Jacobi equation. Let 
M = T*Q and assume for simplicity that H does not explicitly depend on time; the 
general case is completely analogous. Under this assumption, we can confine our 
attention to the time-independent Hamilton-Jacobi equation (12.1.6). This equation 
can be rewritten as 


HodS=c (12.1.8) 
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with c € R and with the parameter q omitted. Assume that c is a regular value of H. 
Then, H~!(c) is an embedded submanifold of T*Q of codimension 1. By Propo- 
sition 7.2.4/2, it is coisotropic. On the other hand, the image of the differential dS, 
which will be denoted by the same symbol, is a Lagrangian submanifold. Thus, we 
arrive at the following geometric interpretation of the Hamilton-Jacobi equation: its 
solutions are Lagrangian submanifolds contained in a given coisotropic submanifold 
€ of T*Q on which the Hamiltonian vector field X y has no zeros. 

Furthermore, the integral curves of X can be interpreted geometrically as the 
characteristics of @: to see this, recall from Sect. 8.5 that the characteristics of @ 
are the integral manifolds of the characteristic distribution D®@ . Since the latter has 
rank | and since X y has no zeros on @, Lemma 8.5.4/2 implies that D°@ is spanned 
by Xy. It follows that the integral curves of Xq coincide with the characteristics 
of @, indeed.* 

First, we show that the dynamics of an autonomous Hamiltonian system reduces 
to the image of the differential dS of a function S on Q iff this function solves the 
Hamilton-Jacobi equation. 


Proposition 12.1.2 Let (T*Q,w, H) be an autonomous Hamiltonian system, with 
Q being connected, and let S: Q — R be a smooth function. The following state- 
ments are equivalent. 


1. S is a solution of the time-independent Hamilton-Jacobi equation (12.1.8). 
2. The image of the differential dS is invariant under the flow of X y. 


Proof For x € Q and Y € T; Q, we have 
was(x)(XH, (dS) Y) = —(dH, (dS)'Y) = —Y(H ods). (12.1.9) 


Since Q is connected, S solves (12.1.8) iff the right hand side of (12.1.9) vanishes 
for all Y e TQ. On the other hand, since dS is Lagrange, we have 


(dS)'TQ = T(dS) = (T(ds))”. 


Thus, Xy takes values in T(dS), and hence dS is invariant under the flow of Xy, 
iff the left hand side of the above equation vanishes for all Y e TQ. 


Remark 12.1.3 


1. That point 1 implies point 2 follows also from the more general statement of 
Proposition 8.5.3. 

2. An analogous statement holds for explicitly time-dependent Hamiltonian func- 
tions, because every solution S = S(q, ft) of the time-dependent Hamilton-Jacobi 
equation defines a Lagrangian submanifold dS of T*(R x Q). We leave the de- 
tails to the reader (Exercise 12.1.1). 


4For the case where @ is an energy surface, this has already been discussed in Sect. 9.1. 
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Fig. 12.1 Geometric : TO 

meaning of a solution S of R 
the time-independent 
Hamilton-Jacobi equation 


foe 
ds 


W77a 


Figure 12.1 illustrates the geometric content of Proposition 12.1.2: every solu- 
tion S of the Hamilton-Jacobi equation yields a reduction of the dynamics of the 
Hamiltonian system to the Lagrangian submanifold dS. This submanifold is a union 
of integral curves of the Hamiltonian vector field X 7. This way, we have obtained 
a fundamental relation between systems of first order ordinary differential equa- 
tions (the Hamilton equations) and first order partial differential equations of the 
Hamilton-Jacobi type. On the one hand, this relation can be used to study the dy- 
namics of Hamiltonian systems. On the other hand it can be used to solve initial 
value problems for partial differential equations of this type. 

In the remainder of this section we discuss the first of these two aspects. The 
discussion is based on the notion of a complete integral. In order to keep in touch 
with the standard mechanics language, we give a definition in terms of local coordi- 
nates here. In Sect. 12.3, we will give a coordinate-free definition in a more general 
situation (Definition 12.3.3). 


Definition 12.1.4 (Complete integral) Let (M,w, H) be a 2n-dimensional Hamil- 
tonian system and let g', p; be Darboux coordinates. An n-parameter> family 
S(q,q,t) of solutions of the time-dependent Hamilton-Jacobi equation (12.1.5), 
with H being expressed in terms of the Darboux coordinates g', p;, is called a 
complete integral for H if 


2 
aet( : e ) #0. (12.1.10) 
dq' dq! 

Remark 12.1.5 From this definition one derives the notion of a first integral for the 
time-independent Hamilton-Jacobi equation. Since S does not depend on ¢ here, the 
conjugate variable E is a constant of motion which can be expressed in terms of the 
remaining constants. Therefore, one has only n — | independent constants of motion 
a; and Condition (12.1.10) is replaced by the requirement that the matrix 


>The notation g' for the parameters is a matter of convention, their physical meaning depends on 
the concrete context. 


12.1 The Hamilton-Jacobi Equation 647 


a28 


q oa: 


be of maximal rank. 


The following theorem states that finding a complete integral is equivalent to 
solving the Hamilton equations. Here, we give the formulation and the proof in 
local coordinates. In Sect. 12.3 we will come back to this theorem in a more general 
context. There, we will present a coordinate-free proof. 


Theorem 12.1.6 (Jacobi) Let (M, w, H) be a 2n-dimensional Hamiltonian system, 
let q', pi be Darboux coordinates and let S(q,q,t) be a complete integral of the 
corresponding time-dependent Hamilton-Jacobi equation. Then, via the relations 


as _ _ 0s = 
Pi=—.9,b), Pi=->=>(9.9,1), (12.1.12) 
agi agi 


S defines a time-dependent canonical transformation (q, p) +> (q, p), which as- 
signs to every set of constants of motion (q, p) a solution of the Hamilton equations 
given in the coordinates q' and pj; by 


tt (q(4,p.1), p@,p.1)). (12.1.13) 


Proof By (12.1.10), S defines via (12.1.12) a canonical transformation of the first 
kind, indeed. We show that the curve t +> (q(t), p(t)) in R*” given by (12.1.13) 
satisfies 


. 0H 
Ho=5 7, (t), p(@),t), 2 ae (q(t), p(t), t). (12.1.14) 


By (12.1.12), the mappings q(t) and p(t) are defined by the relations 


as _ 7 os 2 
pit) = agi (q(t),4.t), Bi =~ 9gi 4 4, t) (12,15) 
where q and p are fixed. The second of these relations implies 
d (as a°s aS 
0O= — -(q(t), q,¢ t),q,t)gi(t —(q(t), q,t). 
5 (40.4 ))= 34 aglagi 4» & 1)4 t)q + Faq a4 ) 


By the Hamilton-Jacobi equation and by (12.1.15), the second term yields 


a as dH as 
agi (u (a= 57 (a(t), 4). 1) ) =F 0.00, ‘*) grat 4: 41): 


In view of (12.1.10), this yields the first equation in (12.1.14). The first relation in 
(12.1.15) implies 


nin = F(a (t), ))= os (q(t), 4. 1)¢ DO eer ui) (q(t), 4, 1) 
Pi ~~ dt dq! q »q, ~ agiags ‘dq j q q. q’ agi dt q »q, - 
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For the second term, the Hamilton-Jacobi equation and (12.1.15) yield 


— (w (an. = 9% (a), 4.2), ‘)) 


aH 
ae (q(), p(t), t) — 


2 


S 
aqiaq' 


5, (al p(t), 1) —— (q(t), q, £), 


Op 


where on the right hand side, the partial derivatives of H are taken in the coordi- 
nates g', pi, t (as opposed to the coordinates g', g', t on the left hand side). Thus, 
using the first equation of (12.1.14), which was already shown, we obtain the second 
equation in (12.1.14). 


The only systematic way for finding a complete integral is provided by the 
method of separation of variables. Above we have applied this method to separate 
the time variable in the (time-dependent) Hamilton-Jacobi equation for a Hamil- 
tonian function H which does not explicitly depend on time, thus arriving at the 
time-independent Hamilton-Jacobi equation. It can also be applied to the other vari- 
ables, as soon as the Hamiltonian function consists of a sum of terms which depend 
on disjoint sets of coordinates. This is illustrated by the following example. 


Example 12.1.7 (Central Force Field) Let us find a complete integral for the motion 
of a particle in a central force field, given by a spherically symmetric potential V. 
In spherical coordinates r, 3, ¢, the Hamiltonian takes the form 


1 ps Pi 
H=—(p?+%+—4_]4+v 
sao + + ap) tT VO)» 


cf. Example 11.5.2. Thus, the Hamilton-Jacobi equation reads 


1 ((3) + 1 Ss) ts 1 ay) 4M pag 
2m \\ar also P? sin? = (og a 


We plug in the separation ansatz S(r, 0, 6) = S;(r) + Sp (3) + S¢(@): 


1 (fas, Lf ass) I dSy\7 
V()=E.  (12.1.16 
5 (($) +2(Z) +5 (SZ) Jt v0 (12.1.16) 


Since = is the only quantity depending on @¢, 


dS¢ _ 
dp 


for some integration constant ® € R. Then, (12.1.16) can be rewritten as 


as. \* dSy\?_—s @? 
2 r 2 ov 

2 V E)= : 
?(S) + mr ( (r) ) (=) aa 
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Since the left hand side depends on r only and the right hand side on 7, 


dSy\?7_ @? 5 ds: 
= =0", =2m(E-—V = 
( dv ) 7 sin? 9 dr mal ”) 
with a further integration constant © € R. The constants E, ©, @ play the role of 
the parameters q' in Theorem 12.1.6. Thus, 


2 


or d 
aS = dSy 4 p2 
Ps => =a = = 
av dv sin? 0 
aS  dS¢ 
Po=— =— = 
dp = dg 


and hence 


@2 
S-(r) = 2m(E _ V(r) _ aa dr, 
v)= io 
Oy Nye sin? .¢ 


So(o) = 


Thus, we arrive at the complete integral 
S(t,r,¢,0; E,®,O)=—-Et+@-6+5S,(7, E, 9) + S (v0, &, @), 


and the following equations describing the dynamics: 


as as 
pe pee je ey d al adr, (12.117) 
af oF [2m(E — Vir) — & 
_ aS aS a> 
= = 12.1.18 
Po" 560° 90 930 ( ) 
as os 
a og. (12.1.19) 


Exercises 

12.1.1 Prove Proposition 12.1.2 for the time-dependent case. 

12.1.2 Analyze Eqs. (12.1.17)-(12.1.19) for the case of the Kepler potential. Show 
that they yield the Kepler orbits. 
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12.1.3 Solve the Hamilton-Jacobi equation for the planar two-centre problem (Ex- 
ample 11.1.4). 
Hint. Use elliptic coordinates. 


12.2 The Method of Characteristics 


In this section, we show how to solve the initial value problem for the Hamilton- 
Jacobi equation using the method of characteristics. 

Let QO be a manifold of dimension n and let H : T*Q — R be a smooth function 
for which 0 is a regular value. We consider the level set @ := H —1(0). Recall that 
the characteristic distribution D°@ of @ is spanned by the Hamiltonian vector field 
Xy. Let D be an embedded submanifold of Q of dimension m and let Sp : D> R 
be a smooth function. Solving the initial value problem for the Hamilton-Jacobi 
equation defined by H in the analytic sense consists in finding a smooth function $ 
which is defined on some neighbourhood of D and fulfils 


HodS=0, — Sip =So. 199.15 


To solve this problem, we consider the associated Hamiltonian system (T* Q, w, H) 
and perform the following steps. 


1. We determine the Lagrangian submanifold of T*Q given by the canonical lift 
(D, So) of the pair (D, Sg), cf. Example 8.3.8/4. According to the Transversal 
Mapping Theorem 1.8.2, if (D, So) is transversal to @, the intersection 


Spy = (D, 8) NG 


is an embedded isotropic submanifold of dimension n — 1 of T*Q. If .% is 
transversal in @ to the integral curves of X 7, we say that .“o is in non-characte- 
ristic position and call it an admissible initial condition, or a submanifold of 
Cauchy data. For dimensional reasons, and since X 7 has no zeros on @, .% is 
transversal to the integral curves of X 7 iff 


Te.% NM DES = {0} (12.2.2) 


for all & € .%,. 

2. By means of the flow of Xz, from .% we generate a Lagrangian immersion 
W:A— T*Q satisfying H oW = 0. Locally, this immersion induces Lagrangian 
submanifolds .7. 

3. If .Y intersects the fibres of T*Q transversally and at most once, Proposi- 
tion 8.3.10 yields a local function S on Q satisfying dS = .Y and hence (12.2.1). 


We refer to the Lagrangian immersion (A, WY) as a generalized solution, to the 
Lagrangian submanifold .Y as a geometric solution and to the function S$ as an 
analytic solution of the initial value problem (12.2.1). Figure 12.2 illustrates the 
geometry of the problem. 
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Fig. 12.2. Continuation of 
-%o by the help of the flow of 
Xy 


Theorem 12.2.1 (Method of Characteristics) Let % be an admissible initial con- 
dition for the initial value problem (12.2.1). Let ® be the flow of Xq and let J C 
R x T*Q be the domain of ®. Then, the restriction VW of ® to A= (RX A)NG 
is a Lagrangian immersion. (A, W) is a generalized solution of (12.2.1). 


Proof Obviously, A is an open subset of R x .% and W : A > T*Q is a smooth 
mapping. Since W (0, €) = for all € € .%), W(A) contains .%9. Tangent vectors at 


(t,&) € A are of the form as, Y), where A € R, ¢ denotes the standard vector 


field on R and Y € Tz.%. The tangent mapping of W is given by 


d 
Yoo a r) = (®,),(AXn +Y). (12.2.3) 


Since Xy has no zeros on @ and since (®;);, is a bijection, (12.2.2) implies that 
W is an immersion. To prove that (A, W) is Lagrange, it suffices to show that it is 
isotropic: for A, 4 € R and X, Y € Tz.% we calculate 


esol (od) (od) 


= wo, (G1), AX + X), (Pr)e(UXH + Y)) 
=a¢(AXy+X,uXH+Y) 
=0, 


because ®,; is symplectic, .“ is isotropic and X y is characteristic for @. 


Remark 12.2.2 Theorem 12.2.1 extends to the case of an arbitrary symplectic man- 
ifold M and an arbitrary isotropic and non-characteristic submanifold .% of M. 


Corollary 12.2.3 Let (A, W) be a generalized solution of the initial value problem 
(12.2.1). 
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1. There exists an open connected neighbourhood of {0} x %o in A on which © 
is injective and hence defines a Lagrangian submanifold SY which constitutes a 
local geometric solution. 

2. The geometric solution is locally unique in the following sense. If 7, and. are 
solutions for the initial condition So, there exists a connected subset of 4; 
which contains “o and which is open in both “ and 4. This follows from the 
fact that every solution is a union of open segments of integral curves of X 1. 


It may happen that the Lagrangian submanifold .7 obtained from -Y intersects 
some of the fibres of T*Q several times, whereas other fibres are not intersected at 
all. It may also happen that .” is tangent to a fibre. That is why the third step, the 
reconstruction of an analytic solution, again can be performed only locally. Using 
Proposition 8.3.10, we obtain 


Proposition 12.2.4 Let “9 be an admissible initial condition for the initial value 
problem (12.2.1), defined by the pair (D, Sg). Assume that D is contractible and 
that Ao is transversal to the fibres of T* Q and intersects every fibre at most once. 
Let be a geometric solution. Then, there exists aneighbourhood U of D in Q and 
a smooth function S: U — R such that dS(U) C %. The function S is an analytic 
solution of the initial value problem (12.2.1). 


Proof Since .% is transversal to the fibres of T*Q and intersects every fibre at 
most once, there exists an open neighbourhood V of .%o in .Y for which this is still 
true. Since V is open in .Y, it is a Lagrangian submanifold of T*Q. Since .% is 
contractible, we can choose V to be contractible as well. Then, Proposition 8.3.10 
yields the existence of U and S. Since Y C @, we have H o dS = 0, that is, $ 
is a solution of the Hamilton-Jacobi equation. To check the initial condition, we 
observe that .%9 = dS(D). On the other hand, by the definition of the canonical lift 
in Example 8.3.8/4, for every € € “% and X € Tz ¢) D we have 


(§, X) = (dSo, X). 


It follows that (dS(x), X) = (dSo(x), X) for all x € D and all X € T,.D and hence 
d(S}p) = dSo. By adding an appropriate constant we obtain S$} p = So. 


Remark 12.2.5 Recall from the proof of Proposition 8.3.10 that an analytic solu- 
tion S can be derived from .” as follows: choose an open subset V C .Y which 
is transversal to the fibres of T*Q and intersects each fibre at most once. Then, 
U = 2(V) is an open subset of Q and V defines a closed 1-form @ on U by 
ao =idy. Now, the generating function S is determined from dS = a. 


As we have seen, analytic solutions exist only on open subsets of Q over which 
the geometric solution is transversal to the fibres of T*Q. In Sect. 12.4 we will 
show that every Lagrangian submanifold can be locally generated by a more general 
object, a so-called Morse family. In this sense, local analytic solutions always exist. 
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12.3 Generalized Hamilton-Jacobi Equations 


In this section, we discuss the following natural generalization of the initial value 
problem for the Hamilton-Jacobi equation: instead of (12.2.1) we consider a system 
of partial differential equations, 


CodS=0, (12.3.1) 


defined by a smooth mapping C: T*Q — R* for which 0 is a regular value. If 
the submanifold @ = C~!(0) C T* Q is coisotropic, (12.3.1) is called a generalized 
Hamilton-Jacobi equation. By Remark 8.5.7/2, every coisotropic submanifold can 
be locally represented as a level set. Thus, even more generally, every coisotropic 
submanifold @ Cc T*Q yields a generalized Hamilton-Jacobi equation. 

Let n = dim Q and k =codim@. An admissible initial condition for (12.3.1) is 
an (n — k)-dimensional submanifold .%) of @ which is isotropic in T*Q and non- 
characteristic in @, meaning that it is transversal to the characteristics of @. For 
dimensional reasons, this is equivalent to the requirement 


Te. AN (Tz@)° = {0} (12.3.2) 


for all € € .%. As in Sect. 12.2, we look for generalized solutions (Lagrangian 
immersions), geometric solutions (Lagrangian submanifolds) and analytic solutions 
(local functions on Q). 


Theorem 12.3.1 Let 1<k <n-—1 and let C: TO> R* be a smooth map- 
ping. Assume that 0 is a regular value of C and that the level set @ = C~'(O) is 
coisotropic. For every admissible initial condition .~y C ©, there exists a general- 
ized solution of the generalized Hamilton-Jacobi equation (12.3.1). 


Proof Let C;: T*Q — R denote the components of the mapping C and let X; de- 
note the Hamiltonian vector fields generated by these functions. Since 0 is a regular 
value of C, the vector fields X; are pointwise linearly independent on @. By Propo- 
sition 8.5.6, they span the characteristic distribution D°@ of @. Let ' denote the 
flows of the X;. There exists an open neighbourhood A of {0} x .% in R* x .% 
such that the mapping 


Wi: A>TO, Wit,€):= (Pj) 0-0 OYE) (12.3.3) 


is defined: indeed, the right hand side is defined on an open neighbourhood of {0} x 
T*Q in R‘ x T*Q and A can be obtained by intersecting this neighbourhood with 
R* x .%. Since W(0, €) =& for all € € .%, the image W(A) contains .%. 

Next, we prove that A can be chosen so that W is an immersion. For that purpose, 
it suffices to show that the tangent mapping of W at (0, &) is injective for all € € .%,. 
Writing tangent vectors of A at (0, €) in the form (x, X) with x € IR‘ and X € Tz.A%, 
we find 

k 
Woe) (x, X) =X+) > xXi€). (12.3.4) 


i=1 
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Since the vector fields X; are pointwise linearly independent on .% and since 
(12.3.2) holds, Yoe) is injective for all € € .%, indeed. 

Finally, we show that (A, Y) is Lagrange. Since A has dimension k + dim.% = 
n, it suffices to prove that W is isotropic. For € € .%, x,y € R* and X,Ye Tz.A%, 
we have 


k k 
“i (x+ Soi Xi(G),¥ + dna] =0, 
i=l i=l 


because the vector fields X; are characteristic for @ and .% is isotropic. Hence, 
(12.3.4) implies that im Yo, £) is an isotropic subspace of T;(T* Q) for all € € .%. 
Since Y% is symplectic, to see that im We, £) is isotropic for all (t, €) € A, it is suffi- 
cient to show that 


im Wg) = (a); (im Wo, 6))- (12.3.5) 


This can be seen as follows. On the one hand, since the vector fields X; are char- 
acteristic, the mapping tr> W(t, €) takes values in the integral manifold Nz of the 
characteristic distribution D°@ of @ through &. Hence, 


: o 
1m Wee) = Dyte) + (Y)-Te-%o. 


On the other hand, for the same reason, the mapping Y% restricts to a local diffeo- 
morphism of Nz. Hence, 


@ Q@: 
Dye) = We Dz © 


Since (12.3.4) implies im oe) = De~ + T:.S%, this proves (12.3.5) and hence the 
theorem. 


Remark 12.3.2 


1. The geometric picture shown in Fig. 12.2 carries over to the present situation, 
but with modified dimensions of the submanifolds involved: @ has dimension 
2n —k, Wy has dimension n — k, the characteristics of @ have dimension k and 
thus the solution has again dimension n. 

2. As for k = 1, with a generalized solution one can associate geometric solutions, 
that is, Lagrangian submanifolds. Moreover, one has an analogous uniqueness 
statement: if .~, and .% are two geometric solutions of (12.3.1) for the initial 
condition .%9, there exists a connected subset of .“4, 1% which contains .% 
and which is open in both .“ and .~4. This follows from Proposition 8.5.3. 

3. If the initial condition .“ is transversal to the fibres of T* Q and intersects each 
fibre at most once, there exists a local generating function, that is, an analytic 
solution. 
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In the remainder of this section, we consider the situation where @ is a gen- 
eral coisotropic submanifold (not necessarily given by a family of functions).° In 
Sect. 12.1 we have proved the Jacobi Theorem, which states that finding a complete 
integral of the Hamilton-Jacobi equation is equivalent to solving the initial value 
problem for the Hamilton equations. Our aim is to prove that this statement is also 
true in the present context. Our presentation is along the lines of [181]. First, we 
must generalize the notion of a complete integral. 

For that purpose, the space of the constants of motion g', which in the case of 
the ordinary time-independent Hamilton-Jacobi equation has dimension n — 1, is 
replaced by a manifold A of dimension n — k, called the parameter manifold. Thus, 
we build the (2n — k)-dimensional manifold Q x A and endow the cotangent bundle 
T*(Q x A) =T*Q x T*A with the modified symplectic form 


PIt« 9 d09 — prt», da, (12.3.6) 


where 0g and 6, are the canonical |-forms on the corresponding cotangent bundles 
and pry»g and pry», are the canonical projections onto the factors T*Q and T*A, 
respectively. For an open subset U C Q x A and x € Q,aé A, we define 


Ug = {x € Q: (x,a) EU}, U, := {ae A: (x,a) EU}. 
For a smooth function S: U — R, we denote 
Sa: Ug > R, S,: U, > R, Sa(x) = Sy (a) = S(x, a), 
and define the partial differentials 
d?S:=prpgodS: U>T*Q,  d4S:=prps4odS: U > T*A. 
Then, for all (x, a) € U we have 
d2S(x,a)=dSa(x), d4S(x, a) =dS,(a). 


Definition 12.3.3 (Complete integral) Let @ C T*Q be a coisotropic submanifold 
of codimension k, 1 < k <n-— 1. Let A be an (n — k)-dimensional manifold and let 
U c Qx A beaconnected open subset. A function S$: U > R is called a complete 
integral of the generalized Hamilton-Jacobi equation (12.2.1) if 


1. every nonempty subset Ug is connected and satisfies dS,(Ug) C ©, 
2. the mapping d2S is a diffeomorphism from U onto an open subset of @. 


A is referred to as the parameter manifold of S and U is referred to as the domain 
of S. In case d2S(U) = @, S is said to be a global complete integral. 


Theorem 12.3.1 extends to this more general situation. For the proof one has to apply the Tubular 
Neighbourhood Theorem for embedded submanifolds to .%), see Remark 6.4.7. 
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Remark 12.3.4 


1. The functions S, form a family of analytic solutions of the generalized Hamilton- 
Jacobi equation. The connected Lagrangian submanifolds dS,(U,) establish a 
foliation of the open subset d2 S(U) of @ by geometric solutions. 

2. For the local discussion of solutions, condition 2 may be weakened to the require- 
ment that the rank of d2S be 2n — k. Let us analyze this requirement in Darboux 
coordinates g, ..259", Pi,---, p" on T*Q and qi, ee aN Pis+++> Pn—k ON 
T*A. In these coordinates, d2 S is given by 


- aS 
(q, q) > (p= **(a.a)). 
It has rank 2n — k iff the matrix 


a7 
——, l<ix<n,1<j<n-k, (12.3.7) 
dqg'dq/ 


has rank n —k. In particular, for k = 1, this requirement is equivalent to (12.1.11). 
Let us add that d4S is given by 


adn (ea=-Saa) 
q. 4 TP aa q. 4 


and that it has rank 2(n — k) iff the matrix (12.3.7) has rank n — k. Thus, condi- 
tion 2 implies that d4S has rank 2(n — k) everywhere. It is, therefore, a submer- 
sion. In particular, d4.S(U) is an open subset of T* A. 


Theorem 12.3.5 (Generalized Jacobi Theorem) Let @ be a coisotropic submani- 
fold of T*Q and let S be a complete integral of the generalized Hamilton-Jacobi 
equation defined by @, with parameter manifold A and domain U. Consider the 


mapping 
g:U>TOxT*A, g(x, a) := (d2S(x, a), —d4S(x,a)). 


Then, the following holds. 


1. A = @(U) is an embedded Lagrangian submanifold of T*Q x T*A endowed 
with the symplectic form (12.3.6). One has 


Prpsg(A)=d2S(U), —— prpx4(A) = —d“S(U). 


2. There exists a unique mapping 1: pryxg(A) > pry 4(A) satisfying the relation 
nm od2S =—d4S. This mapping is a surjective submersion and A is its graph. 

3. The distribution ker x' coincides with the characteristic distribution of the coiso- 
tropic submanifold pry«g(A) of T* Q. In particular, for every a € pty« 4 (A), the 
connected components of m~!(a) are characteristics of pry«g(A). 


12.3 Generalized Hamilton-Jacobi Equations 657 


Proof 1. A is the image of the submanifold dS(U) of T*Q x T*A under the 
diffeomorphism (&,7) + (€,—7). Since dS(U) is Lagrange with respect to the 
cotangent bundle symplectic form pr}. 0 d69 + prt», 404, cf. Example 8.3.8/2, A 
is Lagrange with respect to the modified symplectic form (12.3.6). The relations 
Prt g(A) = d2S(U) and pry 4(A) = —d4S(U) are obvious. 

2. Since the mapping d2 5S is a diffeomorphism onto its image, one can define a 
mapping 

1: pqxQ(A) > pryx4(A), n= —d4S 0 (d2s) 1. 
This mapping is surjective and its graph, when viewed as a subset of T*Q x T*A, 
coincides with A. By Remark 12.3.4/2, d4S is a submersion. Hence, so is 7. 

3. Since A is the graph of z, under the natural identification of T(T* Q x T* A) 
with T(T* Q) x T(T*A), tangent vectors of A correspond to pairs (X, 7’(X)) with 
X € T(pry.g(A)). Since A is Lagrange with respect to the symplectic form (12.3.6), 
for all € € Prt (A) and X,Y € Ts (pry«g(A)), we have 


0 = (prix g 409 — pry« 4 d64)((X, 2’(X)), (Y,2’(¥))) 
= d69(X, Y) — d04(2'(X), x'(¥)). (12.3.8) 


Thus, if 2’(X) =0, then d69(X, Y) =0 for all Y and thus X 1d09 = 0. Conversely, 
if Xid0g = 0, then d64(z’(X), m'(Y)) = 0 for all Y. Since 1, is surjective onto 
Txé)(T*A) and since d@4 is non-degenerate, we conclude m'(X) = 0. 


Remark 12.3.6 


1. If S is a global complete integral, then prp«g(A) = @. 

2. Point 3 of Theorem 12.3.5 implies that for every a € T*A, the complete inte- 
gral S yields a solution of the generalized Hamilton-Jacobi equation defined by 
the coisotropic submanifold @. Geometrically, it is given by a connected com- 
ponent of z~!(q). In particular, for every a € A, the Lagrangian submanifold 
dSq(Ug) C d2S(U) C @ is foliated by the (k-dimensional) connected compo- 
nents of the level sets 7~!(@) with pry+4(@) = a. Thus, the elements of T*A 
play the role of invariants labelling the solutions (up to connected components), 
and zr is the mapping which assigns to every solution the corresponding invari- 
ants. 

In Darboux coordinates g', pj on T*Q and g/, p; on T*A, the Lagrangian 
submanifold A consists of the pairs ((q, p), (q, p)) satisfying 


as as 
— —“(q,q), p = ——-(q, q). 12.3.9 
p aq 4 q) p aq 4 q) ( ) 


The local representative of z is given by (q, p) + (¢(q, p), P(q, p)), where the 
mapping (q, p) + g(q, p) is obtained by solving the first equation in (12.3.9) 
for q and the mapping (q, p) + p(q, p) is obtained by inserting the resulting 
expression for q into the second equation. Thus, Theorem 12.3.5 generalizes 
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the time-independent version of the ordinary Jacobi Theorem, obtained from 
Theorem 12.1.6 by separating the time variable and by removing from the 2n 
constants of motion g', p; the energy E and the initial time fo. This way, the 
parameter manifold A generalizes the space of the constants of motion q’. 

3. Equation (12.3.8) implies 7*d64 = j*d6g, where j : prpxg(A) > T*@Q is the 
natural inclusion mapping. Thus, z defines a symplectic reduction, cf. Sect. 8.7. 

4. If the rank of the restriction of the natural projection T*Q — Q to the isotropic 
submanifold @ is less than n, a complete integral cannot exist. In this case, one 
can use the concept of Morse families, to be introduced in the next section. This 
leads to a further generalization of the Jacobi Theorem [40, 181]. 


For physical problems leading to a generalized Hamilton-Jacobi equation, the 
reader may consult the original papers by Tulczyjew and Benenti [40]. An interest- 
ing example of this type is provided by the motion of a charged particle, formulated 
in a gauge-invariant way in the 5-dimensional Kaluza-Klein space [290, 292]. Here, 
one has two equations, namely a mass and a charge condition, that is, one deals with 
the case of codimension 2. 


12.4 Morse Families 


In this section we show that the concept of a generating function for a fibre- 
transversal Lagrangian submanifold of a cotangent bundle can be generalized in 
such a way that it applies to any Lagrangian submanifold of this bundle. This leads 
to the notion of a Morse family, which generalizes that of a Morse function, cf. 
Sect. 8.9. Morse families were introduced by Hérmander [141], who called them 
phase functions. We also refer to [305] for the first completely intrinsic presentation 
and to [181], where in particular the relation to symplectic reduction is discussed. 
To our knowledge, in physics, Morse families have been used for the first time by 
Benenti and Tulczyjew [40]. 
Let B and Q be manifolds and let 


z:B>Q 


be a submersion. Let 7g: T*B — B and rg: T*Q — Q be the canonical projec- 
tions and let 0g and 0g be the canonical 1-forms in T*B and T*Q, respectively. 
Consider the induced (pull-back) bundle 


BxoTQ=n*(TO)={(,68)€ Bx TO: rb) =z (8)}. 
Recall from Sect. 2.6 that B x g T*@ is a vector bundle over B with projection 
p:BxgTQ—>B,  p(b,é):=b, 


and fibres p~!(b) = The) Q. By Proposition 2.6.1, B x g T*Q is an embedded sub- 


manifold of B x T*@Q and the natural projection B x T*Q —> T*@ restricts to a 


12.4 Morse Families 659 


vector bundle morphism 


g: BxgTOQ>T OQ, y(b, é) :=6, (12.4.1) 
which covers 77, 
TQCPH=TO/P. (12.4.2) 


Since z is a submersion, kerz’ is a vertical subbundle of TB (that is, a regular 
distribution on B) and the annihilator (kerz’)° is a vertical subbundle of T*B, cf. 
Examples 2.7.7 and 2.7.8. We denote 


VB :=kerz’, VB := (kerz’)”. 


In the present context, V°B is often referred to as the conormal bundle associated 
with VB. By Example 8.5.8/2, it is a coisotropic submanifold of T* B. 


Lemma 12.4.1 The mapping 
vw: BxogTQ>T*B, w(b,&):= (xx})"() (12.4.3) 
is an injective vertical vector bundle morphism with image V° B. It satisfies 
Wv*0p = 9" 00. (12.4.4) 
By Proposition 2.7.4, it follows that w is an embedding. Thus, as vector bundles, 
BxoT*Q=V°B. (12.4.5) 
Proof By definition, y preserves the fibres and is fibrewise linear. Moreover, 
Tpow=p, (12.4.6) 


so that y is vertical. Since the tangent mapping 7: T,B — T,(p)Q is surjective 
for every b € B, the dual mapping (mp)": i () Q — TB is injective and one has 


im(x})" = (ker(s}))° = VB. 


Thus, y is an injective vector bundle morphism with image V°B. 
To prove (12.4.4), let (b, €) € B xg T*Q. We choose a section 


o:B>BxgT@ 
such that o (b) = € and consider the induced decomposition of the tangent space 
T(,£)(B xq T*Q) =ker py, ¢) ® 0 (TpB). 


First, we show that both 6g and 6g vanish on the vertical component ker Pip £) For 
that purpose, take Z € ker Poy,e): Then, (12.4.2) implies 
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(9°00) 6,2)(Z) = Co) ow.) (9'(Z)) 
= (9(b, €), (19 0 g)'(Z)) 
= (9(b, &), 2’ 0 p'(Z)) 
=0 
and (12.4.6) yields 
(OB) y.¢)(Z) = On) y.e)(W'(Z)) 
= (Wb, &), (wp 0 Wy (Z)) 


=(W(. &), p'(Z)) 
=0. 
Next, we prove that y*@, and y*6g coincide on the transversal component 


o'(TpB). Using m9 °~vo0o0=7 and mg 0 Woo = idg, for every X € TyB we 
have 


(9°00) 4,.2)(0'X) = Oo) ocb.8)(9" 0. 0'(X)) = (§, 79 0g" 0.0'(X)) = (&, 0, X) 
and 
(W*OB) «2)(o'X) = Os) yee) (W! 00'(X)) 
((75)"§. 25 0! 00"(X)) 
= (x5) "8. X). 


Definition 12.4.2 (Morse family) Let z : B ~ Q be a submersion. A smooth func- 
tion S: B — R is called a Morse family along x if dS(B) is transversal to V°B in 
T*B, that is, if 


T, (T*B) =T,(dS(B)) + T,(V°B) (12.4.7) 
for alln € dS(B) AV°B. The triple (B, z, S) is referred to as a Morse family over Q. 
Remark 12.4.3 The notion of a Morse family generalizes the notion of a Morse 
function. Indeed, if S is a Morse family, then S},-1(,) is a Morse function for ev- 
ery x € Q, because the intersection of the transversality condition (12.4.7) with 


T,(T*z hie) yields the transversality condition (8.9.1) for Morse functions. Thus, 
S yields a family of Morse functions, parameterized by the points of Q. 


In the sequel, let us assume that the intersection dS(B) 9 V°B is nonempty. By 
Theorem 1.8.2, dS(B) N V°B is an embedded submanifold of T*B and one has 


T,(dS(B) N V°B) =T,(dS(B)) NT, (V°B). 
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Since dS(B) M V°B is contained in the embedded Lagrangian submanifold dS(B) 
of T*B, it is isotropic in T* B. Since, in addition, the restriction of the canonical 
projection zg: T*B — B to dS(B) is a diffeomorphism with inverse dS, the pro- 
jection 


Bs := 1p(dS(B) 1 V°B) 
is an embedded submanifold of B and the induced mapping 


(dS);B, : Bs > dS(B) NV°B 


is a diffeomorphism. Let us calculate the dimension of Bs. With dim B =n +k and 
dim Q =n, (12.4.7) implies 


dim(dS(B)N V°B) = dim(dS(B)) + dim V°B — dim(T*B) 
=(n+k)+ (2Qn+k)—2(n+k) 
=dimQ. 


Now, consider the vector bundle morphism y : B x g T*Q > T*B of Lemma 12.4.1. 
Since it restricts to an isomorphism from B x g T*Q onto V°B, there is a unique 
vector bundle morphism 4: V°B —> T*Q such that 

g=hoy, (12.4.8) 
where y is understood as a mapping to V°B. Obviously, A covers 7. Let 

ds: dS(B)NV°B > T*O 
be the restriction of 4 to the isotropic submanifold dS(B) 1 V°B and define 
As :=As 0 (dS)}B,: Bs > T*Q. 
In the sequel, we will need the following two natural inclusion mappings: 
i: dS(B)NV°B > V°B, jo: V°B > T*B. 

Lemma 12.4.4 Let (B, 2, S) be a Morse family over Q. 
1. We have 

i*00 = jg OB- (12.4.9) 


In particular, X is a strict symplectic reduction. 
2. The mappings X5 and As are Lagrangian immersions fulfilling 


M09 =i* 0 jx (Op), 69 =d(Spp). (12.4.10) 
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Proof 1. By (12.4.4), we have w* 0A*(@9) = W* 0 jj (Og). Since w is a diffeomor- 
phism, this yields the assertion. 

2. Using that dS(B) and V°B are transversal and that dS(B) NM V°B is isotropic, 
as well as (8.5.1), (8.7.2) and Proposition 7.2.1/5, for n € dS(B) MN V°B we obtain 


ker(As)}, =T,(dS(B) 0 V°B) Nker Ay 
=T,(d5(B)) NT,(V°B) N (T,(V°B))°? 
=T,(dS(B)) N (T,(V°B))°? 
= (T,,(dS(B)) + T, (V°B))°? 
= (T,(T*B))”” 
= {0}. 


Thus, As is an immersion. Since dS(B) M V°B has the dimension of Q, to see that 
As is Lagrange, it suffices to show that it is isotropic. Indeed, using point | and 
As = oi, we find 

d5(09) =1* o jj Oz) (12.4.11) 


and hence 4% (dg) =i* o jj (d0g) = 0, because dS(B) N V°B is isotropic. Finally, 
applying ((dS) +p; )* to (12.4.11) and using jooio(dS);g, =dSok, where k: Bs > 
B denotes the natural inclusion mapping, we obtain 


ASOQ =k* 0 (dS)* (6B) =k*dS = d(S}p;). 


In the last step we have used point 1 of Remark 8.3.3. 


Definition 12.4.5 (Fibre-critical submanifold) Let (B, 7, S) be a Morse family over 
Q. The embedded submanifold Bs C B is called the fibre-critical submanifold of S. 
The mapping As: Bs — T*Q is called the Lagrangian immersion generated by S. 


Remark 12.4.6 We give a local description of Morse families. Let x',i=l,...,n 
and y*, a =1,...,7r be local coordinates on B adapted to z, that is, z is given by 
U(X, y) =x. 


Denote the corresponding fibre coordinates in T*B by pj, Po and in T*Q by pj. 
Then, the conormal bundle V° B is defined by the condition 


and the morphism A has the form 


A(x, y, p, 0) = (x, p). 
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The mapping dS is given by 
as 0s 
dS(x, y) =(|xX,Y, = (x, y), = (x, y) . 
ox oy 
Thus, the submanifold dS(B) N V°B is defined by the (n + 2r) equations 
as 0s : 
yt ™ y) = Di: aya & y) =0, Pa =0, (12.4.12) 
and the fibre-critical submanifold Bs is given by the r equations 
as 
— (x,y) =0. 12.4.13 
ape %Y) ( ) 


In this language, transversality of dS(B) and V°B means that the (n +r) x r-matrix 


( as as ) 
ax! dy%’ dayBay 


must have rank r at all points (x, y) fulfilling (12.4.13) (Exercise 12.4.2). The La- 
grangian immersion As is then given by 


As(x, y) = (x OD ee. »). (12.4.14) 
ox 


with (x, y) fulfilling (12.4.13). Finally, in terms of the induced fibre coordinates x, 
y% on TB and x', p; on T(T*Q), the tangent space Tx, y) Bs C Tix,y)B is given by 


29 


i — 
axtdy ae sy) dy#ayB yj? = (12.4.15) 


and the tangent space Ty, (x,y)(As(Bs)) C Tascx.yy(T* Q) consists of the pairs 
(x, p), where 
2 29 
av 
xi 4 
dxtaxt YF" + Fpya 


Pi= —— (x,y), 


and x!, ® fulfil (12.4.15). 


Example 12.4.7 Let B = R? and Q = R and let x, y, px, Py and x, px be global 
Darboux coordinates on T*B = R* and T*Q = R’, respectively. We consider the 
surjective submersion 


x: R?>R, U(x, y)=x 


and the smooth function 


1 
S:R° SR, S(x,y):= ay + (x? — 1)y. (12.4.16) 
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The conormal bundle V°B is defined by the condition Py =0 and the morphism A 
has the form 


(x, y; Px, 0) = (x, Px). 
The mapping dS is given by 


as as ae 
dS(x,y)= X,Y, By dds gd) = (x59) 2ey,x Ty —1). 
x dy 


Consequently, the submanifold dS(B) M VB is given by 
Q2xy=pxr, x +y'=1, py=0 
and the fibre-critical submanifold Bs by 
e+y=1, (12.4.17) 


Thus, Bs is the unit sphere in R?. To check the transversality condition, we calculate 


a78 a7S 
eae abd = (2x, 2y). 
(oy ay? «.9)) (2x, 2y) 


This matrix has rank | for all (x, y) which fulfil equation (12.4.17). Therefore, S is 
a Morse family. The Lagrangian immersion As = A5 0 dS is given by 


os 
As(x,y)= (.. aa (x, ») = (x, 2xy), 
Xx 


with (x, y) fulfilling (12.4.17). Parameterizing Bs by 
(x, y) = (cos(2z1), sin(2zt)) 
with ¢ € R, we obtain the following parameterization of As: 
As(cos(2zt), sin(27t)) = (cos(2zt), sin(4zt)). 


We see that the image is a figure eight immersion with the self intersection point 
As(0, 1) = As(0, —1). Thus, (Bs, As) is locally, but not globally, a submanifold.” 
This example can be generalized as follows. Let 


aw:R"xR—- R", m(X, y) =X, 


let f : R” > R be a smooth function for which 0 is a regular value, and let 


1 
S:R’xR—-R, S(x, y):= 3) + fey. 


7 According to Example 1.6.6, As(Bs) can nevertheless be equipped with a submanifold structure, 
though in two inequivalent ways. Both of them are Lagrangian. 


12.4 Morse Families 665 


We encourage the reader to analyze this class of examples along the lines of the 
above discussion (Exercise 12.4.3). 


Example 12.4.8 Let f : Q — R” be a smooth mapping for which 0 is a regular 
value, let P := | 1(0) and let F : P > R be smooth function. We show that the 
canonical lift® (P, F F) of the pair (P, F) coincides with the image of the Lagrangian 
immersion Ag generated by the Morse family 


S:OxR’—R, S(x,y) := f(x)-y+ FQ), 


along the natural projection z : Q x R" > Q.Letée T* Q. To see that € € As(Bs) 
iff € € (P,F F), we choose coordinates x' on Q at 19(&). By Remark 12.4.6, the 
fibre-critical submanifold Bs of S is given by 


0s 
aya y) = fo(x) =0 
Hence, it coincides with P x R”. On Bs, we have 


as a°s A fa 
G O69): Syaqya ») ~ & ). 


Since this matrix has rank 7, S is a Morse family along z and thus Bs is an embed- 
ded submanifold of Q x R”. The Lagrangian immersion As generated by S is given 
in coordinates by 


af 
As(x,y) = (x yt (®) + ~~). 
Hence, its image consists of the points € in T*Q whose coordinates (x, p) satisfy 


p= yale) + = 0), ff) = 


for some y € R’. On the other hand, every X € Tzg(¢) P fulfils 2 Sle (x) X! = ‘ = 0, where 
X = X'0,i. Thus, we obtain 


(ex) = xi(y Fe \e 
Ox 


so that As(Bs) = (P, F), indeed. 


= (dF (g(€)), X), 


We have seen above that every Morse family for which dS(Q) 9 V°B is 
nonempty yields a Lagrangian embedding and hence, locally, Lagrangian subman- 
ifolds of T*Q. The following theorem implies that, conversely, every Lagrangian 


8See Example 8.3.8/4. 
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submanifold of a cotangent bundle can be locally obtained as the image of a La- 
grangian immersion generated by a Morse family. The theorem belongs to H6rman- 
der [141] and Maslov [197], see also [115, 181, 305]. 

Let zy : T*L — L be the canonical projection and denote 


V(T* Q) :=ker(z9) C T(T*Q). 


Theorem 12.4.9 Let (L, 1) be an embedded Lagrangian submanifold of (T* Q, d6g) 
and let 6, denote the canonical 1-form on T*L. Assume that the following condi- 
tions hold. 


1. The 1-form t*@9 on L is exact. 
2. There exists a Lagrangian subbundle F of (T(T*Q))tz which is transversal to 
both TL and (V(T* Q)) x. 


Then, there exists a Morse family (B, 2, S) over Q such that (Bs, As) is an embed- 
ded submanifold equivalent to (L, t). 


Proof Our proof is along the lines of Weinstein [305] and Libermann and Marle 
[181]. 

Assume that conditions | and 2 are fulfilled. Then, F is a Lagrangian comple- 
ment of TL in the symplectic vector bundle (T(T* Q));z and Theorem 8.6.4 yields 
a symplectomorphism ® of an open neighbourhood U of L in T*Q onto an open 
neighbourhood @(U) of the zero section so in T*Z which maps L to sg and satisfies 


®, (Fe) = Toe (TEL) = ker(a1)ip¢e) (12.4.18) 


for all € € L. By condition 2, Vz(T* Q) is acomplement of Fz in Tz (T* Q) for every 
& € L. Hence, (12.4.18) implies that D, (Vz (T* Q)) is a complement of ker(71) pe) 
for all € € L. By shrinking U we can achieve that this remains true for all  € U. 
Then, 


my 0 ®'(Ve(T*Q)) = Gog (To~)(TL)) =TeL (12.4.19) 
for all € € U. Since ® 01 = Sg and my, o P of = id;, the 1-form a on U defined by 
a:=69 — (Lomo ®)*00 — @*6, (12.4.20) 


satisfies 1*a = 0. Then, the generalized Poincaré Lemma 4.3.14 implies that U can 
be shrunk so that a = dh for some smooth function h on U satisfying h o1 = 0. 
Moreover, by condition 1, there exists a smooth function f on L with c*@9 =df. 
We choose B= U C T*Q anda :=(zg);u :U > Q and define S: U > R by 


S=h+@* onj f. 


Then, So.= ff and 
dS = 69 — P* 6). (12.4.21) 
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We show that S defines a Morse family along z which satisfies As(Bs) = U(L). 
Let us start with proving the latter relation. One can check that the vector bundle 
morphism 4: V°B — T*Q is given by the restriction to V°B of the canonical pro- 
jection T*(T*Q) + T*Q. Hence, As = A o (dS);p, is given by the natural inclu- 
sion mapping of Bs C B C T*@Q and thus we have to show that Bs = L. Let € € B. 
By definition of Bs, € € Bs iff (dS(€), Z) = 0 for all Z ¢ Vz B. By (12.4.21), for 
Z € Vz B, we have 


(dS(&), Z)= —((®*61),, Z) = —(®(E), my, 0 ®'(Z)). (12.4.22) 


Hence, if € € L, then ®(&) belongs to the zero section. Then, (dS(é), Z) = 0 and 
thus € € Bs. Conversely, if € € Bs, then (dS(€), Z) = 0 for all Z € VeB and 
(12.4.19) implies that @(€) belongs to the zero section of T*L. This implies € € L. 

It remains to show that dS(B) is transversal to V° B, that is, that (12.4.7) holds for 
all n = dS(&), & € L. Since by condition 2, F¢ is transversal to Tz L in T;(T* Q) = 
T; B, we have 


Tas) (dS(B)) = (dS), (Te B) = (dS) (Te L + Fe). 


By injectivity of (dS), the image (dS); (Fe) is a subspace of Tasig)(dS(B)) of 
dimension n. Since Tas (V°B) has dimension 3n and Tys(¢)(T* B) has dimension 
4n, it therefore suffices to show that 


(dS) (Fs) A Tasg)(V°B) = {0}. (12.4.23) 


For that purpose, let X € F: such that (dS), X E Tasce)(V°B) and let y be a curve 
through € representing X. By (12.4.18), y may be chosen so that ®(y(t)) is con- 
tained in TzL. As a first step, we show that (dS). X € Tascé) (V°B) implies that 


(ds oy(t),Zo y (t)) =0 (12.4.24) 
dt fo 


for all vector fields Z on B taking values in VB. Indeed, we may view Z as a 
function Z : T*B — R and we may assume that it is a submersion. Then, V°B is 
contained in the submanifold Z~!(0) of TB and hence 


Tasé) (V°B) Cc Tas@(Z7' (0)) = ker Zas(ée): 


Thus, if (dS). X € Tas (V°B), then 


d 
: ts oy(t), Zoy(t))= a As oy(t)) = Zas(e) ° (dS) (X) = 0. 


On the other hand, using (12.4.22) we calculate 


d d ; j 
ae y(t), Zo y(t))= “ae oy(t),m, oB'oZo y(t). 
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By our choice of y, the expression on the right hand side can be interpreted as 
arising from the pairing T:B x T; B — R. Hence, we may apply the product rule to 
obtain 


2 (dS o y(t), Zo y(t)) = —(®,X, 2, 0 &'(Z(8))) 
die en Se &))), 
where ©; X is viewed as an element of TfL via the identification of Tog) (TzL) 
with TL. The second contribution from the product rule vanishes, because ®(&) 
belongs to the zero section. Since by (12.4.24), the right hand side vanishes for all 
Z(&) € VeB, (12.4.19) implies that DX = 0. Since @7 is injective, we conclude 


& 
that X = 0. This proves (12.4.23) and hence completes the proof of the theorem. 


More generally, consider a Lagrangian immersion 1: L + T*Q. Since every 
point in L possesses an open neighbourhood for which points | and 2 of Theo- 
rem 12.4.9 are fulfilled, this theorem implies 


Corollary 12.4.10 Locally, every Lagrangian immersion is generated by a Morse 
family. 


Exercises 


12.4.1 In Example 8.5.8/2 we have shown that (VQ)° is a coisotropic submanifold 
of T*Q. Use Lemma 12.4.1 to give an alternative proof of this fact. 
Hint. Use Proposition 7.2.4 and the inequality (8.1.6). 

12.4.2 Prove the statements of Remark 12.4.6. 

12.4.3 Complete Example 12.4.7 by showing that for the class of functions under 
consideration, S is a Morse family. 


12.5 Stable Equivalence 


In this section we derive a partition of Morse families into classes which locally 
generate the same Lagrangian immersion. This will lead us to the notion of stable 
equivalence. The results below belong to H6rmander and Weinstein. 

The local generation concept alluded to above is the following. Let 1: L > T*Q 
be a Lagrangian immersion and let € ¢ L. We say that a Morse family (B, 2, S) over 
Q generates (L, 1) at € if there exists an open subset U of Bs and an open neighbour- 
hood V of € in L such that As(U) =1(V). The class of Morse families generating 
(L,t) at & will be denoted by #(L,1,&). According to Corollary 12.4.10, every 
Lagrangian immersion with target space T* Q is generated at an arbitrary point by 
some Morse family over Q. 

By the local nature of the generation concept under consideration, we can restrict 
our attention to the case where B is an open subset of Q x R” for some r, referred 
to as the fibre dimension of (B, zr, S), and where z is given by the restriction of 
the projection to the factor Q. Points of Q and R” will be denoted by x and y, 
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respectively, and the dimension of Q will be denoted by n. For the first and second 
derivatives of S: B — R we introduce the simplified block matrix notation 


Ss” AG 
S=[S..S], "= Ra | 


For all (x, y) € Bs, we have A (x, y) = 0. For such points, the fibrewise bilinear 
form Sy, on (VB)>g, is referred to as the fibre Hessian? of S. For (x, y) € Bs, 
Tay Bs = {(X, YeT,QO@R’: ales y)X + So, yY= o} (12.5.1) 
and for (X, Y) € Ti,y) Bs, 
(As\iq yy (X, Y) =S"(x,y)X + Be y)Y. (12.5.2) 


Since Ag covers 7, the vector Sey (x, y)Y is tangent to the fibre T* Q at € and thus 


belongs to the intersection Tz (T¥ Q) Tz L. Finally, we introduce the notation 
T:=mgoi:L>@Q. (12.5.3) 


Lemma 12.5.1 Let 1: L > T*Q be a Lagrangian immersion, let —€ € L and let 
(B, 2, S) be a Morse family of fibre dimension r generating (L,t) at &. For (x,y) € 
Bs such that € := Ags(x, y), we have 


r —rank S\,(x, y) =n — rank IT;. (12.5.4) 


Proof Let (X,Y) € Tw,y) Bs and Z := (As), y) (X,Y). Then, II; (Z) = X. Since 
r — rank Sy, y) = dimker S” (x, y) and n — rank IT; — dimker /7;, it is enough 


yy 


to compare the dimensions of the kernels. For that purpose, we set X = 0 in (12.5.1) 
and (12.5.2) and read off the system of equations 


Z=SCWY¥, — S@,y¥=0. 


If dim ker ae (x, y) =0, then Y = 0 and Z = 0 is the only solution and we obtain 
dim ker IT; = 0. If dimker Soy (x, y) £0, since S is a Morse family, Sry (x, y) has 


maximal rank and thus yields an isomorphism between ker Sy, (x, y) and ker J7;. 


Next, we use the Morse-Bott Lemma in the formulation of Corollary 8.9.13 to 
prove the Splitting Lemma for Morse families. We say that a mapping g between 
open subsets of Q x R” preserves the fibres if prg og(x, y) = x for all points (x, y) 
in the domain of ¢. 


Theorem 12.5.2 (Splitting Lemma) Let (B, 2, S) be a Morse family over Q of fibre 
dimension k +1 and let (xo, yo) € Bs. Write yo = (Yo. Yo) with ¥o € R! and ¥o € RK. 


°For an intrinsic definition, see [36, §4.3]. 


670 12 Hamilton-Jacobi Theory 


If the fibre Hessian ae (xo, Yo) has rank k, there exist open neighbourhoods B of 


(xo, Yo) in Q x R! and V of ¥o in R‘, a fibre-preserving diffeomorphism g from 
B x V into B, a smooth function S : B — R and a non-degenerate quadratic form 
Q on R¥ such that 


Sogx, 9,9 = Sx, 7) + AH). (12.5.5) 


Sisa Morse family along the submersion i : BE Q xR! > Q whose fibre Hessian 
Sis vanishes at (xo, Yo) and which generates the same Lagrangian submanifold at 


As(x0. Yo) as S, that is, Ag(Bs) = As(Bs 1 (B x V)). 


Proof Let (xo, Yo) € Bs. Up to a linear transformation which can be absorbed in 
gy, we may assume det Sos (xo, Yo) # 0. Therefore, among the defining equations 


S;, = 0 of Bs we can use the last k equations S, = 0 for locally determining 


y=v@,y) 


in terms of a smooth mapping y : B — R*. Then, the subset B s C B, defined by 
the equation Sy = 0, is locally given by the graph 


(x. 9. WO, 9)). 
By the local diffeomorphism 


g(x, y) = (x. 9,9 - W(x, 9) 


this graph is mapped to Bx {O}. Leti: B—- Bx R* be the corresponding inclusion 
mapping and pr, : B x IR‘ — B the natural projection. Denote 


S:=Sog!o0i:B>R 

and define 
5: BxR‘ SR, S:=Sog |! —Sopr,. 
This is a Morse-Bott function with critical submanifold B x {0}. Now, the Morse- 
Bott Lemma 8.9.13 yields a diffeomorphism @ and a non-degenerate quadratic form 
Q on R¥ such that 
SoOX,FHN=QH), prox, FN =(x,H). 

Then, y := @~! o @ yields the desired diffeomorphism. By construction, we have 

Sx, 9) = S(x.F.V@.H). (12.5.6) 
By assumption, dim ker S\,, 


maximal rank. Hence, also SY 5 (xo, Yo) has maximal rank J, that is, S is a Morse 


(xo, Yo) =/. Since S is a Morse family, Sey (xo, Yo) has 
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family. To show that Ky generates the same Lagrangian submanifold at (x9, yo) as S, 
we can use the local description of Remark 12.4.6. This way, from (12.5.6) we read 
off that By = Bs 1 (B x V). Then, equality of the images follows. 


Remark 12.5.3 


1. Lett: L > T*Q be a Lagrangian immersion and let € € L. Lemma 12.5.1 states 
that 


r >n—rank IT; = dimker /7;, (12.5.7) 


that is, the right hand side yields a lower bound for the fibre dimension of a 
Morse family generating (L,1) at €. Moreover, equality holds in (12.5.7) iff 
oe 6 y) = 0 for some (x,y) € Bs such that As(x, y) = &. A Morse family 
generating (L,1) at € which fulfils this condition will be said to be reduced. In 
view of Corollary 12.4.10, Theorem 12.5.2 states that a reduced family always 
exists, that is, that the lower bound provided by (12.5.7) is sharp. 

2. Let (B, 2, S) be a reduced Morse family over Q of fibre dimension r generating 
(L,1) at € € L and let (x, y) € Bs such that As(x, y) = &. Then, elements Z of 
u(T¢ L) are characterized by 


= Si (x, y)X + Shay: Sn (x, y)X =0, (12.5.8) 


where (X, Y) € T(x,y)B =T,@ ®R’. Since S is a Morse family, S i " (x, y) must 
have maximal rank at (x, y). Thus, by (12.5.8), we have the direct sum decom- 
position 


(Te L) = im S”, (x, y) © S”, (ker S”,(x,y)), (12.5.9) 


where im Se (x,y) =Tz(TEQ) Ne "(Te L). Moreover, Eq. (12.5.2) implies that a 
tangent vector (0, Y) € Ty ee (x) is tangent to the fibre-critical submanifold 
Bs iff it is contained in the kernel of Sy. y(x, y). Since Sy y (X, y) = 0, all vectors 
tangent to the fibres are contained in the ‘kernel of this aprile and we obtain 


Tu,y% 1) CTa.y Bs. (12.5.10) 


The Splitting Lemma can be interpreted as an operation on L (L,t,&), build- 
ing from a given element (B, 7, S) the reduced element (B, I, 5); The following 
operations produce Morse families belonging to #(L, t, &), too. 


(a) Addition: choose c € R and take (B, 7, S +c), 

(b) Composition: choose a submersion 7 : B — Q and a fibre- -preserving diffeo- 
morphism ¢: B — B and take (B, 7, So ~), 

(c) Suspension: choose a non-degenerate bilinear form Q on Ré and take (B, I, S) 
with B= Bx R', z= 0 prg and $ = pry S + pre, Q. 

(d) Restriction: choose an open subset BofB containing a point of ae (u(€)) and 
take (B, rg, Spp)- 
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These operations generate an equivalence relation in “(L, t, €), called stable equiv- 
alence.!° Theorem 12.5.2 implies 


Corollary 12.5.4 Every Morse family generating (L, 1) at & is stably equivalent to 
a reduced Morse family with that property. 


Now, we can state the main result of this section. 


Theorem 12.5.5 Any two Morse families generating (L, 1) at € are stably equiva- 
lent. 


Proof The proof is along the lines of the proof of Proposition 5.4 of [115] and 
Theorem 4.18 in [36]. By Corollary 12.5.4, it is enough to show that any two reduced 
Morse families in #(L,1,&) are stably equivalent. Since this is a local statement, 
we may assume Q = R". Let S and S be reduced Morse families defined on the 
open subsets B and B of R” x R’, respectively. By shrinking B and B if necessary, 
we may assume that As(Bs) = Az 3(B,). 

We will proceed in two steps. In the first step, we construct a fibre-preserving 
diffeomorphism yf : B — B such that the functions § and So w coincide on Bg. 
In the second step, we use the deformation method of Moser to construct a fibre: 
preserving diffeomorphism ®, such that So yo @, = S on the whole of B re 

To carry out the first step, define mappings g: B — R” x R” and 
¢: BR" x R" by 


g(x y)=(x, S(x,y), Ox y)=(x, Sx y)), 


respectively. Since both As and 45 cover the restriction of the projection to the 

factor Q, under the identification of T*Q with R” x R", the restrictions yz, and 

PR correspond to As and Ay, respectively. Let (xo, yo) € Bs and (Xo, Yo) € By be 
Ay 

such that As(Xo, yo) = Ag (Xo, Yo) = &. Since S is reduced, Sc (xo, Yo) has maximal 

rank /. Hence, its image 


W :=im SY, (Xo, Yo) 


is an /-dimensional subspace of T¢ (Ty, Q), the latter coinciding with R” according 
to the above identification. Denoting the corresponding orthogonal projection by 
p:R" > W, we define 


F:BxW-W, F(x, y, W) := po S(x,y) — 


Since W is the tangent space of the submanifold embedding!! y +> S‘.(xo, y) at 
y = yo, it is orthogonal to S’.(xo, yo) and we have p o S).(xo, yo) = 0. Hence, 


!0We refer to Remark 12.6.16 for a comment on the notion of stability in this context. 
"l According to Remark 12.5.3/2, W = Te(TEQ) Nu (T¢L). 
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F (Xo, Yo, 0) = 0. Since, by construction, I (xo, Yo, 0) = po Le (Xo, Yo) is bijective, 
the Implicit Mapping Theorem yields a smooth mapping g from some neighbour- 
hood of (xo, 0) in R” x W to R’ such that for all y in a neighbourhood of yo in R! 
one has p o S).(x, y) = w iff y = g(x, w). After further shrinking B and B if nec- 
essary, we can find an open neighbourhood U of both g(B) and @(B) in R" x R" 
such that pr, (U) = pr, (B) = pry (B) and such that the mapping 


p:U>B, p(X, Z) := (x, g(x, p@)), 


is defined. Then, g(x, p o S’.(x, y)) = y for all (x, y) € B and hence p o y = idg. 
Now, the desired fibre-preserving diffeomorphism is given by 


W:=pog: BoB. 
Indeed, due to 9(B;) = ¢(Bs), it satisfies w (By) = Bs, and for (x, y) € Bs we have 


As( (x, y)) = 90 p(y) = Asx, y), 


because points in p(B) are mapped identically under g o p. Thus, As o a B= AS 
S 
and (12.4.10) yields 


US, ,) = A5 0) = yay)” ° ASo) = U(Sins © VB.) 


so that on B g the functions So y and S differ by a constant!?. By absorbing this 
constant in S, we may assume that So w and S coincide on B re 

Now, we turn to the second step. We denote So = Sand $ 1 = Soy. Then, Bs, = 
Bs, = By and S; — So vanishes on B § up to second order. Thus, by a version of the 
Taylor Theorem, see Exercise 12.5.1, there exist smooth functions h®? on B such 
that 
Si; - = Da (12.5.11) 


a,B 


We put S, = So + ¢(S; — So) and seek for a time-dependent vector field X, whose 
flow ; preserves the fibres, maps B; identically and fulfils 7S; = So for all t € 
[0, 1]. The latter equality holds iff 


d dS, 
(S10 ®) = Jo + B¥(X,(S;)) = OF (X,(S+) + $1 — So), 


0= — 
d 


that is, iff 


X,(S;) + $1 — Sp =0. (13,5;12) 


We may of course assume By and hence B 5 to be connected. 
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Plugging in (12.5.11), together with the ansatz 


dSo 
X= Bey 
= a! ay” ne 
we obtain 
dSo 0So dSo 0 dSo 0S0 
oe) 7 on 3 (s a aes °). 
2, dy ayb +2 Oy” ayB ot dX dy? ayt 


This equation is fulfilled if 
0=H+F(1+G), 


where H =h*? and F = f%?, and where G denotes a matrix-valued function which 
vanishes on Bs for all values of t. In a neighbourhood of B; we can solve this 
equation with respect to F and thus determine X, and @,. By construction, ®, 
preserves the fibres and fulfils $; 0 @; = Sg. Moreover, since H = 0 on B gy we have 


F =0 and hence X; = 0 there. Hence, ®; maps B g identically. 


Remark 12.5.6 


1. A slightly more direct, but not shorter proof of Theorem 12.5.5 can be obtained 
by using the following local description of Morse families, see [78, 141] and 
Exercise 12.5.2. Let L Cc T* Q be Lagrange. Then, for every & € L there exists a 
neighbourhood Lo C L, together with appropriately chosen local coordinates x’, 
y! on T*Q, and a smooth function y+ H(y) such that Lo is generated by the 
Morse family 


S(x,y) =x-y-— Ay). (12.5.13) 


Then, Lo consists of the pairs (x = H’(y), y). 

2. We reformulate the results of the above discussion in terms of Lagrangian sub- 
manifold germs. For that purpose, let us refer to a Morse family (B, 2, S) over Q 
such that As(Bs) contains some given point € € T* Q as a Morse family at €. The 
operations of addition, composition, suspension and restriction, discussed prior 
to Theorem 12.5.5, naturally apply to Morse families at € and thus the notion of 
stable equivalence naturally extends to these families. 

A germ of immersions to a manifold M at a point m € M is an equivalence 
class of immersions to M such that m belongs to their images with respect to 
the following equivalence relation: two immersions (P, g) and (P, ~) of M are 
equivalent at m if there exist open subsets U C P and U C P such that m € 
QU) = go(U ). Theorem 12.5.5 states that germs of Lagrangian immersions at 
— € T*Q bijectively correspond to stable equivalence classes of Morse families 
at €&. In particular, Morse families at € are stably equivalent iff they generate the 
same germ of Lagrangian immersions at &, that is, iff there exist open subsets 
U C Bs and Uc By such that € € As(U) = A,(U). Thus, the latter may be 
taken as a geometric definition of stable equivalence. 


12.6 Maslov Class and Caustics 675 


The concept of Morse family is of fundamental importance in geometric asymp- 
totics, to be dealt with in Sects. 12.7 and 12.8. There is a variety of other physical 
applications, in particular in thermodynamics and in the statics of mechanical sys- 
tems, see [38, 40, 41, 293, 294], as well as [152]. 


Exercises 

12.5.1 Prove the following version of the Taylor Theorem. Let f be a smooth func- 
tion on a manifold M and let g : M — R’” be a smooth submersion. If f 
and df vanish on g~!(0), there exist functions f‘/ on a neighbourhood of 
g~!(0) in M such that 


f=>_f ap 
in] 


12.5.2 Prove the statements of Remark 12.5.6/1. 
Hint. Use the atlas on &(IR”) constructed in Sect. 7.6. 


12.6 Maslov Class and Caustics 


Let Q be a manifold of dimension n and let 1: L > T*Q be a Lagrangian immer- 
sion. In this section, we study the intersection properties of the tangent spaces of 
this immersion with the vertical distribution on T* Q. In this context, we will find a 
topological invariant which plays an important role in geometric asymptotics. 


Definition 12.6.1 (Caustic) The set of critical points of HT := 1g 01: L—> Q is 
called the singular subset of L and is denoted by 4'(L). The set of critical values of 
IT is called the caustic of L and is denoted by I(L). The points of /"(L) are called 
focal points. 


Remark 12.6.2 Since L and Q have the same dimension, a point & € L is critical iff 
the tangent mapping IT; is not injective. 


Let %,(L) denote the subset of L consisting of the points € where IT; has rank 
n — k. These subsets provide disjoint decompositions 


L= (LUE), BL) := LJ Eel). (12.6.1) 
k=1 


First, we show that the structure of the Maslov cycle of a Lagrangian subspace, 
discussed in Sect. 7.6, generalizes to Lagrangian submanifolds in generic position, 
provided the induced bundle :*T(T* Q) is trivial. This result belongs to Arnold [13]. 
For the case where this bundle is nontrivial, we refer to [8] and [56]. Let R2” be 
endowed with the canonical symplectic structure and let Lo := {0} x IR”. We denote 
Fn) := F(Lo) and L(n) := LR"), cf. Sect. 7.6. Let 


x: “T(TOQ)> Lx R™” 
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be a trivialization mapping the vertical distribution to Lo, that is, 
x (Tue) (Tire) 2)) = Lo (12.6.2) 
for all € € L. Via x, the image of the tangent mapping 
te: TeL > Tie)T*Q 
is identified with a Lagrangian subspace in R2”. This way, we obtain a mapping 
F:L> L(n), F(é):= xX (4%; (TeL)). (12.6.3) 


We say that L is in generic position if the mapping F defined by (12.6.3) is 
transversal!? to every submanifold %(n). One can show that every Lagrangian 
submanifold can be brought to generic position by an arbitrarily small transforma- 
tion.!4 


Proposition 12.6.3 Let 1: L > T*Q be a Lagrangian immersion in generic posi- 
tion such that the induced bundle «*T(T* Q) is trivial. Then, X(L) is either empty 
or an embedded submanifold of L of codimension _ . Moreover, X\(L) pos- 


sesses a natural coorientation compatible with the natural coorientation of L(n) 
under the tangent mapping of F. 


Proof Let & € X%(L). Then, rank IT; =n-—k and hence dim ker IT; =k, that is, 


dim(i, (TeL) ATi) (Ti Q)) =k. (12.6.4) 


By applying the diffeomorphism x we obtain dim(F(&) M Lo) = k, that is, F(é) € 
L,(n). Thus, F (2% (L)) C &(n). Conversely, if F (E) € Y(n), then (12.6.4) holds 
and hence & € 2, (L). Thus, F(24(L)) = &(n) NM F(L). It follows that 


Uy (L) = F71(G(n)). (12.6.5) 


Now, Theorem 1.8.2 and Corollary 7.6.11 imply that %;(L) is an embedded sub- 
manifold of codimension Her) Since FY induces a bijection between the normal 


spaces Nz (2 (L)) and N FEA (n)), the natural coorientation of A (n) provided 
by Proposition 7.7.7 carries over to (ZL). 


Remark 12.6.4 


1. Proposition 12.6.3 implies that X'(L) has a structure analogous to that of the 
Maslov cycle SY(L) in L(n), cf. Remark 7.6.12: it is a stratified subset of L, 
with the stratum »,;(L) having codimension | in L and the other strata having 
codimension at least 3. This implies that (ZL) is open and dense in 4'(L). 


13This means, in particular, that F(L) does not intersect LZ(n) forn < er) 


4This is a consequence of the Sard Theorem 1.5.18, see Lemma 4.1.3 in [13]. 
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2. Let us describe L explicitly in terms of local coordinates x', pj on T* Q induced 
from a local chart on Q. Let x! and p; denote the corresponding fibre coordinates 
on T(T* Q), let U C L be an open subset such that ¢;y7 is an embedding and 1(U) 
is contained in the domain of the coordinates x’ and p;. Since the local trivial- 
ization of T(T* Q) induced by the bundle coordinates x, Dis x! and pi fulfils 
(12.6.2), it can be used to construct the mapping F : U > £(n). By Proposi- 
tion 7.6.8, for every € € U there exists a subset K C {1,...,m} such that F(é) 
belongs to the domain of the local chart gx on #(n) defined by (7.6.12) and 
(7.6.16). This local chart assigns to F(€) an n-dimensional symmetric matrix, 
which we denote by Ar AG ). According to (7.6.18) and (7.6.19), in terms of the 


fibre coordinates x! sai pi, the Lagrangian subspace t’Tz L of Tz (T* Q) is given 
by the n equations 


=> AP OHM - DI AL@ Pm, LEK, (12.6.6) 
j¢K meK 
=) AK @ si - SAL @) bm. i K. (12.6.7) 
JEK meK 


Hence, it can be parameterized by the fibre coordinates corresponding to the co- 
ordinate functions x! with i ¢ K and p; with 1 € K. Therefore, these functions 
provide a chart kx on L in a neighbourhood of €. According to (12.6.6) and 
(12.6.7), the matrix entries Ay (€) can be expressed in terms of ae deriva- 
tives of the local eaaenieive: of the remaining coordinate functions x* in this 
chart: 


ox I 
AR (é)= ———(kk (&)), A@= ocx (€)), 
oPm (12.6.8) 


Ak @) = (e K(&)), Ake) = 5 —(k K(&)), 
where /,m e€ K andi, j ¢ K. 

3. Consider the singular subset X'(L). For given K C {1,...,} consisting of k 
elements and given & € L such that F(é) pene) to the domain of the chart gx, 
Proposition 7.6.10 yields that € € X%(L) iff Af mS) = 9 for all /,m € K. In par- 
ticular, for K = {1}, an open subset of 5i(L) is mapped under € b> AK (€) to 
the subspace of the vector space of n-dimensional symmetric matrices A given 
by Ai; = 0. Hence, for all € in this subset we have Be (eK (€)) = 0. Accord- 
ing to Remark 7.7.8, in the corresponding chart gx on &(n), the coorienta- 
tion of a, (n) points from the side where A;; > 0 to the side where Aj; < 0. 
Thus, the first relation in (12.6.8) implies that the induced coorientation of 
- (L) points from the side where ax (py, x2, ..., x) <0 to the side where 


aon ‘(pix 2.x") > 0. 
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Next, we carry over the Maslov index for closed curves in #(n) and the inter- 
section index for curves in #(n) with the Maslov cycle oP (n) to curves in L. 

First, consider the Maslov index jz for closed curves in &(n), cf. Defini- 
tion 7.7.1. Using the mapping F’,, we can define the Maslov index for closed curves 
y in L by 


LL(y) :=U(F oy). 


Like jz, the Maslov index jy, defines a homomorphism from 7 (L) to Z, denoted by 
the same symbol. On the other hand, via F,, the universal Maslov class w on #(n) 
defined by (7.7.4) induces a 1-form 


on L, called the Maslov class of (L, 1). By Proposition 7.7.4, for a closed curve y 
in L, we have 


me] Fux [ w= UWFoy)=pL(y). (12.6.9) 
Y y Foy 


As explained in Remark 7.7.6/1, integration of 1-forms over closed curves de- 
fines an isomorphism from the de-Rham cohomology group H!(L) to the group 
Hom(z;(L), R) and (12.6.9) implies that this isomorphism assigns to the Maslov 
class j41 the homomorphism defined by the Maslov index jr. 

Second, consider the intersection index of curves in #(n) with respect to the 
Maslov cycle P (n), cf. Definition 7.7.10. We carry over the terminology of cross- 
ings introduced in Sect. 7.7 to the present situation: a real number ¢ such that 
y(t) € X'(L) is called a crossing of y with 2'(L). A crossing f is said to be simple 
if y(t) € X\(L). A simple crossing is said to be transversal if y(t) ¢ Ty a) 21(L). 
Depending on whether y(t) is positively or negatively oriented with respect to the 
natural coorientation of &'|(L) provided by Proposition 12.6.3, a simple transversal 
crossing is said to be positive or negative. Proposition 12.6.3, the transversality of F 
and 7 (n), and the definition of the coorientation on X' (ZL) imply the following. 


(a) Crossings of y with X'(L) are crossings of F o y with Pin) and vice versa. 
(b) A crossing of y with 2(L) is, respectively, simple, transversal, positive or neg- 
ative iff it is so as a crossing of Fo y with #(n). 


Since the complement of X'9(L) U X)(L) in L is the closure of the embedded sub- 
manifold &(L) which by Proposition 12.6.3 has codimension 3, point 1 of Propo- 
sition 7.7.9 carries over to the present situation, yielding that 


(c) every curve in L with end points in Xo(L) is homotopic with fixed end points 
to a curve which has only simple transversal crossings with »'(L). 


Since a homotopy with fixed end points of y in L induces a homotopy with fixed 
end points of F o y in @(n), point 2 of Proposition 7.7.9 implies that 
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(d) if two curves in L which have the same end points in X(Z) and which have 
only simple and transversal crossings with »’(L) are homotopic with fixed end 
points, their differences between the numbers of positive and negative crossings 
coincide. 


Points (c) and (d) allow for 
Definition 12.6.5 (Maslov intersection index for Lagrangian immersions) The 


Maslov intersection index of a curve y in L with end points in X’g(L) is defined 
by 


ind; (y) := v4 —v_, (12.6.10) 


where v+ is the number of positive crossings and v_ is the number of negative 
crossings with %'(L) of a curve which is homotopic with fixed end points to y and 
whose crossings are all simple and transversal. 


By point (d), the intersection index so defined is invariant under homotopies with 
fixed end points. It is obviously additive with respect to the composition of curves. 
By (a) and (b), we have 


ind; (y) =Ind,,(F oy) (12.6.11) 


for all curves y with end points in Xo(L). 


Proposition 12.6.6 Let L C T*Q be a Lagrangian submanifold and let y : [0, 1] > 
L be a closed curve with y (0) = y(1) € Xo(L). Then, 


L(y) = ind, (y). (12.6.12) 


This shows, in particular, that the definition of 4, does not depend on the choice 
of the trivialization of 1*T(T* Q) in the construction of the mapping F’. 


Proof Under the assumption that y(0) = y(1) € Xo(L), the Lagrangian subspace 
Lo of R2” is transversal to F o y(0) and F o y(1), so that we can apply Theo- 
rem 7.7.11. In view of (12.6.11), this yields 


LL(y) =U oy) = Indz, (F 0 y) = indy (y). 


Remark 12.6.7 The intersection index indy (y) can be expressed in terms of the 
Kashiwara index by a formula analogous to (7.8.9): for € € L we define 


Lye = Te (The) Q), Lo := UTeL. 


We choose a sufficiently fine covering {U;} of L such that over each U; there exists 
a smooth choice of an auxiliary subspace Li transversal to both Lj¢ and Ly¢. Let 
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y : [a,b] > L be a curve in L and choose a = fg < ty <---: < t, = b such that 
y ((tj-1, tj]) C U; for all 7. Then, Proposition 7.8.8 implies 


ind; (y) 


As(Lie, Lg, Lip) rey (ayy — 8(E1§ £25, L4g) jenygp)> (12.6.3) 


1 
2 


k 
j= 


with the Kashiwara index s taken in the symplectic vector space T; (T* Q). 


Example 12.6.8 


1. Consider R?, endowed with the canonical symplectic structure and with canoni- 
cal coordinates q and p. Let L be an embedded submanifold diffeomorphic to S!. 
For dimensional reasons, L is Lagrange. Let us calculate the Maslov index of L 
using (12.6.13). For that purpose, it suffices to consider a closed curve t + y(t), 
which runs through L exactly once in the direction of a chosen orientation. For 
simplicity, assume that L coincides with the circle defined by g* + p? = 1 and 
that it is oriented clockwise. The singular subset »'(L) consists of the points 
n+ = (1, 0) and n_ = (—1, 0). We choose a covering by connected open subsets 
Uj,...,U4 such that n+ € Uz and n— € U4 and a compatible partition of y. If 
we use the clockwise orientation of R?, the four terms in (12.6.13) are 0 for U; 
and U3 and +1 for U2 and U4, respectively. Thus, we obtain the Maslov index 
+2. This can also be understood in the following way. While running through 
the singularities, the relative position of L3 to the pair (L;, L2) changes. Above 
n+ the subspace L3 lies between L; and L2, whereas beneath n+ it lies outside 
of L,; and L2 in the sense of the chosen orientation. 

2. Ina similar way, one can discuss the Lagrangian immersion 1 : L > R? given by 
L=S|!, realized as the unit circle in C, and 


1:L > R?, u(e'*) = (cos(), sin(2¢)). 


This is a figure eight immersion in horizontal position with respect to the canon- 
ical projection. It has the two self-intersection points +i. The singular subset 
»/(L) consists of the two points +1. The reader can easily convince himself 
that for a curve y which runs through L exactly once, Formula (12.6.13) yields 
LL(y) =0 (Exercise 12.6.1). 


Now, let us analyze the Maslov intersection index using the concept of Morse 
families. For that purpose, let U C L be open and let (B, 7, S) be a Morse family 
generating the Lagrangian immersion t;y : U — T*Q. As in Sect. 12.5, we as- 
sume that B is an open subset of Q x R’ and we write (x, y) for its elements. We 
will also use the simplified notation for the second derivatives introduced there. By 
Lemma 12.5.1, for (x, y) € Bs and é € U such that 1(€) = (x, y), we have 


: W = y 
dim ker Sy (x,y) = dim ker /7;.. 
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Thus, Remark 12.6.2 implies that 1(2'(U)) coincides with the image under As of 
the set of solutions of the system of equations 


Sy =0, det(S¥,,) = 0. (12.6.14) 
Hence, the caustic (U) = IT(2'(U)) is given by 


rU)={xeQ: det(S}, (x, y)) =0 and S\,(x, y) = 0 for some y € R’}. 
(12.6.15) 
On the other hand, outside X'(L) we have 


det(S!,) 40, 


so that for points which are not critical, the first of the equations in (12.6.14) can be 
solved for the variables y*. Thus, if U does not intersect 3’(L), S can be reduced to 
a single generating function on Q. 


Example 12.6.9 For the Morse family of Example 12.4.7, which generates the La- 
grangian immersion of Example 12.6.8/2, the criterion (12.6.14) yields 


2 


a°s 
a7 (Xr y) = 2y =0. 


—(x,y)=x*+y*—-1=0, 
Dy (x, y)=x y dy 


Hence, the critical points are (x, y) = (£1, 0) and the focal points are x = +1. This 
is consistent with what we have found in Example 12.6.8/2. 


Next, we derive a formula for the intersection index in terms of generating Morse 
families. Recall that the index of a quadratic form Q ona vector space V is defined to 
be the number of negative eigenvalues, that is, the dimension of a maximal subspace 
of V on which H is negative definite. One has 


index(Q) = 5 (rank(Q) — sign(Q)), (12.6.16) 


where sign(Q) denotes the signature, that is, the number of positive eigenvalues 
minus the number of negative eigenvalues, counted with multiplicities. 
Lemma 12.6.10 Let. : L + T*Q be a Lagrangian immersion. Let there be given 


1. an open covering {U;} of L such that all the intersections U; 1. U; are connected, 
2. Morse families (Bj, 7, Si), with B; being open subsets of Q x R", such that the 
immersions t; : Uj  T*Q and As, : Bs, > T*Q are equivalent. 
Then, the mappings!» 
cj: U; NU; > Z, cij = index((S/’) . a - index ((S”’) Soa) (12.6.17) 
4 J ’ Si Yi 2d2I 


are constant for alli, j. 


By an abuse of notation, via the diffeomorphism Bs, + Uj; induced by As,, index( (Si)f, yi) is 
viewed as a function on Uj. ; 
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Proof Let (i, j) be a pair of indices such that U; 1 U; is nonempty and let & € 
U; OU;. Denote bo) = Ae (9), / =i, j. Since the Morse families (B;, 77;, S;) and 
(B;,7;, S;) generate 1: L > T*Q at &, for / =i, j, a restriction of (B), 7, S;) to 
some open neighbourhood of bo; in B; arises from some reduced Morse family 
(B, x, 8) generating (L,1) at € by the following operations, applied in the order 
they are listed and with additions and restrictions omitted: a composition with a 
fibre-preserving diffeomorphism yw), a suspension with a non-degenerate quadratic 
form Q; on R and a further composition with a fibre-preserving diffeomorphism 
g. Thus, in a neighbourhood of bo, 


81 = (Sow +Q) ogi, (12.6.18) 


where we have omitted the natural projections occurring in the suspension. To an- 
alyze how the index of the second derivative with respect to the fibre coordinates 
behaves under the above operations, let (B, 2, S) be a Morse family over Q with 
BC Q xR’ open. For a non-degenerate quadratic form Q on R* we have 


index ((S + o)) = index(S’,,) + index(Q). (12.6.19) 


For an open subset BCQxR' anda fibre-preserving diffeomorphism ¢ : BoB 
we calculate 


(S 0g), (b) = (v6) " (8%, (9D) 9, (B) + (S,(9D)) 9%, ). 


Since ~(Bso) = Bs and since S;, = 0 on Bs, and since a similarity transforma- 
tion with a non-singular matrix does not change the index of a quadratic form, this 
implies 

index((So g)),) = index(S',, 0 ?). (12.6.20) 
Using (12.6.19) and (12.6.20), from (12.6.18) we obtain 


index((S))%,,, (b1)) = index(S%; (1 © gi(b)))) + index(Q)) (12.6.21) 


for all bj in a neighbourhood of bo, where / = i, 7. Since Asi o g(bj)) = 
As, (bi), if As, (bi) = As,(bj), then Yi o gi (bi) = Wj ° gj (bj). Hence, (12.6.21) 
implies 


cij (€) = index(Q;) — index(Q;) 


for all € in some neighbourhood of & . This shows that c;; is constant in a neigh- 
bourhood of every point of U; M U;. Since the latter is connected, this implies that 
cij is constant. 


Proposition 12.6.11 Let:: L — T*Q be a Lagrangian immersion. Let y : [0, 1] > 
L be acurve with end points in Xo(L) which has only simple and transversal cross- 
ings with 3/(L). Choose numbers 0 = to < t) < +++ <t, = 1 such that there exist 
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1. open subsets U,,..., Ux such that y ([t;-1, t)]) C U; and U; NU; are connected, 
2. Morse families (Bj, ;, Sj), with B; being open subsets of Q x R", such that the 
immersions t; : U; > T*Q and As, : Bs, > T*Q are equivalent. 


Then, the intersection index of y is given by 


k 


ind, (vy) = ) “{index((S;)",, (7 (4:))) — index((S;)".,(v@i—D))}- (2.6.22) 


i=l 


Proof Using Lemma 12.6.10, for the right hand side of (12.6.22) we find 


k 
2 {index((S;)%, ,, (vy Gi))) — index((Si)%. y, (vy (G—1)))} 


k-1 


= index (($,)", ,, (v())) + Yocizsi(v i) —index(($0)", (7 )). 
i=l 


Since the c;; j are constant on U; N Uj, the sum on the right hand side does not 
depend on the choice of the numbers #;. In particular, we may choose them in such 
a way that y(¢;) € X’o(L) for all i. If the line segment y ([#;_1, t;]) does not intersect 
»(L), Lemma 12.5.1 implies that the rank of aes is constant on y ([t;—1, t;]). 
Then, also the signature is constant. By (12.6.16), then 


index((S;)%, y, (7())) = index((S;)"., (y Gi-1))). 


so that this line segment does not contribute to the sum on the right hand side of 
(12.6.22). For the remaining line segments we may assume that each of them con- 
tains exactly one crossing. By the Splitting Lemma 12.5.2, we may also assume 
that the corresponding Morse families (B;, 7, $;) are reduced, which means that 
they have fibre dimension 1, because the crossings are simple. Then, Lemma 12.5.1 
yields that (S; yy y, (7 (t)) = 0 at the crossing and (5; as (y(t)) £0 outside. There- 
fore, (Si)5,y a) (y(t)) has rank | at ¢; and t;1, whereas the signature changes by +2 at 
the crossing. By (12.6.16), the index then changes by +1 there. As a consequence, 
the right hand side of (12.6.22) counts the crossings of y with 2 (L), weighted 
by +1 in case (5S; “ y, (7 (t)) changes its sign from + to — and weighted by —1 
otherwise. It remains to show that this weighting is consistent with the counting of 
the crossings in the Maslov intersection index (12.6.10), that is, that (S$; a y (y(t) 
changes its sign from + to — iff y crosses &'|(L) in the direction of the coorienta- 
tion of &' (ZL). Since the argument is independent of the line segment, we may omit 
the index i. Let t, be the crossing under consideration and denote & = y(t,). For 
the first part of the argument it is helpful to distinguish between points in U C L 
and points in Bs. Therefore, let (x¢, y-) € Bs be such that 1(&) = As (Xe, ye). Since 
Sey (Xe, Yc) = 0, the bilinear form Sy 4 y (Xe, Ye) must have maximal rank, that is, rank 


1. Thus, we can find coordinates x! on a neighbourhood of I7(€) = x¢ in Q such that 


684 12. Hamilton-Jacobi Theory 


# 0 in a neighbourhood of (xc, yc) in B. By replacing x! by —x! if necessary, 


oe 
we may assume that 
a7s 
—— >0. 12.6.23 
ax!dy ( ) 
Then, the Implicit Function Theorem yields a function xls xy, x7,0.., x") ful- 
filling 
as 2 n 2 
Xe HY, HM, Hy) =O 
FOO Maae aenaa) 
Hence, CO cg OV ts oD) € Bs, so that y and ee oe provide 
coordinates on By in a neighbourhood of (x,., y-). Since y (x!(y, Cg eneee as 8 


Ce ci y) is acurve in Bs, Eq. (12.4.15) implies 


a7S ax! n a7S 
dydx! dy dy? 


=0. (12.6.24) 


; 1 . 
Hence, along the line segment, we have me = 0 at the crossing and 


ax! as 
sign( - )- sien( 7) (12.6.25) 


outside. Now, consider L. By (12.4.12), in the coordinates x! and pi on T*Q, for 
points in a neighbourhood of & in L we have p; = 3. Using that y,x”,...,x” 
provide coordinates on Bs, we find : 


dpi aS axl aS 
dy d(x!)? dy — ax!ay’ 


Since at €& we have 2 = 0, the inequality (12. °. 23) implies = 2p 1 > 0 in some neigh- 
bourhood of &, in ae so that we may take pj, x~,...,x” as aie: on L there. 
Then, 

ax! _ ox opi 1 

“ay y ~ apy Oy” 


From this and from (12.6.25) we read off that, along the line segment, we have 


ax! 
a = = 0 at the crossing and 


: (=) (=) (75) 
sign}; — ]=s19n = sien 
o"\ Opi rN Dy an ay? 


outside. According to Remark 12.6.4/3, this means that if the line segment crosses 


»,(L) in the direction of the coorientation, that is, if ~ changes its sign from — 
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a2 : A : . 
to +, then a changes its sign from + to — and, hence, its index changes by +1. 


This completes the proof. 


Lemma 12.6.10 and Proposition 12.6.11 imply 


Corollary 12.6.12 Let 1: L — T*Q be a Lagrangian immersion and let y be a 
curve in L with end points in Xi9(L). Under the assumptions of Proposition 12.6.11, 
we have 
k-1 
ind, (v) =~ cii+1 + index((S,)", ,, (yD) — index(()", (7 0))). 
i 
If y is closed, then 


k-1 


i=) aia. 


i=l 
In particular, a Lagrangian immersion generated by a single Morse family has triv- 
ial Maslov class, that is, in this case the Maslov index of any closed curve vanishes. 
Example 12.6.13 
1. Let L be the circle g* + p? = 1 in R?’, cf. Example 12.6.8/1. Choosing p as a 


coordinate on L in the vicinity of each of the critical points n+ = (+1, 0), we 
find x(p) = +,/1 — p? and hence 
Ox 2p 


ip = ee 


Thus, at 7+, the coorientation points from the upper half-plane to the lower half- 
plane, whereas at 7_, it points in the converse direction. Consequently, a curve 
running once clockwise around L has Maslov index +2. This is consistent with 
what we have found in the above-cited example. Note that Corollary 12.6.12 tells 
us that the circle cannot be generated by a single Morse family. We encourage 
the reader to construct a set of generating Morse families (Exercise 12.6.3). 

2. Consider the Morse family of Example 12.4.7, which generates the figure eight 
immersion in R* of Example 12.6.8/2. As in point 1, using p as a coordinate 
on L at the critical points n+ = (+1,0), we find that at n, the coorientation 
points from the upper half-plane to the lower half-plane and that at n_ it points 
in the converse direction. Let us determine the Maslov intersection index of the 
following four curves. Denote ¢+ = (0, +1) (the self-intersection points) and 
&44 = ( wo =): Define y; to run from &,4 through n+ to €4_, y2 from 

€,— through ¢_ to &__, y3 from &__ through n_— to €_4 and y4 from é_+ 

through ¢+ back to 4. Since y; traverses n+ in the direction of the coori- 
entation, whereas y3 traverses 7— in the direction opposite to the coorientation, 
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we find 
ind; (71) = 1, ind; (v2) = 9, ind; (3) =—1, ind; (74) = 0. 


In particular, the closed curve obtained by composing y|,..., 4 has Maslov 
index 0. This is consistent with Corollary 12.6.12. 


Next, we show that there is a variety of bundle structures over L associated with a 
given Lagrangian immersion : L — T* Q. These bundles contain information about 
how the pieces of this immersion generated by Morse families are glued together. 
Let £(L, 1) denote the class of Morse families over Q locally generating 1: L > 
T* Q. Recall that the elements (B, 2, S) of @(L, v) are characterized by the property 
that there exists an open subset Us of L such that the immersions (Bs, As) and 
(Us, tty) are equivalent. With any two elements 5), Sz of (L, +) such that Us, NUs, 
is nonempty, one can associate a mapping 


C5,,8):Us, Us, > Z, es, 8, (€) == index((S1)}, y, €)) — index((S2)5, y, )), 


called the transition function of S; and Sz. Here, y; and yo are arbitrarily chosen!® 


fibre coordinates on By, and Bo, respectively. By Lemma 12.6.10, cs,,s, is constant 
on each connected component of Us, 1 Us, and hence smooth. Now, take the subset 
of L x #(L,t) x Z consisting of the elements (&, S, k) such that € € Us and define 
M_ to be the quotient of this subset by the equivalence relation 


(1, S1,k1) ~ (2, S2,k2) iff & =& and kj —k2 =cs,\5, (1). 


From the direct product L x &(L,1) x Z, the quotient M; inherits the natural 
projection 


nML: Mr > L, mM ([E, S,k)]) = 6. 
The elements S of @(L, ) define mappings 
xsi(@M@4) Us) >U,xZ, — xs([E5,0]):= (Ek +e5 3), 


which are easily seen to be bijective. The transition mappings are given by 


XH Xs, EK) = (Ek + cm,5,)). 


Since they are smooth, the family {ys : S € Y(L,1)} defines on Mz, the structure 
of a smooth manifold, cf. Remark 1.1.10. Second countability thereby carries over 
from L, because the latter implies that the covering {Us : S € @(L,1)} contains a 
countable subcovering. This way, Mz, becomes a locally trivial fibre bundle over L 


16For any two choices, the quadratic forms Sy, (€) are similar and hence have the same index. 
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with canonical projection 7™“ and typical fibre Z. Finally, one can check that the 
mapping 

W:M,xZ—> Mz, W((E,S,4)],D) :=[E,S,4+D], 


is well-defined and that it endows M, with the structure of a principal Z-bundle 
over L. This bundle is called the Maslov principal bundle. With the Maslov principal 
bundle, there come the following three associated bundles: 


(a) My, xz Za via the induced action of Z on Z4 = Z/4Z by translation, 
(b) M; xz U(1) via the action of Z on U(1) defined by (k, a) B etka, 
(c) @, := My, xz C via the corresponding action of Z on C. 


Md, is called the Maslov line bundle. The embedding 
Z4—>U(1), kmod4p ei2*, 


induces a vertical principal subbundle embedding M;, xz Za — My, xz U(1). Using 
sheaf theory one can prove that there exists a family of functions 


{cs EC’ (Us): Se L(L,0)} 


such that cs, — cs, = Cs,,5, on Us, 1 Us, [133, Prop. 2.11.1]. Then, 


a +E 7 
e' 2°51 = @1 251.5 et 2 “Sp (12.6.26) 


and thus the local sections e!2°5 in My, xz U(1) combine to a global non-vanishing 
section. As a consequence, M;, xz U(1), and hence the Maslov line bundle ., is 
trivial. The triviality of the complex vector bundle 4, does, however, not imply the 
existence of a single real-valued generating function for L. 


Remark 12.6.14 Let {S; :i € I} be a countable subset of &(L, 1) such that the 
subsets U; = Us, cover L. The covering {U;}, together with the family of transition 
mappings cjj = Cs,,s; Of the corresponding system of local trivializations of the 
Maslov principal bundle M_, defines a 1-cocycle on L with values in Z and thus 
an element fi; of the first integer-valued Cech cohomology Hi} (L, Z) of L, cf. 
Remark 2.2.12/2. According to this remark, jz; uniquely characterizes the principal 
Z-bundle Mz, up to isomorphisms. Corollary 12.6.12 implies that by the natural 
homomorphism 


Hi (L, Z) > H'(L,R), 


(ty is mapped to the Maslov class jy. Let us add that the transition mappings of 
the corresponding system of local trivializations of the Maslov line bundle /, are 


given by e 12“, 


In the remainder of this section, we derive a local normal form for the caustic of 
a Lagrangian immersion in the simplest case. 
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Proposition 12.6.15 Let 1: L > T*Q be a Lagrangian immersion and let & € 
»|(L). Denote x9 = IT (&). Assume that 


t' Tey (21(L)) N Tye) (Ty, Q) = {0}. (12.6.27) 


Then, there exist smooth functions f and g on a neighbourhood U of xo in Q with 
g’ £0 such that [(L) OU coincides with the caustic (Bs) of the Lagrangian 
immersion As generated by the Morse family 


S:UxROR, S(x,y) = f@)+g(x)y— ay. (12.6.28) 


Clearly, for the normal form (12.6.28), we have 
Bs = {(x, yyEUXR: g(x) = ae I'(Bs) = {x EU: g(x) = 0}. (12.6.29) 


Proof The proof is along the lines of the proof of Proposition 6.1 in Chap. VII 
of [115]. Let (B, 2, S) be some Morse family generating 1: L > T*Q at &. By 
Lemma 12.5.1 and by the Splitting Lemma 12.5.2, we may assume that B is an 
open subset of Q x R. Let yo € R such that (xo, yo) € Bs and As(xo, yo) = &p. In 
a first step, we show that there exist smooth functions f,g: Q > Randy: R>R 
such that for (x, y) € Bs we have 


1 
S(x,y) = f(x) + 8@)x@, y) — 3X, oe (x(x, yy = g(x), (12.6.30) 


where g/(x) #0 for all x and 9X (y) # 0 for all y. We will give the argument for 
the case dim Q = 1. The general case can be reduced to this case, see below. By 
a constant shift of the fibre coordinate y, we may achieve that yo = 0. Choosing 
an appropriate coordinate x on Q we may also assume that x9 = 0. Then, since 
As(0, 0) = & belongs to (L), we have S Syy (0, 0) = 0 and hence ay (0,0) £0. 
Thus, by the Implicit Function Theorem, the equation 8S (x, y) =0 can be solved 
for x and hence Bs is given by a smooth function y +> x(y). Then, (12.4.15) implies 


2 2 


S a 
ayo =O); y)x'(y) +S 55 


is 
= (x(y),y) =0 (12.6.31) 


and hence x’(0) = 0. Differentiating (12.6.31) once again, we obtain 


= a> 
(0, 0)x” (0) + Pas 0) =0. (12.6.32) 
ox dy? 


Due to the assumption (12.6.27), S can be chosen so that £3 FSO, 0) 4 0. The proof 
of this statement is left to the reader, see Exercise 12.6.4. Then, (12.6.32) implies 
that x”(0) 4 0. Therefore, the Taylor expansion of x(y) at y = 0 starts with the 
second order term. By replacing the coordinate x on Q by the coordinate —x if 
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necessary, we may assume that the corresponding coefficient is positive and hence 
that x(y) > 0 in some neighbourhood of y = 0. Then, by taking the positive square 
root of x(y) for y > 0 and the negative square root of x(y) for y < 0, we obtain a 
smooth function y +> A(y) satisfying x(y) = re a and A’(0) > 0. It follows that 
the mapping (x, y)  (x,A(y)) is a fibre-preserving diffeomorphism of B which 
transforms S in such a way that Bs is given by x = y?. 

Now, consider the function y 5(S(y?, y) + S(y?, —y)). Since it is even, a 


rgl7 


lemma of Whitney’s’’ yields that there exists a smooth function f on R such that 


f(x?) = 5(S(v”. ») + S(y’, -y)). 


Next, consider the function 
3 
W(y) = 7(S(y*, ¥) — SQ", -y))- 


A brief computation shows that (0) = w’(0) = Ww” (0) = 0 and w’” (0) $ 0. Hence, 
the Taylor expansion of 7 at 0 starts with the third order term, so that by taking the 
third root of w(y) we obtain a unique smooth function x on a neighbourhood of 
zero in R such that x(y)* = w(y). Since w is an odd function, so is x. Hence, x 
is an even function, so that by the above lemma of Whitney’s there exists a smooth 
function g on R such that g(y*) = x (y). Then, 


g(y")xQ) = +(5(9?, y) — S(y*, -y)). 


A brief calculation shows that the functions f, g and x so constructed satisfy 
(12.6.30), indeed. Moreover, since (x/(0))? = A) #0 and g/(0) = (x'(0))’, 
we have x’(y) £0 and hence g/(x) £0 in some neighbourhood of x = 0. 

Finally, in the case dim Q > 1, since S Sey ' (0,0) 4 0, we can choose coordinates 
x! in a neighbourhood of xo in Q such that 


Si 1 £0. 


Then, we can carry out the argument for dim Q = 1, thereby treating the variables 
x*,...,x” as parameters. Due to the parameterized version of the lemma of Whit- 
ney’s cited above, and since in each step of the above construction of the functions 
f(x!), g(x!) and x (y), the smooth dependence on the parameters x”, ..., x” is pre- 
served, these functions depend smoothly on the parameters and thus yield smooth 
functions on Q and Q x R, respectively. This completes the first step of the proof 


of Proposition 12.6.15. 


‘TT h is an even smooth function on R, there exists a unique smooth function f on R such that 
h(y) = h(y?). If h depends smoothly on parameters, then so does h. 
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In the second step, we define a function S$ in a neighbourhood of (x9, yo) in B by 
the right hand side of the first equation in (12.6.30). This function fulfils 


as _ 5 Ox 
ao = (g(x) — x(x, y) es, 


Since g(x) = x(x, y)* for all (x, y) € Bs and since in the coordinates x' on Q 
used above, a # 0 and ox = 0 everywhere, we conclude that Bix, y) = 0 iff 


(x, y) € Bs. Thus, the singular subset of S coincides with Bs. Moreover, choosing 

as 
dyax! 
izing Bs by the coordinates y, x?,...,x", from (12.4.15) we conclude that both the 


coordinates x! on Q in a neighbourhood of xo such that # 0 and parameter- 


singular points of S in Bs, defined by xs (x, y) = 0, and the singular points of Sin 


Bs, defined by “ (x, y) =0, are characterized in these coordinates by the equation 


A) 1 
Gy Oo eon) =O. 


Thus, S has the same singular subset as S and hence it generates the caustic ’(L) in 
a neighbourhood of xo in Q. Finally, we apply the fibre-preserving diffeomorphism 
(x,y) > (x, x(x, y)) transforming S to the Morse family (12.6.28). This finishes 
the proof. 


For an application of this proposition in physics, we refer to Example 12.8.11. 


Remark 12.6.16 (Caustics and Catastrophe Theory) We put the above proposition 
in the perspective of a typology of singularities of Lagrangian immersions. Recall 
that, by Theorem 12.5.5, germs of Lagrangian immersions at a point € € T* Q are in 
one-to-one correspondence with stable equivalence classes of Morse families at &. 
This correspondence can be carried over to the following situation. 

On the one hand, two Lagrangian immersions 1: L > T*Q and?: L—>T*Qare 
said to be equivalent if there exist diffeomorphisms A: L > Land w:Q-— QO and 
a symplectomorphism g : T*Q — T*@Q such that 


tok=Qol, pomg = 09. 


This induces an equivalence relation for germs of Lagrangian immersions in an 
obvious way. Caustics of equivalent Lagrangian immersions are mapped diffeomor- 
phically onto one another. On the other hand, the concept of stable equivalence of 
Morse families at some point € € T*Q discussed in Sect. 12.5 can be generalized to 
stable equivalence of arbitrary Morse families by extending the operation of com- 
position to include arbitrary fibre-preserving diffeomorphisms T*Q — T*Q, pro- 
jecting to diffeomorphisms Q > O and allowing for arbitrary smooth functions on 
Q in the operation of addition. For reduced Morse families, the equivalence relation 
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Table 12.1 Normal forms 


for Morse families over 0 Name Normal form 

with dim Q <4 
Fold y3 +xy 
Cusp ty4 + xy? + xy 
Swallowtail p+xy +xoy?+x3y 
Butterfly ty + x, y4 + xy3 +x3y2 + xay 
Hyperbolic umbilic yy; y3 x1V1V2 $x] £32 


2 ‘ 

Xi Vy + yy + X2Y1 + X32 
ae) 

Xi Vy + X23 + XB + X4y2 


Elliptic umbilic Yiy —y 


Parabolic umbilic yy y3 ty 


reads 
S=Sogt+x, 


with gy: T*Q > T* O being a fibre-preserving diffeomorphism and x : Q > R be- 
ing a smooth function. Then, equivalence classes of germs of Lagrangian immer- 
sions are in bijective correspondence with stable equivalence classes (in the above 
generalized sense) of Morse families, see [14-16] and [19, §1.3]. Thus, the classifi- 
cation of Lagrangian immersions and their singularities (singular subsets) is reduced 
to the classification of Morse families, for which methods of general singularity the- 
ory can be applied, see [115]. In this context, the generating Morse family is referred 
to as an unfolding of the singularity. First, one analyzes the Taylor series to obtain 
normal forms similar to (12.6.28), but containing error terms which for x = 0 van- 
ish to arbitrary order. Next, one shows that up to stable equivalence the error terms 
can be omitted. Thus, in particular, the normal form of Proposition 12.6.15 is sta- 
bly equivalent, in the generalized sense, to the original Morse family. This way, 
one can classify, for example, the stable!® Morse families over Q with dim Q <4, 
see Table 12.1. This yields the famous Thom catastrophes. We see that the catas- 
trophe described by Proposition 12.6.15 is a fold. The list in Table 12.1 is a clas- 
sical result of singularity theory. This theory, which is sometimes also referred to 
as catastrophe theory, was developed by Whitney [313], Thom [287, 288], Mather 
[200-204], Boardman [49] and Arnold [14-16], see also [20, 24, 25, 110, 115]. 
Generally speaking, in this theory one studies the singularity structure of a smooth 
mapping w: M — Q between manifolds. In the first step, in complete analogy to 
the singular subset 2'(L) of IT, one defines the singular set 


Si(w):= {me M:dimkery), =i}, i=0,...,dimM, 
of w. If X;() is a submanifold of M, one can define 


XijW) = Zi Vin), OSs. 


'8 Morse family over Q is stable if it is an inner point of its stable equivalence class with respect 
to a certain C®-topology on the space of all Morse families over Q. 
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Again, if this is a submanifold, one can go on with defining 2; ;,,(yw) and so on. 
These subsets are called the Thom-Boardman singularities.!? As a result, one ob- 
tains a partition of M into a family of locally closed submanifolds with the prop- 
erty that the restriction of y to each component has maximal rank. In some special 
cases, the Thom-Boardman singularities yield a complete classification of generic 
mappings. The Thom catastrophes listed above are of this type. The fold, the cusp, 
the swallowtail and the butterfly correspond to, respectively, the Thom-Boardman 
singularities 119, 41,1,0, 21,1,1,0 and &),1,1,1, and the umbilic catastrophes are of 
the type 2/2 9. In the case under consideration, there are no further Thom-Boardman 
singularities, because all other singularities have a codimension greater than 4. 


Exercises 

12.6.1 Show that the figure eight immersion of Example 12.6.8/2 has vanishing 
Maslov index. 

12.6.2 Show that the function S(x, y;, y2) = —iy} — 5Y3 +xy,+(1—-x)y» defines 
a Morse family. Determine the induced Lagrangian immersion, its singular 
subset and its caustic. 

12.6.3 Find a system of generating Morse families for the unit circle in R’, cf. 
Example 12.6.13/1. 

12.6.4 Complete the proof of Proposition 12.6.15 by showing that under the 
assumption (12.6.27), the generating family S can be chosen so that 


(0,0) £0 

ays”? ; 

Hint. Study the kernel ker((/7’) pry, (z)) in an analogous way as in the proof 
of Lemma 12.5.1. 


12.7 Geometric Asymptotics. The Eikonal Equation 


In this section we apply the method of characteristics and the concept of Morse 
families to the equation of geometric optics, the so-called eikonal equation. We 
restrict our attention to the case OQ = R”. 

Geometric optics rests on the assumption that the wavelength A of light is small 
compared with the typical length scale L of the optical system under consideration. 
Under this assumption, the wave character of light remains hidden and one may 
imagine light as a flow of particles (light rays). 

For simplicity, we consider the scalar wave equation”? on Q x R: 


n>(x) 07 
(“SS - A Jus, 1) =0, (12.7.1) 


Cc 


!9While one can show that for almost all functions f these subsets are submanifolds indeed 
(287, 288], the Thom-Boardman singularities can also be defined in the general case by using 
jet techniques as developed by Boardman. 


20Thus, in particular, we ignore polarization phenomena. 
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—iwt 
’ 


where n(x) is the local refractive index. Making the ansatz u(t, x) = u(x)e we 
obtain the associated Helmholtz equation, 

1 2 

patn (x) Ju(x) = 0, (12.7.2) 


where k = ©. For length scales r fulfilling 4 <r < L, one can study this equation 
in the framework of short wave analysis. The starting point of this procedure is the 
ansatz 


u(x, k) = a(x, ke*5S, (12.7.3) 


The function S is called the eikonal function. It has the following physical interpre- 
tation. 


(a) The equations S(x) = c describe surfaces of constant phase, called wave fronts. 

(b) By expanding S(x) = S(xo) + (k — xo) - VS(Ko) +--+ at a given point xo, we 
see that, close to xo, (12.7.3) can be approximated by a plane wave with wave 
vector 


k(xo) = kn(xo), n(Xo) := VS(xo). 


The vector-valued functions k and n are referred to as the local wave vector and 
the local refractive index vector, respectively. They are orthogonal to the wave 
fronts and we have n(x)? = n?(x) for all x. 


Inserting the ansatz (12.7.3) into the Helmholtz equation, we obtain 
! 2i ia 24 2, \ aikS 
poe ee po ee +néa je" =0. (12.7.4) 


While the amplitude a, as a function of x, varies on the length scale of the optical 
system, the eikonal function S varies on the length scale of the wave length A = oT 


Therefore, it makes sense to expand a(x, k) in powers of 1 


a(x, k) = ao(x) + a(x) + ee) : (12.7.5) 


Plugging in this expansion into (12.7.4) and comparing coefficients, we obtain the 
eikonal equation 
(VS)? =n? (12.7.6) 


in zeroth order of i and the transport equation 
VS-Vinaj + AS=0 (12.7.7) 


in first order of i The eikonal equation is the Hamilton-Jacobi equation for the 
Hamiltonian function 


H (x, p) =p? —n*(x) (12.7.8) 
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on T* Q = R” x R", that is, for a particle with mass 5 moving in the potential n(x). 
It determines the eikonal function S. By plugging in the solution into the transport 
equation, the latter becomes an equation in the indeterminate ag. Let us discuss the 
physical meaning of this equation. By a standard calculation, one can derive the 
following formulae for the energy density p and the energy current density”! P of 
the scalar field u: 


esa (es “EV (t,x))” 
x)= -=(—| —G@,x ,X ’ 
pues 2\c2\ at’ o 

ou 

P(t, x) = —— (t, x) Vu(t, x). 

ot 

Taking the time average, denoted by (-), in leading order of i — 0 we find 
1 4,22 1 22 
(p) = <n°k*ao, (P) = al aoVS. (12.7.9) 


2 


Thus, the energy flows in the direction of the vector field VS, that is, in the direction 


of the local wave vector k. Since by the eikonal equation, YS is a unit vector, we 


n 
have 


This means that the velocity of the energy flow is given by the local phase velocity 
«. Moreover, taking the divergence of (P) in (12.7.9), we find that the transport 
equation (12.7.7) describes energy conservation. From this discussion, we conclude 
that we may view light rays as flow lines of the vector field V S.”? This interpretation 
constitutes the basis for the discussion below. 


Remark 12.7.1 More generally, in the same spirit one can study the asymptotic 
behaviour for k — oo of partial differential equations of the type 


a(x —7¥)u( 0, (12.7.10) 


where 
H:T’Q=R'’xR"-R 


is a smooth function which is polynomial in the fibre variables (the momenta), 


H(x, p) =h(x) + Ohi... Pir --- Pi, 
r 


71 Built in analogy to the energy density and of the Poynting vector in Maxwell electrodynamics. 


2 More precisely, one may view light rays as wave packets, whose width in the direction transversal 
to the energy current vector is negligible. That such wave packets can be prepared follows from 
the uncertainty relation for the Fourier transform in the short wave approximation. 
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This function is referred to as the symbol of the differential operator given by 
(12.7.10). In the special case of the Helmholtz equation, the symbol is given by 
(12.7.8). Making the ansatz (12.7.3) and expanding the amplitude a according to 
(12.7.5), from (12.7.10) we obtain 


0 =H (x, VS(x))ao(x) 


i ( ao(x) a°H 25 dH San 
el F apap) VS(x)) arpa) + 5%, VS) 


*) 
+ Fite: VS(x))ay (x) + o(z). 
k k 
Comparison of coefficients yields the characteristic equation 
H(x, VS(x)) =0 (12.7.11) 
in zeroth order of i and the transport equation 


dH a 1 0H 
(= ee YO@) ge ax! San VS(& 5 are Fer) )ao(s) =0 (12,7,12) 


in first order of ie Equation (12.7.11) is the Hamilton-Jacobi equation for the Hamil- 
tonian function H. In the special case of the Helmholtz equation, (12.7.11) repro- 
duces the eikonal equation (12.7.6) and (12.7.12) reproduces the transport equa- 
tion (12.7.7). Note that all of this carries over to an arbitrary Riemannian manifold 
(Q, g). For example, in this case the eikonal equation reads 


g(VS, VS) =n? 
and the corresponding Hamiltonian function H : T*Q —> R is given by 
H) =e 1,8) — n(x). (12.7.13) 


Now, let us study the eikonal equation (12.7.6). For the sake of clarity, we restrict 
our attention to the vacuum case n = 1, 


(VS)? =1. (12.7.14) 
As noted above, this is the Hamilton-Jacobi equation for the Hamiltonian function 
H(x,p)=p’— 1, 


which up to a constant energy shift models a free particle of mass 5 To solve this 
equation, we apply the method of characteristics, cf. Sect. 12.2 and in particular 
Theorem 12.2.1. Let 


€=H~'0)cT*O, 
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Fig. 12.3 Light rays as 
projections of characteristics 


toQ be SCTO 


shadow 


Two light rays One light ray 


let D C Q be an oriented embedded submanifold of dimension r <n and let So be 
a smooth function on D. Assume that the canonical lift (D, So) of (D, So), defined 
in Example 8.3.8/4, is transversal to @ and that the intersection .% = (D, So) NG 
is transversal in @ to the integral curves of X 7, so that .% is an admissible initial 
condition. The Hamiltonian vector field X y is given by 


XH(X, p) = 2p;0,i, (12.7.15) 
and the Hamilton equations read 
p=0, x= 2p. (12.7.16) 
The integral curves, and thus the characteristics of @, are given by 
P(t) = Po, x(t) = Xo + 2pof, (12.7.17) 


where (Xo, po) € -“. Thus, the method of characteristics yields the generalized so- 
lution .: .Y — T*Q, where 


S=S)xR, L(X0, Po, t) = (Xo + fpo, Po). (12.7.18) 


The characteristics project to straight lines on Q. According to the above discussion, 
these straight lines can be interpreted as light rays. Thus, the generalized solution 
(Y,t) has the following interpretation: D is a source emitting light rays with the 
prescribed phase So. The image /7(.”) is the region of Q which is illuminated, 
whereas its complement Q \ J7(-7) is the region which stays in the shadow. 

The more times a fibre of T* Q intersects .”, the more light rays run through its 
base point and, therefore, the brighter this point appears, see Fig. 12.3. 


Remark 12.7.2 The observation that the projections of the characteristics to Q are 
straight lines generalizes to the case of an arbitrary Riemannian manifold (Q, g). 
Since the eikonal equation is the Hamilton-Jacobi equation of the Hamiltonian func- 
tion (12.7.13), according to Example 9.2.1, the projections of the characteristics to 
Q are geodesics of the metric n~g. 
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First, let us discuss in detail the case where the initial phase is Sp = 0, that is, 
where D is a surface of constant phase. In this case, the canonical lift of (D, So) 
coincides with the conormal bundle D of the submanifold D, cf. Example 8.3.8/3. 


Lemma 12.7.3 For every embedded submanifold D of Q of dimension r <n, the 
canonical lift D is transversal to € and A= DO @ is an admissible initial con- 
dition for ©. 


Proof In the following, leta =1,...,r anda =r+l1,...,n. Since D is embedded, 
we can find coordinates x! on Q such that D is locally given by x* = 0. Accord- 
ing to Example 8.3.8/3, in the bundle coordinates x’ and p; induced on T*Q, the 
canonical lift D is given by 


x* =0, Pa =0. (12.7.19) 


Correspondingly, in the fibre coordinates x! and p; induced on T(T*Q), for given 
(x, p) € D, the tangent space T(x p) D is given by 


x* =0, Pa =0. (12.7.20) 


Now, let (x, p) € DO @. In the coordinates x‘, the defining relation for @ reads 
gl (x) pip j = 1, with g'/ representing the Euclidean metric. Equation (12.7.19) im- 
plies g*8 (x) py Pp = 1. Thus, we can find a such that gob (x) pp # 0. Then, the 
tangent vector defined by x = 0 and pj = dja, lies in Tix, p)D and is transversal 
to T(x,p)@. This shows that D and @ are transversal, so that .% is an embedded 
submanifold of T* Q and hence of @. To prove that .Y is transversal in @ to the in- 
tegral curves of X 7, for dimensional reasons it suffices to show that X q is nowhere 
tangent to .“9. Thus, assume that X 7 (x, p) € Ta.p)D for some (x, p) € -%. Then, 
(12.7.15) and the first equation in (12.7.20) imply that py = 0 for all a, in contra- 
diction to g* (x) py Pp=l. 


Example 12.7.4 Let Q = IR? and let D be the half circle 
D= {xe R?: xf +23 = R?, x1 < 0}. 
Parameterizing D by y+ (Rceos(y), Rsin(y)) with a <y< Se and using the 


parameter y as a global coordinate on D, we find 


a : a 3m 
D= {(Reos(, Rsin(y), p, ptan(y)) : Cian A aa RI 


for the canonical lift and .%) = oy UA with 


4 : . a 32 
Sy = {(2 cos(y), Rsin(y), Ecos(y), + sin(y)) : > <y< =| 
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for the initial condition .“9. Hence, by (12.7.18), % decomposes into the connected 
components .”~ =.) x R and 


my,H= ((R + 2r) cos(y), (R + 2r) sin(y), £cos(y), sin(y)). 


We see that all the light rays run through the origin, whereas through the other 
points, there run exactly two light rays in opposite directions, one from ag and 
one from .7) . The equation for the singular subset ¥(.7) is 

d(T1, 2) 


det ————— = 2(R + 2t) = 0, 
d(y,t) 


where JT = (JT, 12): Y > Q, cf. (12.5.3). Therefore, 


US) = {(0.0, cost, sin(y)) T9:5 =e 5} 
r(S) ={,0) € Q}. 


Thus, the caustic degenerates to a single focal point. 


Example 12.7.5 Let Q =R* and D = {x € R?: (x2)? = x1}. Let us use y = x2 asa 
coordinate on D. Then, the canonical lift D is given by 


D={(v",», p,—2py):y, PER} 


and the initial condition .“ consists of the connected components 


n +1 2 
Fi ={(y.9. ; Ea ):ver}. 
V1+4y? J/1+4y? 


Consequently, 7 consists of the connected components .“~ = 7) x IR and u is 
given by 
ee, t 2ty =| #2y 
Ve G + we ; (7 5) 
Vl+4y? Jl +4y? J/1+4y? V1 +4y? 


It is easy to see that c is injective. Hence, (.%, 1) is in fact a geometric solution. 
The singular subset 3’(.%) and the caustic "(.%) will be discussed later in Exam- 
ple 12.7.8, where we solve the same initial value problem by means of a Morse 
family. 


Next, we find a Morse family generating the generalized solution given by 
(12.7.18). This provides an alternative method to solve (12.7.14). We still limit our 
attention to the case of a constant initial phase So = 0. In what follows, points on 
D will be denoted by &. We choose coordinates y!,..., y” on D and define local 
vector fields on D by 
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ax 


ay (12.7.22) 
ay 


Cy (x) = 
These vector fields form a local frame in TD, viewed as a subset of TQ = R2", 


Proposition 12.7.6 Let D be an embedded submanifold of Q = R" of dimension 
r <nand let B := {(x,x) € Q x D:x #X}. The distance function 


S:B>R, — S(x,)=|Ix—Sl, (12.7.23) 


defines a Morse family along the submersion B — Q induced from the natural pro- 
jection. This Morse family generates a generalized solution of the eikonal equation 
(12.7.14) for the initial condition So = 0 on D, given by 
i ‘ 4 x—X 
Bs ={(x,%) € B: (x—X) LT;D}, As(x,X) = (x kcal } (12.7.24) 
x—xX 


Proof To see that S defines a Morse family, we have to show that the matrix 


(S”,, S”.) has rank r on the fibre-critical submanifold By. For (x, X) € B, we calcu- 
xy? Pyy 
late 
dS xi — Si 
oP Xj a =: pi, (12.7.25) 
dx’ |x — x|| 
as 
© apes (12.7.26) 
a75 ‘ _— a 
| 7 (p Ca) Pi Cai (12.7.27) 
ax! dy% |x — x|| 
a25 e,-es — -e -e de 
_ &a eg — €a)(P p) deg (12.7.28) 
ay*dye IIx — x ae 


and read off that on Bs we have p- eg = 0 and hence 


a’ oe qi (X) 
Ax! dye ~? 


IIx — XI 


Since the vectors eg (X) are linearly independent, this matrix has rank r. Hence, S is a 
Morse family, indeed. Moreover, from (12.7.25) and (12.7.26) we read off (12.7.24). 
Since As takes values in @, (Bs, As) is a generalized solution of (12.7.14). 


Remark 12.7.7 


1. We determine the singular subset and the caustic of (Bs, As). By (12.6.14) and 
(12.7.28), the singular subset (Bs) consists of the points (x, x) € Bs fulfilling 


det(eo -eg(X) — (x—X)- i) =0. (12.7.29) 
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Hence, the caustic [\(Bs) consists of the points x € Q for which there exists 
x € D such that (x, X) belongs to Bs and fulfils (12.7.29). 

Equation (12.7.29) can be formulated intrinsically in terms of the metric hyg 
on D and the exterior curvature kyg of D, both induced by the Euclidean metric 
on Q. For the necessary notions from Riemannian geometry, we refer to the 
standard literature, e.g. [6]. Let I ae denote the Christoffel symbols of the Levi- 
Civita connection on D associated with hgg. Then, one has 


de 
€y es = hop, aye = Ra + Taper 
and hence (12.7.29) becomes 
det(hyg (&) — (k— &) - Kgg (&)) =0. (12.7.30) 


2. In the special case of an oriented surface D C Q = R?3, we have 
kop = kapMp, 
with np denoting the unit vector field orthogonal to D. Here, (12.7.30) yields 
det(||x — Xl kag (&) — hag (x) =0. (12.7.31) 


This is the characteristic equation for the principal radii of curvature of the initial 
surface D. Therefore, in this case the caustic coincides with the set of centres of 
curvature of D. Thus, no caustic occurs iff D is a plane. 

3. Obviously, S(x, x) = —||x — X|| is a generating family, too. It has the same do- 
main as the Morse family (12.7.23) and describes incoming rays. It is possible to 
describe incoming and outgoing rays by the help of a single Morse family: let 


B= {(x,%,a)eQx Dx 8S"! :x¢8} 


and define 
S:B—-R, S(x, X, a) = (k—X)-a. (12.7.32) 


We leave it to the reader to check that this is a Morse family, indeed (Exer- 
cise 12.7.7). Let u® and % be coordinates on D and S”~!, respectively. We 
calculate 


da 
aot 


as n - as i . 
ae ee, Fe a a elo 


Thus, Bs is defined by the relations a | TgD and (x — x) | T,S” | The latter 
one requires a to be parallel or antiparallel to x — x. Hence, 


Bs = { (x, X, a) €B:(x—x) LTgD and x — X= +||x — xlla}. 
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Fig. 12.4 Values of the x2 
discriminant A(x) 


A(x) <0 
(1 real solution) 
A(x) >0 


(3 real solutions) 


A(x) =0 


(2 real solutions) 


Since VS = a, the induced Lagrangian immersion is given by 


, x-—x 
As (x, X, a) = (x, a) = (« 17) ’ 
Ix — || 
where the sign is positive if x — X and a are parallel and negative if they are an- 
tiparallel. This reproduces the Lagrangian immersions of the generating families 
S(x, X) = +||x — X||, because the latter are obtained from (12.7.32) by restriction 
to 


x—X 


a= x=——.. 
Ix — x| 


Let us add that if D is given as the zero level set of a smooth function F : Q > R, 
the Morse family (12.7.32) is equivalent to the function 


S(x, X,a,A) = (x —X)-a+AF(X) (12.7.33) 
(Exercise 12.7.7). 


To illustrate the solution method provided by Proposition 12.7.6, we take up 
Example 12.7.5. 


Example 12.7.8 Let Q = R? and D = {x € R?: (x2)* = xj} and let us choose y = 
x2 as a coordinate on D. Then, 


Xy=(y7,y), eG) =(Qy, D. 


The Morse family of Proposition 12.7.6 is given by 


S(x, y) = |x — &() || = V(x - yy + (x2 — y)?, (12.7.34) 
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where we have identified B = {(x, y) € R?: x ¥ (y’, y)}. According to (12.7.24), 
the defining equation for the fibre-critical submanifold Bs reads 


: x : ere (12.7.35) 
i lw le wp 


and the induced Lagrangian immersion is given by 


As(x, y)= (x x2 iat a ) 
Ver — 2)? + 02 —y)? V/Or — y2)? + On — y? 


Equation (12.7.35) is solved by 


2 t 2yt 
a= ——— 2 p= 
V1+4y? V1+4y? 


with (y, t) € R* such that t 4 0. This yields a global parameterization of By: 


(12.7.36) 


:(y, He R*,t4 of. (12.7.37) 


7 {( ae t 2yt ) 
s= y »y »y 
Vv1+4y? V14+4y? 
In terms of this parameterization, As is given by 
As(y,t) 

( oo t 2yt sign(t) —2ysign(t) 
=\|y »y > ’ 
Vit+4y2 V1+4y? J1+4y? V1+4y? 


It is easy to see that Ag is injective, so that (Bs, Ags) is a Lagrangian submanifold 
and hence a geometric solution whose image is As5(Bs). Next, we discuss the sin- 
gular subset (Bs) and the caustic (Bs). According to (12.7.29), the defining 
equation for 2 (Bs) is 


) (12,738) 


; 2r 
(Ay? eh) 


J1+4y2 
In particular, on 2'(Bs) we have t > 0. Solving for ¢ and plugging this into 
(12.7.37), we obtain 


1 
2d (Bs) = {(5 +37. 9°.) E as:yeR}, 


rBs)=|(5+39% 4°) O:y R}. 


We read off that »’(Bs) is a submanifold of Bs of dimension one, whereas I"(Bs) 
is a subset containing a singular point, see Fig. 12.5. Finally, we count the number 
of intersection points of (Bs, As) with the fibre of T*Q over a point x € Q. This 
can be determined from (12.7.35) by viewing the left hand side as a polynomial in 
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y 


T'(Bs) 


x1 


D 


Fig. 12.5 The Lagrangian submanifold (Bs, As) of Example 12.7.8 for t > 0. Note that .%9 does 
not belong to (Bs, As) 


the variable y and counting the real zeros. Their number can be read off from the 
values of the discriminant 


i\> 37 
ax) =4(x - 5) —— x3, 


depicted in Fig. 12.4. For A(x) > 0 this polynomial has three real solutions and 
for A(x) < 0 it has one real solution and this solution has multiplicity one. For 
A(x) = 0, the polynomial has two real solutions, one of them with multiplicity two, 
or one real solution with multiplicity three. By expressing ["(Bs) in terms of x, 
we see that the zero set of A(x) coincides with the caustic and that the point x 
where the polynomial has one real solution with multiplicity three coincides with 
the singular point. Correspondingly, the fibre over x intersects A5(Bs) three times 
transversally for A(x) > 0, once transversally for A(x) > 0, once non-transversally 
over the singular point, and once transversally and once non-transversally over the 
remaining points of the caustic, see Fig. 12.5. It is clear that both for A(x) < 0 and 
for A(x) > 0 one can obtain analytic solutions from the geometric solution (Bs, As) 
by solving (12.7.35) for y. 


Remark 12.7.9 


1. The fact that, here, the singular subset 2'(Bs) is in fact a submanifold is no 
accident. By Proposition 12.6.3, the complement »(Bs) \ 2(Bs) consists of 
submanifolds of codimension greater than 2. Hence, for dimensional reasons, 
»'(Bs) = %\(Bs). From the point of view of catastrophe theory, the point 
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G; 0,0)em7! (G, 0)) is distinguished. The reader easily checks that this point 
is the only one where the third partial derivative S\,,, vanishes. This means that 
apart from this point one can find a normal form of fold type for S, cf. Proposi- 


tion 12.6.15. If one includes this point, the normal form is of the cusp type 
S(x, y)=y4 + x1y" +229, 


which in the Thom-Boardman classification scheme is labelled by 11,0, cf. 
Remark 12.6.16. 

. The parameter ¢ in the parameterization (12.7.37) of Bs is closely related to the 
flow parameter of the generalized solution 1: .% — T* Q obtained by the method 
of characteristics in Example 12.7.5. Comparison of the immersions (12.7.21) 
and (12.7.38) shows that for t > 0, we have As(y, t) =" (y, t), whereas for t < 
0, we have As(y,t) =v (y, —t). Hence, in the dynamical interpretation of the 
method of characteristics as a Hamiltonian flow with the parameter rf representing 
time, the solution given by (Bs, As) corresponds to integral curves emanating 
from Bae and from .7 , in both cases evolving for time rt > 0. In particular, 
the parameter f in (12.7.38) can be interpreted as the time in case t > 0 and as 
the negative of the time in case t < 0. This makes transparent what is meant by 
saying that the Morse family (12.7.23) models outgoing light rays. We leave it 
to the reader to carry out the analogous analysis for the Morse family S(x, x) = 
—||x—x||. As a result, the Lagrangian immersions of both families together make 
up the generalized solution (.”, 4) obtained by the method of characteristics. 


The simpler example 12.7.4 can be analyzed in the same spirit. We leave this 


as an exercise to the reader (Exercise 12.7.8). Instead, we now turn to the case 
where the initial phase Sp is arbitrary but the initial submanifold D has codimension 
one in Q. As before, we will first discuss the method of characteristics. Under the 
assumption of codimension one, we can make the ansatz that the initial condition 
-Yo is the image of D under a smooth mapping a: D — T*Q. It is evident that this 
mapping must satisfy the conditions 


1 
2 
3 
4 


. woa=idp, 
. (a(x), X) = (dSo, X) for all x € D, X € TxD, 
. Hoa=0, 


. @ is transversal to the integral curves of X7. 


Example 12.7.10 Let Q = R* and D = {x € R*: x; = 0} and let Sp: D> Rbea 
smooth function. We use y = x2 as acoordinate on D. To find the mapping a: D > 
T* Q, we make the ansatz 


a(y) = (0, y, g(y), So), 


which fulfils conditions | and 2 by construction. Condition 3 yields 


HOY +e0)/=1 
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for all y € R. Therefore, Sp must satisfy |S5(y)| < 1 for all y. Under this assumption, 


g(y) =+,/1— (Sp())?. Finally, to meet condition 4, we must require g(y) 4 0 and 


hence |S6(y)| < 1 for all y. Thus, .% consists of the connected components 


St = {(0. y, t¥1= (850), So(y)) |. (12.7.39) 


SF consists of the connected components .7* = ./)- x IR and ¢ is given by 


+2t,/1 — Si(y)?, y+ 284(y)t, #y/ 1 — Sh0)?, Si(9)). (12.7.40) 


The defining equation det tian = 0 for the singular subset &'(.7) is equivalent 


to 


yas 


MY, H)= 


1+ 2586 (y)t — Sp2(y) =0. (12.7.41) 
We discuss two special cases in detail. 


1. Let So(y) = ay with |a| < 1. Here, 


H(y, 1) = (42v1— at, y + at, V1 — a2, a). 


The signs correspond to light travelling in the positive or negative x-direction. 
The images :+(./*) are 2-dimensional hyperplanes in T* Q = R*. Since So = 9, 
Eq. (12.7.41) for the singular subset implies S? 0) = 1, which contradicts the 
transversality requirement So (y)| < 1. Hence, in this case there is no caustic. 

2. Let Sj(y) 40 for all y and restrict attention to Y +. In this case, the caustic is 
given by 


_ 92) ry o2 
r={( ( oy eo: Rt. (12.7.42) 
0 0 


We show that one can choose So so that the caustic degenerates to a single focal 
point on the x1-axis, 


r¢7)={(f,9)} 


for some f € R. According to (12.7.42), this leads to the differential equation 


/ 
So 


which up to an irrelevant additive constant has the solution 


So) =4y f? +y?. 


SIX 
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For the positive sign, f must be negative. In this case, Sg models an ideal concave 
lense. For the negative sign, f must be positive and So models an ideal convex 
lense. 


Next, we determine the eikonal function S. From (12.7.40) we read off that in 
terms of the coordinates y, r, the differential dS is given by 


dS(y,t) = pil(y, tdx1(y, t) + poy, t)dxa(y, t) 
= J! = Sy(vy-d(2t/1 = S40”) + Sh(y)d(y + 284(y)2) 


= 2dt + Si(y)dy. 


The initial condition S;p = So yields S(y, 0) = So(y) and hence 
S(y, t) = 2t + So(y). (12.7.43) 


From this, we can obtain S as a function on Q by solving the equations 


xp =42/1-(Sh(y))7t, mr =y +2840) 


for y and ¢ and plugging in the solutions into (12.7.43). For the two specific initial 
condition from above, this yields 


S(x) =axx+V1—a?x, 


for the linearly increasing initial phase So(y) = ay and 


S(x) = V+ f)? +x3, 


for the model of an ideal concave lense So(y) = / f2 + y?. In the first case, the 
wave fronts are planes and in the second example they are spheres centred at the 
focal point. 


To conclude this section, we show that in the situation where D has codimension 
one, the generalized solution can be generated by a Morse family in much the same 
way as in the case of a constant initial phase So = 0. As before, we denote the 
elements of D by x, choose coordinates y* on D and define the vectors ey (x) by 
(12.7.22). 


Proposition 12.7.11 Let D be an embedded submanifold of Q of codimension one 
and let So be a smooth function on D. Assume that 


80 850 pop 


1, 12.7.44 
dy” ayh ( ) 
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where hag = €q - eg is the induced metric on D.> Let B = {(x,&) € Ox D:x £X}. 
Then, 


S:B>R, — S(x,%) = |x KI + So), (12.7.45) 


is a Morse family along the submersion x : B — Q induced from the natural pro- 
jection. This Morse family generates a generalized solution of the eikonal equa- 
tion (12.7.14) for the initial condition So on D, given by 


a x—X y ChY x x—X 
Bs = (x, x) € B: ———  - eg (x) = —_ (X) |, As (x, X) = | x, ——— ]. 
Ix — x|| dy” Ix — x|| 
Proof We proceed as in the proof of Proposition 12.7.6. The relevant partial deriva- 
tives are 
aS xj — 5; as aSo aS (Pp: ea) Pi — ai 


= =: pi, = —p-€g+—, => = : 
Me ay Pte dy Ox! dy |x — x|| 


axi |Ix—&l 


This yields the asserted formula for Bs. Moreover, it follows that the left hand side 
of (12.7.44), taken at X, coincides with the absolute square of the projection of 
p(x, &) to Tg D. Since p(x, %)* = 1, this implies that on Bs, we have p(x, &) ¢ TzD. 
It follows that the vector fields e, — (p- e€g)p are pointwise linearly independent on 
Bs: assume 


Dhalea — (Pp: eq)p) = (= inte —p: (= has ) P= 0. 


Since )°, Awa is tangent to D but p is not, this equation is only fulfilled if all A. 


vanish. We conclude that the n x (nm — 1)-matrix De has rank n — 1 on Bs and, 


therefore, that Sis a Morse family, indeed. That As takes values in @ is obvious. 


Remark 12.7.12 The discussion of alternative Morse families in Remark 12.7.7/3 
catries over to the present case. In particular, the function 


S(x, X) = —||x — Xl + So®) 


is a generating Morse family, too, and to (12.7.32) and (12.7.33) there correspond 
the extensions 


S(x, X, a) = (x — X) cat So(X), S(x, X, a, A) = (x = x) ‘at MF (&) 7 So(X), 
respectively. 


We refer to the book of Benenti [38] for a lot of additional material. There, the 
reader can find a systematic treatment of optical systems including sources, mirrors 


23This is the abstract counterpart of the condition ISO) < | found in Example 12.7.10. 
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and lenses in the language of Morse families. For instance, for a system consisting 
of a source U and a mirror V, the generating family is 


S(x, u,v) = ||x— vl] + |lv—ull, 


with u € U and ve V. Benenti uses the language of symplectic relations, which 
is well adapted to this sort of problems. To every component of the system there 
corresponds a symplectic relation and the description of the system as a whole is 
obtained by taking the composition of these relations. To every relation, there cor- 
responds a generating Morse family and the Morse family of the full system is the 
sum of them. 


Exercises 

12.7.1 Work out Examples 12.7.4 and 12.7.5. 

12.7.2 Analyze the coorientation of the singular subset X'(Bs) in Example 12.7.8 
by studying the sign of Say along a curve transversal to 3'(Bs). 

12.7.3 Show that, in the situation of Proposition 12.7.6, the function 


S:0xD—>R, S(x, %) = (x —%), 


is a Morse family. Determine the induced Lagrangian immersion, its singular 
subset and the caustic. Is the induced Lagrangian immersion a generalized 
(geometric) solution of the eikonal equation? 

12.7.4 Show that the function 


W:Oxs"?!>R, W(x,a)=a-x 


is a Morse family generating a generalized solution of the eikonal equa- 
tion (12.7.14). What initial condition does this function describe? 
12.7.5 Let xo € Q. Show that the function 


S:OxS"!>R, (x,a)  (x—xo)-a 


is a Morse family generating a generalized solution of the eikonal equa- 
tion (12.7.14) for the point source D = {xo}. What is the difference between 
S and the Morse family (12.7.23) in this case? 

12.7.6 For the case Q = R? and D having codimension 1, verify explicitly that 
Eq. (12.7.29) for the singular subset X'(Bs) of the Morse family (12.7.23) is 
equivalent to det JT’ = 0 by showing that for (x, x) € Bs, 


det IT’ (x, &) = |x — &|| det(S%, (x, %). (12.7.46) 


Hint. Parameterize Bs by x € D and t € R \ {0} as follows: 


e1(X) x e2(X) 


lle1 (x) x e2(X)|| 
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12.7.7 Show that (12.7.32) is a Morse family and that this Morse family is equiva- 
lent to the one defined by (12.7.33). 

12.7.8 Work out the construction of a generalized solution of the eikonal equa- 
tion for Example 12.7.4 by the Morse family method of Proposition 12.7.6. 
Show that the analytic solution (apart from the caustic) is given by S(x) = 
I[xl] — RI. 
Hint. Parameterize Bs by y and the time parameter of the solution obtained 
by the method of characteristics. 


12.8 Geometric Asymptotics. Beyond Lowest Order 


In this section we continue the discussion of the short wave asymptotics of partial 
differential equations of the type 


a(x, -79)uoo =0 (12.8.1) 


on Q = R", defined by a Hamiltonian function H on T* Q which restricts to a poly- 
nomial on each fibre. In the previous section we have discussed the characteristic 
equation 


H(x, VS(x)) =0. (12.8.2) 
Here, we study the corresponding transport equation 
& i oe a°s 
- (x, VS(x)) —— 
dx! 2 apjap; Ox! OxJ 


(> (x, VS(x)) 


{) ane =0 ~~ (12.8.3) 
ODi 


and construct first order asymptotic solutions of (12.8.1) from solutions of (12.8.2) 
and (12.8.3). The leading example will be the Helmholtz equation (12.7.2), with 
its transport equation given by (12.7.7).74 Let .% be an admissible initial condi- 
tion for the characteristic equation, given by the canonical lift of a submanifold D 
of Q of codimension 1| with an initial phase Sp on D. Let a: D > T*Q be the 
corresponding lifting mapping. As explained in Sect. 12.7, by the method of char- 
acteristics, “9 generates a generalized solution 1: .~” — T*Q of the characteristic 
equation (12.8.2), where .Y coincides with the intersection of .%) x R with the do- 
main of the Hamiltonian vector field Xq and where : is induced by the flow ® 
of Xx, 


t:PCAXROTO, t(&, t) = H, (E). (12.8.4) 


For a moment, let us assume that we are away from the caustic of (.%, 1), and 
let us restrict .Y in such a way that it coincides with the image of dS for an an- 
alytic solution S. Then, IJ = 7g 01 is a diffeomorphism onto some open subset 


4For a quite exhaustive discussion of this equation in theoretical optics, we refer to the book of 
Romer [253]. 
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of Q and (12.8.3) can be rewritten as follows. Since X y is tangent to (.%,1), by 
Proposition 2.7.16, it restricts to a vector field X q on .”. Define 


Yy:=11,Xn (12.8.5) 
and let @ denote the flow of Y77. Obviously, 


dH 
Yu(x) = ap, VS(x)) Oi. (12.8.6) 


By (12.8.6), the first term in the transport equation (12.8.3) can be written in terms 
of Yq as 


0H dao 
(x, VS(x)) = (YxHa0)(x) = Vao(x) - Ya (x). 
Opi ox! 
By computing (V - Y;)(x) we find that the second term in (12.8.3) has the form 
a (x, VS(x cag = (x) = (V- Yx)(x) — pu <a (x, VS(K)). (12.8.7) 
x)= x x, x 8. 
dpi Op; axidx/ = x ODi 


As a result, the transport equation (12.8.3) can be rewritten as 


1 1 a°H 

(V0) (&) - Yn (x) + 540) (V - YH) x) = 704) ia aD -(x, VS(x)). (12.8.8) 

Next, we are going to interpret this equation geometrically in terms of half den- 
sities on .Y. This will allow us to solve the transport equation globally. In the same 
spirit as for the characteristic equation, such a global solution will be referred to as a 
generalized solution of the transport equation. For simplicity, we limit our attention 
to the case where 

oi a (12.8.9) 
ax! dpi ~ 

This includes the Helmholtz equation and, more generally, the case where the op- 
erator H in (12.8.1) is Hermitian. Under this assumption, the transport equation is 
equivalent to 


V - (aoYu) =0. (12.8.10) 


Let v, = dx! A--- A dx” be the canonical volume form on QO =R". Then, I7*vy, 
defines a natural volume form on the submanifold o(.%), cf. Formula (12.6.1). By 
(4.1.24), for every vector field X and every smooth function f on Q, one has 


LxVn =(V-X)vn, df A (Xavi) = X (fn. (12.8.11) 
Using these two relations and the identity (4.1.27), we find 


(V (03 ¥4))vn = Lyay, Yn = Lyn (abn): 
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and by Proposition 3.3.3/5, we obtain 
Lyx (aVn) = IT") (Lx,,1T* (aovn))- 
Thus, the transport equation (12.8.10) is equivalent to 
Ly IT (avn) = 0. (12.8.12) 


This is an equation for ae formulated in terms of the n-form [7* (a5Vn) on Y. To 
obtain an equation for ao, we have to pass to half-densities on .”. For an exhaustive 
discussion of this concept we refer to the literature, e.g. [119].7> 


Lemma 12.8.1 We have 
* i * 1 x i 
Lyf (ao|Vn|2) = 11*{ Vao- Yu + 5a0V Yq )T*\vpl2. (12.8.13) 


Proof Using Proposition 3.3.3/5 and the first relation in (12.8.11), we find 
Li, (Ng) = (Caw) SIV WY Ye ns 


This formula carries over to the density |v,|. On the other hand, by the derivation 
property of the Lie derivative, 


1 1 
Ly, UT" Wal) = 217" Wal? Ly, UT" lval2) 
and thus 
1 1 1 
Ly (IT Wvnl?) = 5 Yn)" Wval? 


Now, the assertion follows by applying once again the derivation property of the Lie 
derivative. 


As a consequence of the lemma, the transport equation can be written in the form 


Ly, 1T* (aolvnl?) =0. (12.8.14) 


>In brief, for s € R, an s-density on a real vector space W of dimension n is a mapping v : 
W” — R satisfying v(Aw1,..., AWn) =|det A/S v(wy,..., wn) for all endomorphisms A of W. 
The s-densities on W form a vector space of dimension 1. By taking the s-densities on the tangent 
spaces at every point of a manifold M one obtains the real line bundle | A|*M of pointwise s- 
densities on M. Sections in this bundle are called s-densities on M. Every n-form v on M defines 
an s-density |v|* by |v[°(Xq,..., Xn) = |v(X1,..., Xn)|*. In particular, if M is orientable, | A|*M 
is trivial for every s. The product of an s|-density vy and an s2-density v2 on M is defined by 
(V1 V2)(X1,..-, Xn) = V(X, ..., Xn) v2(X1,..., Xp). It yields an (s; +.52)-density. The calculus 
of differential forms, notably the pull-back and, based on that, the Lie derivative, extends in an 
obvious way to s-densities. 
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In this form, it generalizes to arbitrary half-densities Gp on .7: 
Ly, a0 = 0. (12.8.15) 


The solutions of this equation will be referred to as generalized solutions of the 
transport equation (12.8.3). They can be obtained by means of the method of char- 
acteristics as follows. Let an” be a given initial condition for ap on D. In terms of 
the representation of (.%, 1) by (12.8.4), the solution of (12.8.15) generated by a, 
is given by 


(Eo, t) = al (IT &o)) (®*, 0 IT* (Ivnl2)) Go. 8), (12.8.16) 


where & € -% and @ denotes the flow of Xy. By restricting a generalized solution 
ag to an open subset of .Y which under JT is mapped diffeomorphically to an open 
subset of Q, one can construct an analytic solution ag of (12.8.3) on the latter via 
the relation 


ao = 1T*(aolvnl?). (12.8.17) 
By 3.2.13/2, the flows ® of Xu and @ of Yy are related by 


To @,=¢, of. (12.8.18) 


Using this, from (12.8.16) we obtain 


dig (ox(xo), t) = ay” (xo) (|det(,), I)? (IT*|vn|2) (or(X0), t), (12.8.19) 


where xg € D anda: D > .%) C T* Q denotes the mapping which yields the canon- 
ical lift of the pair (D, So). The partial derivatives in ($1) xo are taken with respect 
to the standard coordinates on Q = R” (Exercise 12.8.2). Thus, denoting 


F (t,x) = det(pr), 


we read off that the analytic solution induced by do is 


(0) 
poe (12.8.20) 


VIF, xo)l 


where t € R and xo € D are determined by x through the relation x = ¢; (xo). Note 
that this solution is limited to a region around D where t and xo are uniquely deter- 
mined by this relation. From (12.8.20) we obtain a short wave asymptotic solution 
of (12.8.1) up to first order: 


(0) 
ay (Xo) elk 5) 


(12.8.21) 
| ¥ (t, Xo)| 


u(x) = 
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Remark 12.8.2 


1. In the general case, the local solution of Eq. (12.8.8) is given by 


ago(x) = 1) 1% % )exp 1 fa oH (ous ) VS(¢(xo))) 
| J (t, xo)| : ° 2 0 ax! Ap; t/\A0), t ‘ 


see Exercise 12.8.1. 
2. We study the behaviour of the local solution (12.8.20) near the caustic. By 
(12.8.18) and IT ow =idp, we have 


(Ney = FT, ava) © (Pt eaten) © Ly (12.8.22) 


The right hand side makes sense for all ¢ such that (t, w(xo)) is in the domain 
of @ and we may use it to extend the mappings t > ($1) xo and tt> ¥ (t, xo) 
accordingly. For the extension, it may happen that ¢; (xo) belongs to the caustic 
I'(SA). Since, then, ®,(a@(xg)) belongs to the singular subset ¥'(.%), the map- 
pings IT ®, (a(xo)) 2nd hence (D1) x, are not bijective, so that Y (t, xo) = 0. Thus, if 
¢(X0) approaches the caustic with t running and xo fixed, -Y (t, xo) necessarily 
tends to zero and hence the solution (12.8.20) diverges. 
3. In local coordinates & ? on SY, we have 


TT*\v,|2 = [|det I7’||d&! A --- A dé” 


3 (12.8.23) 


By (12.8.22), for every xo € D, 


1 


ai 1 4 1 
| F(t, x0)|? \det(I75, ccna) |? = [det((Pr orgy) |? [det(Tacxay) [?- 


Thus, by (12.8.23), the solution (12.8.17) takes the form 


IT* (aolvnl?) &) 


1 1 
= al (xo) |det((®1)),¢q,))| 2 [det(TTeqyy)| 21d! A = Ade" |2, (12.8.24) 
where & = ®,;(a(xo)). Since det((P;),)) and det(I7)x5)) are regular, the right 
hand side of this formula makes sense for all t. Therefore, for given initial data, 
the generalized solution Gp can also be constructed explicitly as a continuation 
of an analytic solution ag through the caustic as follows. Choose a covering of 
Y by local coordinates &' and a subordinate partition of unity and use (12.8.24) 


to successively propagate I7*(ao|Vn| 2) to all of 7. 


Now, given the data (.7, 1) and dg, we will construct a global solution on Q of 
the differential equation (12.8.1) up to first order in i For that purpose, let {U;} 


be a covering of .Y by contractible subsets such that all intersections U; ™ U; are 
contractible and let {(B;, 7;, S;)} be a system of generating Morse families such 
that the immersions As, : Bs, > T*Q and t;y, : Ui > T*Q are equivalent. We may 
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assume that these families are reduced and that the B; are open subsets of Q x R” 
with 2; being induced from the natural projection. In what follows, for simplicity, 
we identify Bs, with U;. Now, let xo € Q be an arbitrary regular point. Then, for 
every & € IT~! (xo), we have a solution of (12.8.1) up to first order in k in some 


neighbourhood W of xo, 

u(x) = ag(xe"®™, 
where ao is determined by (12.8.17) and S is an analytic solution of the character- 
istic equation, locally generating .”. On W, the global solution we are looking for 
should be given by a sum of contributions of this type. However, if we want to put to- 
gether such terms we face the problem that these contributions are only determined 
up to a constant phase.”° To fix the relative phases we proceed as follows. 


(a) To fix the relative phases of contributions coming from a given element of the 
covering, we will use the method of stationary phase. 

(b) To combine the contributions of different elements U; and U; we must require 
that these contributions coincide if they come from the intersection U; N Uj. 
This leads to an additional topological condition on (.7%, 1) and k, known as the 
Bohr-Sommerfeld quantization condition. 


To accomplish step (a), let us choose an element U of the above covering and let 
(B, 2, S) be the corresponding Morse family. First, we show that, given the canon- 
ical’ volume forms v, and v, on R” and Q = R" respectively, S induces a natural 
volume form vz, on the fibre-critical submanifold Bs. Let p : B > R’ be the re- 
striction of the canonical projection. 


Lemma 12.8.3 There exists a unique volume form vg, on Bs such that 
"Vn A p*Vp = VBs A (S5) "Vr (12.8.25) 
on Bs C B. This volume form satisfies 
det(S’,) vB, = IT* vy. (12.8.26) 


Proof Denote F := S\,: B > R’. Let (x,y) € Bs. Choose a basis {X1,..., Xn} in 
T (x,y) Bs and vectors Y),..., ¥, complementing this basis to a basis in Tx y) B. On 
the one hand, we have Bs = F~!(0) and hence ker F i = Tix, y) Bs. On the other 
hand, since S is a Morse family, Bre = 5 (x,y), Sy (x, y)) must have rank r. It 


follows that F*v,(Y1,..., ¥,) #0, so that we can define 


T* Vy A p* X15 any Apel TeencgeN 
tC ce ee nS PVr)(X1 nV ay 
F*v,(Y1,...5 Yr) 


6Because the analytic solutions are determined up to an additive constant. 


7 With respect to the Euclidean metric. 
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This defines an n-form on Bs, indeed, because a change in the choice of the Y’s 
would produce the same determinant factor in the numerator and the denominator. 
Since the right hand side is nonzero, vg, is a volume form. It satisfies (12.8.25), 
because Fi. y>i = 0 for all j = 1,...,”. Finally, the relation (12.8.26) follows 
from F*v, = det(S},,)* vr. 


By means of vg,, to Go we can assign a function a: Bs = U > R, defined by 


1 


alvp,|2 = do. (12.8.27) 


We extend a smoothly to a function on the whole of B with compact support in 
the y-variables and, instead of the ansatz (12.7.3), we now consider the oscillatory 


integral 

k\if. 

(=) / elk SY a(x, y)d’y. (12.8.28) 
JU 


We assume that a admits an asymptotic expansion of the form (12.7.5). Then, the 
integral (12.8.28) is absolutely convergent and depends smoothly on x and k. For 
the analysis of the large k behaviour of integrals of this type one uses the method 
of stationary phase. For increasing k the function y + e!“5@Y) oscillates more and 
more quickly. Thus, only contributions from neighbourhoods of stationary points 
of S should count, whereas the other contributions should give zero in the average. 
This intuition is correct, as the following classical result shows. 


Theorem 12.8.4 (Stationary Phase Method) Let f and g be smooth functions 
on R’. Assume that f has compact support and that @ has a finite number of sta- 
tionary points y,...,Yp in the support of f , all of which are non-degenerate. Then, 


f d’ yf (yyelke™ 


fyael?O") 4. O(k-271), (12.8.29) 


7 signg” (ya) 
-(F =) a 


Proof Denote I (k) := { d’ yf (ye?) . The proof is in three steps. 
1. First, we show that if g does not have stationary points in the support of f, 
then 


lim kN 1(k) =0 (12.8.30) 
k->oo 
for all N > 0. Indeed, in this case, the differential operator 


oe ee 
~ ik |Voll2 2” ays 
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is defined everywhere. Since Dei“? = e'*, for any positive integer N we have 
I(k) = / d yf (y)DN* tee) — i; d’y(DIN+Y F) ye = O(k-N-1), 


where we have used that the Hermitian conjugate D* of D is given by 


1 0;9 
D‘ f =——a; d ; 
eae: (Gao!) 


2. Next, we show that for every smooth function g: R — R with compact support 
and every 4 € C with Re(A) > 0, we have 


i dtg (te =| + o(a7')), (12.8.31) 


where the root is chosen so that Re(V/A) > 0. Indeed, the Gauss integral formula 


yields 
/ © argine"P* =| 90) + / * ale) — ge?” 


for every real A > 0. The terms on the right hand side can be separately continued 
analytically to Re(A) > 0 and by continuity to Re(A) > 0. Then, the square root in 
the first term satisfies Re(A) > 0. Rewriting the second term as 


ie arg) — ge“? * a a(S 7 sO) ae 


- —0o 


we obtain the assertion. 

3. By point 1, we can write the integral J (k) as a sum of integrals over sufficiently 
small domains U4, each containing exactly one stationary point y4 of g. The con- 
tributions omitted this way vanish faster than any power of i Since y,4 is assumed 
to be non-degenerate, y’’(y,) is invertible. By the Morse Lemma 8.9.4, there exist 
local coordinates v' on U, such that v! (ya) = 0 and 


1 2 2 2 2 
e(y)) = 9A) + 5QW), AW) SVP UF Fv he te, 
where ig is the Morse index of g at y4. Thus, the contribution from U4 is given by 
tusk =e [arf sn| foe? 
Ua 


with J denoting the determinant of the Jacobi matrix of the coordinate transforma- 
tion. Since v = 0 is the only stationary point of Q, by point 1, we may extend this 
integral to R’ by choosing an arbitrary extension g of the integrand |J(v)| f (y(v)) 
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to R’. Since f has compact support, g may be chosen to have compact support, too. 
Thus, Jy, (k) reads 


i ik y2 ikv?2 ~iky? ik y2 
cites) f auyeltt + f duce "0 J dujypie 7 ot... | dupe "2" g(y). 


According to (12.8.31), the integral over uv, yields 


2 
y azar, + %-1,0)(1+0(K})). 


Iterating this argument, we obtain a factor ,/ =; =r = / 27 eit for each integration 


over v; with / > ig and a factor ,/ ay =_/ aT eT iF for each integration over v; with 
l <ig. Thus, we end up with 


On\2 On . 
Tu, (k) = (=) >ielf signg Y)| 7(0)| F(yayeXeO x O(k-27). 


A=1 


It remains to determine |J(0)|. For that purpose, we calculate 


1 37Q ay 
saad © = gma) 
ao ay* ap 
= Sylayk (y(v yo ts STOW 0) eet v) 


and take the absolute value of the determinant at v = 0. Since 


iQ. 
2av™av! 


6m; 
this yields 


|J(0)| = |dety"(ya)|? 


and thus completes the proof of the theorem. 


Now, we apply Theorem 12.8.4 to the integral (12.8.28). Every regular point of 
IT(U) possesses an open neighbourhood W such that JT restricts to a diffeomor- 
phism [7,4 : W4 — W on each connected component Wy, of TI7!(W) NU. For 
every x € W, the stationary points of the mapping y+» S(x, y) coincide with the 
y-variables of the points it, (x). Denote 


a= sign(S¥,,). 
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Corollary 12.8.5 For every regular point x € II(U) C Q, the integral (12.8.28) 
admits the asymptotic expansion u(x) + O(k7'), where 


i= >. ame Arey (12.8.32) 
A 


with 

a(IT,'(x)) 
/ldet 5%, (74 106))| 
Sa(x) = S(1,'(), 


o4(x) = o (IT, '(x)). 


aga (x) = 


The functions ag, are analytic solutions of the transport equation and the function 
u is an asymptotic solution of (12.8.1) to first order. 


Proof We only have to check that every ao, is an analytic solution of the transport 
equation. By (12.8.26) and (12.8.27), on Wa, 


dio = (alvag|?) = —S——17* (val?) = 1*(aoalval?). 
| det S”, | 


Since Go is a generalized solution of the transport equation, the assertion follows. 


Remark 12.8.6 


1. Since o jumps at X'(Bs) by a multiple of 2, the relative phases between the sum- 
mands in (12.8.32) are multiples of os In particular, if 4 and €,4 can be joined 
by acurve in » which has a single crossing with (Bg) and if this crossing is 
simple and transversal, then we know from the proof of Proposition 12.6.11 that 
o jumps by +2 at this crossing. Hence, the phase shift between the contributions 
of &4 and 4 is +7. 

2. That ag, is a solution of the transport equation can also be confirmed by a direct 
computation, see Exercise 12.8.3. 


With Corollary 12.8.5 we have accomplished step (a) of our programme for fixing 
the relative phases, outlined on page 714: the constant phase factors era yield the 
desired relative phases between contributions to the asymptotic solution of (12.8.1) 
stemming from one and the same element of the chosen covering of (.%, 1). 

To accomplish step (b) we must analyze the condition that the contributions of 
different elements U; and U; coincide if they come from the intersection U; Uj. 
For that purpose, we first note that the solution uv given by (12.8.32) is obtained from 
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the half density 
a 


| det S”, | 


i * 1 ‘ ~ 
et elk 17* (|v, 7) = elkS+ 70) G0 


on UN Xo(-%), where by an abuse of notation the restriction of the Morse family 
S to U is denoted by the same symbol.”® Thus, from the covering {U;} and the cor- 
responding system {(B;, 7;, S;)} of generating Morse families, we obtain a system 
{ij} of local half densities on Xo(.%) given by 


ij = SIFT) Gi, (12.8.33) 


The local half densities i; and uj coincide on Uj; NU; Xo() and hence combine 
to a half density # on Xo(.7) iff their phases coincide modulo 27 at every regular 
point of U;N U;: 


kS; + 79 =kS;+ 79) mod 2zr. (12.8.34) 


This is the consistency condition announced in step (b) above. It is known as the 
Bohr-Sommerfeld quantization condition. A topological interpretation of this con- 
dition will be given below. The half-density “ may be referred to as a generalized 
first order solution of (12.8.1). By means of a partition of unity {x ;} subordinate to 
the covering {Uj}, it is explicitly given by i = )/; x;#j, and projection to Q yields 


aj(&) ikS; Ig, 
u(x) = >>. >. Pa @)—a)_. (kS} (E)+ Fj ()) | (12.8.35) 
j €el1—!(x)NU; | det(S;)%,,(&)| 


This is the desired global first order solution u : Q \ (7%) > R of (12.8.1) asso- 
ciated with the generalized solutions (.7, 1) and do of the characteristic equation 
and the transport equation, respectively. By the Bohr-Sommerfeld quantization con- 
dition, u does not depend on the choice of the partition of unity. It is, therefore, 
uniquely determined by (.7%, 1) and do, indeed. 

The solution (12.8.35) can be rewritten as a so-called oscillatory half density on 
Q \ I'(.7) as follows. For every point of Q \ '(.%), there exists an open neigh- 
bourhood W such that JT restricts to a diffeomorphism [74 : W4 — W on each 
connected component W, of IT —!(W). Then, on W, we have 


ulval? =K* a := S\(1q!)*a. 
A 


The mapping K * is known as Maslov’s canonical operator. Let us derive an ex- 
plicit formula for K *” in terms of the canonical 1-form 6 on T*Q and the Maslov 
intersection index of curves in .~. For a detailed presentation we refer to [198]. 


28For convenience, in the remainder, we stick to this simplified notation. 
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Proposition 12.8.7 Let x € Q \ '(.%). Let & € Y be a regular point, let ya be 
curves”? from &o to ii (x) and let S(&) be an arbitrarily chosen phase. Then, 


Pn : T. I 
(K7a)(x) = Lewfi(k i; 6 — Find y(ya) + Kt) (aoalvnl?) (x), 
A YA 

where ind y(ya) is the intersection index of ya with (7) and aoalVal2 = 
(IT, ')*40. 
Proof It suffices to verify the asserted formula for every A separately. Thus, let 
y : [0,1] > be a curve from & to one of the 4. Choose j such that 4 € Uj. 
Then, 

—1\*A = A i(kS; Lg, L 

((114')*) &) = ((117')* Hj) &®) = 8G At FFE) (ag alvn|2) (x). (128.36) 


Choosing 0 = f < ti <--: < t < t41; = 1 and jo,...,j; = j such that 
v(t. ti41]) C U;, for alli =0,...,/, we can rewrite S;(€4) and 0; (&,4) as 


1 
séa=> | 
i=1°Y 


i 
dsj, + Si (y (t))) ~ Sint (v())) + Sj (Eo), 
i=l 


(ti.tigi) 


oj (Ea) = )0 (04 (v (41) — o7 (7H) 


i=0 
+ D064 (7 Gi) — 04-4 (¥@))) + a G0). 
i=l 


On the one hand, by (12.4.10), under the identification of U;, with Bs. , we have 
d$j, = “6 (12.8.37) 


for all i. Hence, the first term in the equation for Sj (€,4) yields 


l i 
> as,=)> f vo= | ro= f o= 6. 
i=, ° Viti) j=) YY tii) Y loy y 


On the other hand, by Proposition 12.6.11 and (12.6.16), the first term in the equa- 
tion for 0; (4) yields 


r 


Yo (oi (v G40) — 0, (yG))) = —2indy (7). 


i=0 


?°Which can always be chosen to intersect the singular subset 5(./) transversally. 


12.8 Geometric Asymptotics. Beyond Lowest Order 721 


The asserted formula now follows by inserting all of this into (12.8.36) and taking 
into account the Bohr-Sommerfeld quantization condition (12.8.34). 


Remark 12.8.8 


1. We give a topological interpretation of the Bohr-Sommerfeld condition. For that 
purpose, we observe that the functions s;; := S$; — S; on U; 1 Uj are constant, 
because (12.8.37) implies that on U; NU; we have 


dS;=1"9 =dS;. (12.8.38) 
Moreover, from Lemma 12.6.10 we know that the integer-valued functions 
cij = index((S;),,) — index((S;)‘,) 


on U; M Uj; are constant, too, and hence smooth. As noted in Sect. 12.6, the 
family {c;;} defines an element /1.y of the first integer-valued Cech cohomology 
H} (SY, Z) of %. The image of this element under the canonical homomorphism 


h: Hi(.SY,Z) > H'(.F,R) 


to the first de Rham cohomology H!(.Y, R) coincides with the Maslov class wy 
of (%, 4). Analogously, the family {s;;} defines a 1-cocycle and thus an element 
of the first real-valued Cech cohomology of .”. The image ay of this class 
under / is usually called the Liouville class of (.7, 4). By (12.8.38), it satisfies 
ay = [t*0). By Formula (12.6.16) we have 


1 
= 5 (ri rj) — (a —9;)), (12.8.39) 
where 7; and r; are the fibre dimensions of B; and B;, respectively. Thus, the 
Bohr-Sommerfeld condition takes the form 
k 1 
4 


1 
ci + gi rj) H!(.S,Z). 


Since the third term vanishes under h, the Bohr-Sommerfeld quantization condi- 
tion (12.8.34) can be rewritten in terms of the Maslov and the Liouville classes 
as 


ay = whe Eh(Hi(%,Z)). (12.8.40) 
The cohomology class 
k 1 
PS k= Ge ae 


is called the phase class associated with (.%, 1) and k. 
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2. Let us analyze which geometric object the local half-densities # ; combine to in 
the general case. According to Theorem 2.2.11, the families of transition map- 
pings {eli}, {e727} and {eisii— 7°} define complex line bundles over .7”. 
The first one will be denoted by &y, 4° The second one is the Maslov line bun- 
dle “@y constructed in Sect. 12.6, cf. Remark 12.6.14. The third one is the tensor 


product 
Dep=Ev,® Me, 


called the phase bundle associated with (.”, 1) and k. Note that the bundle &y 
is trivial (like .#), because the family {e'“5'} defines a global non-vanishing 
section in this bundle, cf. Example 2.3.3/2. The family {elt % } defines a section in 
the complex line bundle generated by the family of transition mappings {el 4% }. 


Due to (12.8.39), 


a Pig ris eg 
15 Cij 157; .1Z9%ij 1Zri 
e 2°) =e 4’ Je4-Ve Bes 


so that this bundle is naturally isomorphic to the Maslov line bundle ./y, cf. 
Remark 2.2.12/1. Via this natural isomorphism, the family {e!T%} defines a sec- 
tion in.@y. We conclude that, in the general case, the local half-densities aj on 


2o(/) combine to a section in the restriction of the line bundle ®.y , ® | A| 2S 
to Lo(S). 

By means of the phase bundle ®.¥ ;, the Bohr-Sommerfeld quantization con- 
dition can also be interpreted geometrically as follows. Recall from Sect. 12.6 
that the Maslov line bundle is associated with a principal Z4-bundle over .7. 
Since the latter has discrete structure group, it carries a unique connection, given 
by the unique lift of curves in .Y to curves in this bundle. This connection induces 
a natural connection in the Maslov line bundle .@y and hence in the phase bun- 
dle By ,. Let Ipar(@.y,,) be the space of the sections of ® y , which are parallel 
with respect to this connection. One can show that parallel sections exist iff the 
phase class is integer-valued, that is, iff the Bohr-Sommerfeld quantization con- 
dition holds. Moreover, if parallel sections exist, they are unique up to a constant 
factor and hence given by a constant multiple of the family {el RSi+ 40i dy, 

Let us add that it is common to pass to a k-independent universal object by 
building the C-module 


Vgi= r(\Al2.) ®c (I Fal ® 74))) 
k>0 


called the symbol space of (.%, 1). Obviously, if the Bohr-Sommerfeld quan- 
tization condition holds, the half-density “ is a symbol (an element of YW). 
Thus, the above construction assigns a symbol, i, to every generalized solu- 
tion (.Y, 1) of the characteristic equation, every generalized solution dg of the 


302, , may be identified with the pull-back under ¢ of the so-called prequantum line bundle over 
(T* Q, d@), see Sect. 4.1 and Appendix D of [36] for a detailed description. 
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transport equation and every k > 0 such that the Bohr-Sommerfeld quantization 
condition (12.8.34) is satisfied. 


Example 12.8.9 (Schrédinger equation) We consider the time-independent Schro- 
dinger equation of a particle with unit mass in one space dimension under the influ- 
ence of an external potential V, 


h2 
(- mae = E))y= 0. (12.8.41) 


This equation is of the form (12.8.1), where the symbol H is given by the classical 
Hamiltonian function of the system, 


p 
p= Ye, 


and i is given by Planck’s constant A. Thus, the short wave asymptotics of the 
solutions y corresponds to the semiclassical, or WKB,*! approximation h — 0. 
Let @g be a regular compact connected component of the energy surface H~!(0). 
Then, @g is a smooth closed curve in T*R & R?. For dimensional reasons, @¢ 
is a Lagrangian submanifold. Assume that V is such that @¢ has two intersection 
points x, and x_ with the x-axis. Then, E = V(x) and the singular subset 2’ (@z) 
consists of x; and x_. From Example 12.6.8/1 we know that the Maslov index wg, 
of the fundamental cycle of @g is equal to 2. Consequently, by integrating the phase 
class y. 1 over @z, we find that the Bohr-Sommerfeld condition (12.8.40) reads 


1 1 
— 6—-eZ, 
21h ISG, 2 


Obviously, 1. 6 = A(E), with A(£) being the area enclosed by the curve @g. As 


a result, the Bohr-Sommerfeld quantization condition for the Schrédinger equation 
in one dimension takes the form 


1 
A(E)=2nh(n +5), n=0,1,.... (12.8.42) 


For example, for the harmonic oscillator with potential V (x) = wy? , the curve @r 


LE | 


is an ellipse with semiaxes Pied in momentum direction and *— in position di- 
rection. Hence, A(E) = In Z =, So that the Bohr-Sommerfeld quaneation condition 


becomes 
E=h(n+ . 
=hin 5 J: 


31Named after Wentzel, Brillouin and Kramers. 


724 12. Hamilton-Jacobi Theory 


This is the exact formula for the quantum energy levels. The quantization condition 
(12.8.42) generalizes in an obvious way to an arbitrary integrable system. 

Next, we solve the Schrédinger equation (12.8.41) in the WKB approximation, 
that is, we determine the short wave asymptotic solutions of this equation. The char- 
acteristic equation (12.8.2) reads 


1/dS\? 
alae) +¥=s (12.8.43) 
xX 


and the transport equation in the form (12.8.8) is given by 


d d 

JE — V)=-4% 4 av AE —V)=0. (12.8.44) 
x x 

We restrict our attention to the interior of the classically allowed region, where EF > 


V. Here, the solutions of (12.8.43) and (12.8.44) are given by 


S(x) = +f pe — V(x))dx, ag(x) = c(2(E — V(x))) #, 


respectively. To determine the corresponding asymptotic solution of the Schrédinger 
equation (12.8.41), we apply Proposition 12.8.7. Let us choose & in the region of 
@g where p > 0 and let us put S(&) = 0. For every x in the interior of the classically 
allowed region, J7~!(x) consists of two points, &, with p > 0 and &_ with p <0. 
Connecting & with &_ by a clockwise oriented curve, we obtain 


Al 


VR) = ee [eh VOM 4 oh STEVIE 4 (8, 
—_ Xx 


because such a curve has Maslov index —1. This formula yields the WKB approx- 
imation for the solutions of the Schrddinger equation (12.8.41) in the classically 
allowed region. Up to a constant phase factor, it can be rewritten as a real-valued 


function: 
1 1 A 
¥@)~>5 EXVO) cos(¢ f 26 — V(x))dx — *). 


We encourage the reader to compare the derivation given here with the usual deriva- 
tion of this formula in the standard text books of quantum mechanics, see e.g. [101]. 


For an exhaustive discussion of the n-dimensional Schrédinger equation we refer 
to [198] and for the study of general spectrum conditions in the semiclassical ap- 
proximation we recommend [78]. The above discussed structures also yield a gen- 
eral geometric framework for WKB quantization. Regarding this aspect, we make 
the following remark. For details, the reader may consult [36]. 


Remark 12.8.10 As mentioned above, the line bundle &y , can be identified with 
the pull-back to the Lagrangian immersion (.%, 1) of the prequantum line bundle 
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E7*9,k over T* QO and the phase bundle can be identified with 
Dy =U EO KO MS. 


One says that a Lagrangian immersion (.%, 4) of T*Q is quantizable if it satisfies 
the Bohr-Sommerfeld quantization condition with k = i By Remark 12.8.8/1, this 
is equivalent to the requirements that the phase class g.~ ; be integer-valued, or that 
the phase bundle admit a global parallel section #. The pair consisting of (7, t) 
and u is called a semiclassical state and Maslov’s canonical operator is called a 
semiclassical quantization mapping. 


The method of stationary phase is restricted to points apart from the caustic, 
because Corollary 12.8.5 applies to such points only. Over focal points, every gen- 
erating Morse family necessarily has degenerate critical points, so that this method 
cannot be applied. To deal with this situation in a systematic way, one has to use 
distribution-valued half-densities, see [115], [141] and [36]. However, if we have a 
normal form for the generating families in the neighbourhood of the caustic at our 
disposal, we can nevertheless get insight into the behaviour of asymptotic solutions 
near the caustic. We discuss this for the simplest type, the fold. For an exhaustive 
treatment we refer again to [115]. From Proposition 12.6.15 we read off that the 
oscillatory integral (12.8.28) has the form 


[o.@) 
| elk FO+EWOY-IW ag (x, y)dy, (12.8.45) 
—0o 


where x € Q = R”. The Malgrange Preparation Theorem entails that a can be 
written in the form 


os 
a(x, y) = bo(x) + bi(x)y + h(x, Way & y) 


= bo(x) + bi(x)y + A(x, y)(g(x) — y’), 


where bo, bj and h are smooth functions. Inserting this into (12.8.45) and perform- 
ing partial integration, we obtain 


ikf (x) °° ik(@@y—4y3) °° ik(g@y—4y3) 
Ce” ag (X, k) e 37 dy + a1 (x, k) e 3- ydy . 
—0oo —0o 


(12.8.46) 
with ag and a; denoting certain asymptotic series. Using the Airy functions 


MON 1.3 
ace) = [ y-3 dy, 1teER, 
—cCO 


32See e.g. [110, §IV.2]. 
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we can rewrite this formula as 
: ‘5 k ’ k = 
ee { OO a(&3 €00) ce a aac). (12.8.47) 
k3 ik3 


Thus, we obtain the following asymptotic expansion of the oscillatory integral 
(12.8.45): 


eikf &) “a(S 200) +4 “2 4'(8 ee) + o(2). (12.8.48) 
i 2 


ik3 


Example 12.8.11 (Helmholtz equation) Let us return to the Helmholtz equation 
(12.7.2). We will use Formula (12.8.48) to study the qualitative behaviour of light 
rays in the neighbourhood of the caustic.* Inserting the normal form (12.6.28) for 
S into the eikonal equation, one obtains 


(Vf +y2(Vg) + 2yVF-Ve=l. 


According to (12.6.29), on Bs we have g(x) = y’. Hence, for every x, the variable 
y can take the values ++,/ g(x), and thus this equation breaks into the two equations 


(Vf +e(Ver=l, 2F-Vg=0. (12.8.49) 


Inserting the asymptotic expansion (12.8.48) of the oscillatory integral (12.8.45) 
into the Helmholtz equation and comparing coefficients, one obtains the following 
form of the transport equation, see Exercise 12.8.4: 


2V f - Vbo + Afbo + 2gVg- Vb; + gAghi + (Vg)b; =0, (12.8.50) 


2Vg-Vbo + Aghy +2V f -Vbi + Afb; =0. (12.8.51) 


Thus, Formula (12.8.48) yields a solution u of the Helmholtz equation up to first 
order in i provided the functions f, g, bo and b, solve Eqs. (12.8.49), (12.8.50) and 
(12.8.51). Equations (12.8.49) form a system of nonlinear partial differential equa- 
tions, which is elliptic for g < 0, hyperbolic for g > 0 and parabolic for g = 0. Ac- 
cording to (12.6.29), these cases correspond, respectively, to points in the shadow, 
points in the illuminated region and points on the caustic. Using asymptotic formu- 
lae for the Airy functions for large positive arguments [115] one finds that for large 
k and g > 0 one has 


ko seikf 2ke? ox 1. (2ke? x 
u~ ——,— }50cos — — ) —),g?2 sin ——]r.  (12.8.52) 
Vir(k3g)4 . = - 


This formula yields a model for the intensity of light in the illuminated region. For 
large k and g <0 one finds an exponentially decaying solution, which of course 


33This goes back to Ludwig [187], see also [86, 157]. 
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corresponds to the shaded region. However, close to the caustic, where k3 g is small 
even for large k, a detailed analysis yields 


u~ —A(k3g) ~ —A(0). (12.8.53) 
k3 


Comparing (12.8.52) with (12.8.53), we see that the value of uw on the caustic is 
of a similar magnitude as the value of u in the illuminated region, multiplied by 


ké. Let us conclude. While the ordinary short wave asymptotics given by (12.8.21) 
yields a divergent intensity of light on the caustic, the finer analysis sketched above 
provides an estimate for the intensity near the caustic. In particular, the intensity on 
the caustic is large but finite. 


Exercises 

12.8.1 Prove that the local solution of Eq. (12.8.8) is given by the formula in Re- 
mark 12.8.2/1. 
Hint. Show that +. x) =(V- Yu) (G(x). 

12.8.2 Confirm Formula (12.8.19). 

12.8.3 Prove by direct inspection that the function ap, given in Corollary (12.8.5) 
is a solution of the transport equation. 
Hint. A guide to the proof can be found in [187]. 

12.8.4 Use the identities 


A”(t)+1tA(t) =0, A(t) + tA'(t) + A(t) =0 


for the Airy functions to verify Formulae (12.8.50) and (12.8.51). 
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noncommutative integrability, 637 
symplectic reduction, 542 
Euler-Lagrange equations, 430 
Exact 
contact manifold, 386 
differential form, 185 
construction of a potential, 191 
symplectic manifold, 356 
Exponential mapping, 236 
of a classical group, 241 
of a vector Lie group, 241 
Extended 
Hamiltonian system, 442 
phase space, 441 
Extension of an integral curve, 98 
Exterior 
algebra, 76 
of differential forms, 165 
curvature, 700 
derivative, 168 
intrinsic formula, 170 
on a manifold with boundary, 179 
power of a vector bundle, 75 
product, 75 
of differential forms, 165 


F 
Fibre 
bundle, 294 
derivative, 429 
dimension of a Morse family, 668 
Hessian of a Morse family, 669 
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Fibre-critical submanifold, 662 
Fibre-preserving mapping of Morse families, 
669 
Figure eight, 37 
Lagrangian immersion, 664 
Maslov index, 685 
non-equivalent submanifold structures, 41 
singular subset and Maslov class, 680 
First return mapping, 137 
First return time function, 137 
Flag manifold, 264 
Floquet multipliers, 363, 456 
Floquet’s Theorem, 131 
Flow, 99 
conjugacy, 127 
geodesic, 437 
symplectic reduction, 533 
Hamiltonian, 360 
linearization, 126 
of commuting vector fields, 106 
on a topological space, 303 
projection to the orbit space, 303 
characteristic exponents and 
multipliers, 313 
time-dependent, 112 
Flow box chart, 107, 437 
Focal point, 675 
Fold, 691 
Foliation, 123 
dimension or rank, 123 
Lagrangian, 397 
natural fibrewise action, 398 
regular and singular, 123 
simple, 124 
symplectic, 368 
Force-free top, 542 
Free Lie group action, 274 
Frobenius Theorem, 120 
for Pfaffian systems, 209 
Functional stability criteria, 150 
Functions in involution, 569 


G 
G-manifold, 270 
Hamiltonian, 492 
Riemannian, 272 
symplectic, 272, 396, 492 
g-manifold, 278 
symplectic, 492 
G-vector bundle, 272 
Gauge 
field in lattice gauge theory, 549 


Index 


transformation 
in electromagnetism, 204 
in lattice gauge theory, 550 
Gauk 
law., 551 
theorem, 202 
General linear group, 7, see also classical Lie 
groups 
Generalized 
action and angle variables, 628 
force, 216 
Hamilton-Jacobi equation, 653 
complete integral, 655 
solution 
of the eikonal equation, 696 
of the Hamilton-Jacobi equation, 650 
of the transport equation, 712 
Generalized Jacobi Theorem, 656 
Generalized Poincaré Lemma, 192 
Generating 
function 
for a canonical transformation, 404 
for a Lagrangian submanifold, 405 
Morse family, 662 
at a point, 668 
Generic position, 676 
Geodesic 
flow, 437 
momentum mapping, 533 
symplectic reduction, 533 
vector field, 437, 533 
Geometric 
asymptotics, 692 
multiplicity, 130 
optics, 692 
solution, 650 
Germ of immersions, 674 
Global trivialization, 58 
Gradient, 200 
vector field, 418, 452 
Graph criterion 
for a symplectic reduction, 401 
for a symplectomorphism, 403 
GraBmann manifold, 264 
as an orbit manifold, 292 
Grobman-Hartman Theorem, 148 
Gromov Nonsqueezing Theorem, 451 
Group 
action, see Lie group action 
average, 251 
classical, 7, 14, see also classical Lie 
groups 


Index 


Group (cont.) 
of symplectomorphisms 
of a symplectic manifold, 406 
of a symplectic vector space, 325 


H 
H-Lyapunov function, 313 
H-stability, 313 
Haar measure, 250 
Half-density, 711 
oscillatory, 719 
Hamilton equations, 428 
in terms of action and angle variables, 602 
under linear nonholonomic constraints, 431 
Hamilton-Jacobi equation, 643, 644 
analytic, geometric and generalized 
solutions, 650 
complete integral, 646 
generalized, 653 
Hamiltonian 
diffeomorphism, 410 
flow, 360 
flow box chart, 437 
function, 428 
G-manifold, 492 
isotopy, 410 
Lie group action, 492 
mechanics, 428 
system 
on a Poisson manifold, 434 
on a symplectic manifold, 428 
regular reduction, 510 
singular reduction, 527 
symmetric, 492 
time-dependent, 441 
vector field 
on a Poisson manifold, 365 
on a symplectic manifold, 360 
Harmonic oscillator 
action and angle variables, 604 
hidden symmetry, 503 
Helmholtz equation, 693 
Hermitian vector bundle, 87 
Hessian 
of a function, 416 
of a relative equilibrium, 309 
of an equilibrium, 127 
Heteroclinic, 161 
Hidden symmetry 
of the harmonic oscillator, 503 
of the Kepler problem, 539 
Hilbert mapping 
for a lattice gauge model, 555 
for the spherical pendulum, 547 
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Hodge 
dual, 198 
of the exterior derivative, 200 
star operator, 198 
Hodge-Laplace operator, 200 
Hofer norm and metric, 414 
Hofer-Zehnder 
capacity, 451 
theorems, 452 
Holonomic constraints, 214 
Homoclinic, 161 
Homogeneous 
coordinates, 9 
distribution, 117 
manifold, 280 
space, 261, 280 
Homomorphism 
bundle, 77 
of representations, 243 
theorem 
for Lie groups, 266 
for vector bundles, 89 
Homotopy 
groups, 188 
of mappings, 188 
operator, 190 
type, 188 
with fixed end points, 342 
Homotopy-equivalence, 188 
Horizontal part of a vector bundle morphism, 
83 
H6rmander index, 352 
H6rmander-Kashiwara index, 347 
Hyperbolic 
critical integral curve or fixed point, 130 
endomorphism or automorphism, 130 
umbilic, 691 
Hyperplane distribution, 385 
Hyperregular Lagrangian function, 429 
Hypersurface, 385 
of contact type, 392 
Hypocycloid, 556 


I 
Ideal gas, 212 
Ideal lense (eikonal function), 706 
Identity component, 223, 265 
Immersion, 32 
equivalence, 35 
figure eight, 37 
Implicit Mapping Theorem, 32 
Index 
H6rmander, 352 
Kashiwara, 347 
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Index (cont.) 
Maslov 
of a Lagrangian immersion, 678 
of the Lagrange-Grafmann manifold, 
338 
Maslov intersection 
for Lagrangian immersions, 679 
for pairs, 351 
of the Lagrange-Grafmann manifold, 
343 
Morse, 416 
of an equilibrium, 137 
Induced 
Lie algebra homomorphism, 232 
vector bundle, 82 
vector field on a submanifold, 91 
Inertia tensor, 543 
Initial 
submanifold, 39 
topology, 5 
Inner 
automorphisms of a Lie group, 222 
momentum mapping, 502 
multiplication of differential forms with 
multivector fields, 166 
Integrable 
differential form, 178 
distribution, 116 
criteria, see Stefan-Sussmann Theorem 
Pfaffian system, 208 
criteria, 210 
system, 569 
action and angle variables, 589 
characteristic frequencies, 602 
isochronous or anisochronous, 603 
non-degenerate, 603 
noncommutative integrability, 627 
symplectic reduction, 588 
Integral 
criterion for exact differential forms, 192 
curve 
critical, 126 
of a time-dependent vector field, 111 
of a vector field, 97 
periodic, 126 
relatively critical, relatively periodic, 
308 
invariant, 183 
Poincaré-Cartan, 445 
manifold 
of a distribution, 116 
of a Pfaffian system, 208 
Integrating factor, 211 


Index 


Integration 

on a manifold, 178 

on a manifold with boundary, 179 
Interior of a manifold with boundary, 179 
Intersection of transversal submanifolds, 50 
Intertwine 

Lie algebra actions, 278 

Lie group actions, 270 

representations, 243 
Invariance of domain, 159 
Invariant 

differential form, 182 

function, 275 

manifold, 155 

of a linear vector field, 160 


of the modified harmonic oscillator, 161 


of the planar pendulum, 160 
scalar product, 251 
tensor field, 275 
theory 
for a lattice gauge model, 555 
for the spherical pendulum, 547 
torus, 466, 602 
vector field, 303 
Inverse Mapping Theorem, 31 
Inversion mapping, 220 
Involutive distribution, 117 
Isochronous integrable system, 603 
Isoenergetic Poincaré mapping, 455 
Isoenergetically non-degenerate, 616 
Isospectral, 578, 579 
Isotopy, 409, 410 
Isotropic 
immersion or submanifold, 358 
subspace, 319 
vector subbundle, 359 
Isotropy 
group, 272 
representation 
for a proper action, 283 
of the Lie algebra, 277 
of the Lie group, 273 


type 
stratum, 295 
subset, 274 


Isotypic spectrum part, 312 


J 
Jacobi identity, 95 
Jacobi Theorem, 647 


K 
KAM theory, 468, 615 
Kashiwara index, 347 


Index 


Kepler problem, 536 
action and angle variables, 605 
energy-momentum mapping, 558 
Hamilton-Jacobi equation and complete 
integral, 648 
integrability, 570 
Lenz-Runge vector, 539 
momentum mapping, 537 
Moser’s regularization, 541 
noncommutative integrability, 640 
relation to the geodesic flow on S*, 541 
symplectic reduction, 536 
Kernel 
of a Lie group action, 272 
of a multilinear form, 183, 315 
of a vertical vector bundle morphism, 86 
Killing 
form, 247 
vector field, 276 
adjoint and coadjoint representations, 
277 
left and right translation, 278 
Kirillov form, 377 
Kirillov Theorem, 377 
Kirkwood gaps, 477 
Kovalevskaya top, 575 
Kronecker tensor field, 78 
Kiinneth Formula, 192 


L 
Lagrange 
class, 622 
equations, 215, 216 
top, 575 
Lagrange-GraBmann manifold, 333 
Lagrangian 
complement of a Lagrangian vector 
subbundle, 359 
foliation, 397 
natural fibrewise action, 398 
function, 216 
immersion, 358 
generated by a Morse family, 662 
quantizability, 725 
mechanics, 216, 428 
submanifold, 358 
generating function, 405 
of a cotangent bundle, 373 
Weinstein Theorem, 397 
subspace, 319 
criteria, 320 
vector subbundle, 359 
Lattice gauge theory, 549 
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Lax equation, Lax pair, 578 
Leaf of a foliation, 123 
Left 
coset, 261 
Lie algebra action, 278 
Lie group action, 270 
multiplication, 222 
translation, 222 
Killing vector fields, 278 
momentum mapping, 501 
symplectic reduction, 512 
trivialization, 223 
in terms of left-invariant vector fields, 
228 
in terms of right-invariant vector fields, 
235 
Left-invariant 
differential form, 248 
distribution, 256 
vector field, 227 
volume form, 250 
Legendre transformation, 429 
Lemma 
generalized Poincaré, 192 
local straightening, 107 
Morse, 416 
Morse isotopy, 418 
Morse-Bott, 423 
Poincaré, 190 
splitting of Morse families, 669 
Lenz-Runge vector, 539 
Level Set Theorem 
for manifolds, 50 
for R", 11 
Lie algebra, 95 
action, 278 
symplectic, 492 
homomorphism, 95 
homomorphism induced by a Lie group 
homomorphism, 232 
of a Lie group, 228 
for the classical Lie groups, 229-232 
simple, semisimple, solvable, nilpotent, 
234 
Lie derivative, 109 
formulae, 109, 110, 172, 173 
Lie group, 220, see also classical Lie groups 
cotangent bundle, 223 
symplectic structure, 372 
homomorphism, 220 
identity component, 223 
induced Lie algebra homomorphism, 232 
parallelizability, 223, 228, 235 
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Lie group (cont.) 
simple, semisimple, solvable, nilpotent, 
234 
tangent bundle, 223 
Lie group action, 270 
canonical, 396 
direct product, 271 
effective, free, transitive, 274 
Hamiltonian, 492 
kernel, 272 
lift to the cotangent bundle, 500 
locally free, 505 
orbit, orbit mapping, orbit space, 272 
proper, 281 
properly discontinuous, 292 
restriction, 271 
Riemannian, 272 
stabilizer, 272 
strongly Hamiltonian, 497 
symplectic, 272, 396, 492 
Lie subgroup, 256 
conjugacy classes, 274 
embedded, 256 
equivalence, 256 
Lie-Poisson structure, 367 
noncommutative integrability, 637 
symplectic foliation, 379 
Lift 
of a diffeomorphism, 372 
of a Lie group action, 500 
Light ray, 694 
Linear 
asymptotic stability, 145 
criteria, 147, 148 
Lie group action, 271 
Poisson structure, 367 
slice, 286 
stability, 145 
criteria, 147, 148 
symplectic 
reduction, 322 
slice, 508 
vector field, 102 
critical integral curves, 132 
in two dimensions, 132 
invariant manifolds, 160 
Linearization of a flow, 126 
Linearly equivalent vector fields, 132 
Liouville 
class, 721 
form, 355 
vector field, 371, 385 


Index 


Liouville Theorem 
on integrable systems, 585 
on the phase space volume form, 433 
Local 
centre, centre-stable, centre-unstable 
manifolds, 156 
chart, 2 
coframe, 72 
diffeomorphism, 19 
frame, 67 
invariant manifold, 155 
refractive index, 693 
representative 
of a mapping, 17 
of a section, 65 
of a tangent vector, 23 
of a vector bundle morphism, 61 
section 
of a submersion, 34 
of a vector bundle, 65 
stable manifold, 156 
symplectic potential, 356 
trivialization 
of a principal bundle, 292 
of a vector bundle, 58 
unstable manifold, 156 
wave vector, 693 
Local Straightening Lemma, 107 
Locally 
compact, 2 
connected, 2 
defining 1-form for a hyperplane 
distribution, 386 
finite, 20 
free, 505 
Hamiltonian vector field, 361 
trivial 
fibre bundle, 294 
mapping, 557 
Lorentz transformation, 203 
Lorentzian manifold, 206 
Lorenz gauge, 204 
Lyapunov 
asymptotic stability, 152 
function 
for a critical integral curve, 150 
for a relatively critical integral curve, 
313 
stability, 152 
Lyapunov Centre Theorem, 459 
Lyapunov Subcentre Theorem, 475 
Lyapunov Theorem 
for a critical integral curve, 151 
for a relatively critical integral curve, 313 
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M 
Manifold 
atlas, 3 
centre, centre-stable, centre-unstable, 156 
complex analytic, 3 
contact, 386 
differentiable, 3 
dimension, 2 
direct product and sum, 10 
fibre-critical, 662 
flag, 264 
GraBmann, 264 
homogeneous, 280 
integral 
of a distribution, 116 
of a Pfaffian system, 208 
Lagrange-Grafmann, 333 
local chart, 2 
Lorentzian, 206 
orientation, orientable, oriented, 176 
parallelizable, 69 
Poisson, 365 
pseudo-Riemannian, 194 
real analytic, 3 
Riemannian, 194 
smooth, 3 
stable, 156 
Stiefel, 263 
symplectic, 354 
topological, 2 
unstable, 156 
Manifold with boundary, 179 
Mapping 
contact, 386 
degree, 188 
differentiable, 17 
energy-momentum, 556 
equivariant, 270 
local representative, 17 
locally trivial, 557 
momentum, 492, see momentum mapping 
period, 126, 478 
Poincaré, 138 
isoenergetic, 455 
Poisson, 365 
proper, 281 
smooth, 18 
symplectic 
criteria, 362, 365 
of symplectic manifolds, 354 
of symplectic vector spaces, 317 
transversal, 47 
Marle-Guillemin-Sternberg normal form, 521 
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Maslov 
class 
of a Lagrangian immersion, 678 
of the circle, 680 
of the figure eight, 680 
universal, 340 
cycle, 333 
index 
of a Lagrangian immersion, 678 
of the circle, 685 
of the figure eight, 685 
of the Lagrange-GraBmann manifold, 
338 
intersection index 
for pairs, 351 
of a Lagrangian immersion, 679 
of the Lagrange-GraBmann manifold, 
343 
line bundle, 687 
principal bundle, 687 
Maslov’s canonical operator, 719 
Mathieu equation, Mathieu function, 484 
Maurer-Cartan 
equation, 250, 254 
form, 253 
Maximal 
atlas, 4 
integral curve, 98 
Maxwell’s equations, 203 
Method 
of characteristics, 651 
for the eikonal equation, 696, 712 
of isospectral deformation, 579 
of stationary phase, 715 
Metric, see Riemannian manifold 
Minkowski space, 201 
Mishchenko-Fomenko Theorem, 633 
Mobius strip, 6 
as a vector bundle, 59 
as an orbit manifold, 292 
Modified harmonic oscillator 
critical integral curves, 135 
invariant manifolds, 161 
Poincaré mapping, 141 
Modular function, 251 
Momentum mapping, 492 
for coadjoint orbits, 502 
for the Euler top, 544 
for the geodesic flow, 533 
for the inner automorphisms of a Lie 
group, 502 
for the Kepler problem, 537 
for the spherical pendulum, 564 
for the translations on a Lie group, 501 
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Monodromy, 618 
of the spherical pendulum, 626 
Morphism 
of G-manifolds, 270 
of g-manifolds, 278 
of Lie group actions, 270 
of vector bundles, 60 
Poisson, 365 
Morse 
family, 660 
at a point, 674 
fibre dimension, 668 
fibre Hessian, 669 
fibre-critical submanifold, 662 
generated Lagrangian immersion, 662 
operations, 671 
reduced, 671 
Splitting Lemma, 669 
stable equivalence, 690 
stable equivalence at a point, 672 
function, 415 
homology, 421 
index, 416 
inequalities, 421 
theory, 415 
Morse Isotopy Lemma, 418 
Morse Lemma, 416 
Morse-Bott function, 422 
Morse-Bott Lemma, 423 
Morse-Smale pair, 421 
Moser’s regularization of the Kepler problem, 
541 
Multivector field, 78 


N 
Natural 
1-form on a cotangent bundle, 371 
coorientation of the Maslov cycle, 341 
decomposition of a vector bundle 
morphism, 83 
fibrewise action 
on a cotangent bundle, 375 
on a Lagrangian foliation, 398 
symplectic form on a cotangent bundle, 371 
volume form 
of a Riemannian manifold, 196 
of a symplectic manifold, 355 
of a symplectic vector space, 318 
Nekhoroshev Theorem, 628 
Nilpotent Lie algebra or Lie group, 234 
Noether Theorem, 495 
Non-autonomous Hamiltonian system, 441 
Non-characteristic position, 650 


Index 


Non-degenerate 
bilinear form, 315 
critical integral curve or fixed point, 130 
critical point of a function, 416 
elliptic 4-elementary fixed point, 466 
endomorphism or automorphism, 130 
integrable system, 603 
Non-resonant torus, 602 
Noncommutatively integrable system, 627 
Lie-Poisson structure, 637 
symplectic reduction, 633 
Nonholonomic constraints, 214 
Hamiltonian formulation, 431 
Normal 
bundle, 91 
form 
Birkhoff, 463, 467 
for a hyperbolic critical integral curve, 
150 
for Morse families, 691 
for the linearized flow along a periodic 
integral curve, 129 
space, 91 


O 
w-orthogonal 
subspace, 316 
vector subbundle, 359 
One-parameter 
family of differential forms, 174 
group of local diffeomorphisms, 104 
subgroup, 222 
Operations on Morse families, 671 
Orbit, 272 
cylinder, 456 
manifold, 291 
mapping, 272 
reduction 
regular case, 513 
space, 272 
for a free proper action, 291 
smooth function, 299 
structure theorem, 296 
type, 274 
stratum, 295 
Orbit Theorem, 279 
Orbit-momentum type stratum, 522 
Orientation and orientability 
of a manifold, 176 
of a manifold with boundary, 179 
of a vector bundle, 58 
Orthogonal group, 14, see also classical Lie 
groups 


Index 


Oscillatory 
half-density, 719 
integral, 715 


P 
Pairing of differential forms with multivector 
fields, 166 
Parabolic umbilic, 691 
Parallel transporter in lattice gauge theory, 549 
Parallelizable, 69, 70 
Parameter manifold, 655 
Parametric resonance, 478 
Partial derivatives, 61 
Partition of unity, 20 
on a manifold with boundary, 179 
Path-ordered exponential, 131 
Period, 126 
automorphism 
of a periodic integral curve, 129 
of a relatively periodic integral curve, 
309 
bundle, 618 
lattice, 589 
mapping, 126, 478 
Periodic integral curve, 126 
centre, centre-stable, centre-unstable 
manifolds, 156 
characteristic multipliers, 130 
elliptic, 130 
hyperbolic, 130 
stability criteria, 149 
invariant manifolds, 155 
non-degenerate, 130 
period automorphism, 129 
stability criteria, 143 
stable and unstable manifolds, 156 
Periodically time-dependent vector field, 114 
Perturbation theory, 612 
Pfaffian system, 207 
integrability criteria, 210 
integral manifold, 208 
Phantom burst, 476 
Phase bundle and phase class, 721, 722 
Phase space, 428 
Piecewise C*-curve, 43 
Planar pendulum, 134 
invariant manifolds, 160 
with moving suspension, 484 
with varying length, 481 
Pliicker equation, 534 
Poincaré Lemma, 190 
Poincaré mapping, 138 
isoenergetic, 455 
Poincaré-Birkhoff Theorem, 488 
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Poincaré-Cartan integral invariant, 445 
Poincaré-Cartan Theorem, 182 
Point transformation, 372 
Poisson 

bivector, see Poisson tensor 

bracket, 364 

equation for the top, 573 

manifold, 365 

mapping, 365 

morphism, 365 

space, 532 

structure, 365 

linear, 367 

tensor, 366 

vector field, 368 
Polar 

coordinates, 6 

decomposition, 227 
Potential for an exact differential form, 191 
Prequantum line bundle, 722 
Principal bundle, 292 
Problem of small divisors, 472 
Product rule, 61 
Projection 

of a vector bundle morphism, 60 

of an invariant flow to the orbit space, 303 
Projective 

cotangent bundle, 389 

space, 8 

as an orbit manifold, 292 

Proper (mapping or action), 281 
Properly discontinuous action, 292 
Pseudo-Riemannian manifold, 194 
Pull-back 

of a differential form, 80 

of a section, 72, 76 

of a vector bundle, 82 

of functions, 30 


Q 
Quantizable Lagrangian immersion, 725 
Quasiperiodic, 468, 602, 611, 615, 631 
Quaternionic 
conjugation, 7 
symplectic group, 15, see also classical Lie 
groups 
Quaternions, 7 
Quotient 
homomorphism, 266 
Lie group, 265 
of a Lie group action, see orbit space 
vector bundle, 89 
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R 
r-elementary, 463 
Rank 
of a bilinear form, 315 
of a distribution, 115 
of a foliation, 123 
of a Pfaffian system, 207 
of a vector bundle, 58 
of a vector bundle morphism, 60 
symplectic, 319 
Real analytic 
manifold, 3 
mapping, 18 
Real and complex orthogonal groups, 14, see 
also classical Lie groups 
Reconstruction Theorem for vector bundles, 62 
Recurrence Theorem of Poincaré, 447 
Reduced 
characteristic exponents and multipliers, 
311, 312 
configuration space, 214 
mass, 571 
Morse family, 671 
phase space, 509, 522, 529, see also 
symplectic reduction 
symplectic manifold, 399 
Reduction to the centre manifold, 163 
Reeb 
graph, 584 
theorem, 421 
vector field, 388 
Regular 
distribution, 86, 115 
energy surface, 435 
foliation, 123 
Lagrangian function, 429 
orbit cylinder, 456 
orbit reduction, 513 
Pfaffian system, 207 
point, 11, 34 
symplectic reduction, 509 
of a symmetric Hamiltonian system, 
510 
value, 11, 34 
Regular Reduction Theorem, 509 
Related 
sections under a morphism, 67 
vector fields under a mapping, 67 
Relative 
equilibrium, 308 
Lyapunov function, 313 
period, 308 
phase, 308 
stability, 313 
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topology, 5 
Relatively 
critical integral curve, 308 
invariant differential form, 182 
periodic integral curve, 308 
Representation, 243 
Resonant torus, 602 
Restricted 3-body problem, 477 
Restriction 
of a Lie group action, 271 
of a mapping in domain or range, 38 
of a vector bundle to a submanifold, 83, 84 
of a vector field to a submanifold, 91 
Riemannian 
G-manifold, 272 
Lie group action, 272 
manifold, 194 
natural volume form, 196 
Right 
coset, 261 
Lie algebra action, 278 
Lie group action, 270 
multiplication, 222 
translation, 222 
Killing vector fields, 278 
momentum mapping, 501 
symplectic reduction, 512 
trivialization, 223 
in terms of left-invariant vector fields, 
228 
in terms of right-invariant vector fields, 
235 
Right-invariant 
differential form, 254 
vector field, 235 
flow, 237 
Rolling disk, 214 


S 
Sard’s Theorem, 34 
Section 
of a submersion, 34 
of a vector bundle, 65 
Semialgebraic set, 547 
Semiclassical 
approximation, 723 
quantization mapping, 725 
state, 725 
Semisimple Lie algebra or Lie group, 234 
adjoint and coadjoint representations, 248 
Separatrix, 135 
Shifting trick, 503 
Short wave asymptotics, 692 
Signature of a quadratic form, 345 
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Simple 
crossing, 342 
foliation, 124 
Lie algebra or Lie group, 234 
Singular 
distribution, 115 
foliation, 123 
Pfaffian system, 207 
point, 11, 34 
reduction 
via regular reduction, 528 
subset 
of a Lagrangian immersion, 675 
of the circle, 680 
of the figure eight, 680 
symplectic reduction, 522 
of a symmetric Hamiltonian system, 
527 
value, 11, 34 
Singular Reduction Theorem, 524 
Singularity theory, 690 
Sjamaar’s principle, 528 
Slice 
decomposition of invariant vector fields, 
307 
of a chart adapted to a distribution, 118 
of a tubular neighbourhood, 286 
representation, 280 
for a proper action, 283 
Slice Theorem, see Tubular Neighbourhood 
Theorem 
Smale programme, 557 
Small perturbations, 612 
Smooth 
family of differential forms, 174 
function on the orbit space, 299 
manifold, 3 
mapping, 18 
SO(3, 1), see also classical Lie groups 
Lie algebra, 232 
universal covering homomorphism, 226 
SO(3), see also classical Lie groups 
adjoint representation, 246 
coadjoint orbits, 381 
Lie algebra, 230 
universal covering homomorphism, 224 
SO(4), see also classical Lie groups 
adjoint representation, 246 
coadjoint orbits, 381 
Lie algebra, 232 
universal covering homomorphism, 225 
Solvable Lie algebra or Lie group, 234 
Sp(1) 
isomorphism to SU(2), 224 
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Lie algebra, 230 
Space 
cone, 611 
coordinates, 543 
Special orthogonal and unitary groups, 14, 15, 
see also classical Lie groups 
Spectral stability criteria, 150 
Spectrum of a linear symplectomorphism, 325 
Spherical coordinates, 6 
Spherical pendulum 
action variables, 625 
energy surfaces, 623 
energy-momentum mapping, 563 
integrability, 576 
invariant theory and Hilbert mapping, 547 
momentum mapping, 564 
monodromy, 626 
nonexistence of global action variables, 624 
symplectic reduction, 545 
Splitting Lemma for Morse families, 669 
Splitting Theorem for Poisson manifolds, 369 
Stability, 142 
criteria 
for hyperbolic critical integral curves, 
149 
for periodic integral curves, 143 
linear, 145 
criteria, 147, 148 
Lyapunov, 152 
of a planar pendulum with moving 
suspension, 484 
of a swing (planar pendulum with varying 
length), 481 
relative, 313 
structural, 153 
Stabilizer, 272 
Stable 
burst, 476 
equivalence 
of Morse families, 690 
of Morse families at a point, 672 
manifold, 156 
of a linear vector field, 160 
of the modified harmonic oscillator, 161 
of the planar pendulum, 160 
Morse family, 691 
Standard 
contact forms on R2"*+! and on S2"+!, 389 
coordinates on R”, 4 
smooth structure of R”, 4 
symplectic matrix, 317 
symplectic structure of R2”, 317 
Stationary phase method, 715 
Stefan-Sussmann Theorem, 118 
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Stereographic projection, 4 
Stiefel manifold, 263 
Stokes’ Theorem, 180 
classical version, 202 
Straightening Lemma, 107 
Stratum 
orbit type, 295 
structure theorem, 296 
orbit-momentum type, 522 
Strict symplectic reduction, 399 
Strong Lyapunov function, 151 
Strongly Hamiltonian Lie group action, 497 
Structural stability, 153 
Structure 
constants, 228 
group of a principal bundle, 293 
SU(2), see also classical Lie groups 
adjoint representation, 246 
coadjoint orbits, 380 
isomorphism to Sp(1), 224 
Lie algebra, 230 
SU(3), see also classical Lie groups 
adjoint quotient, 301 
adjoint representation, 246 
coadjoint orbits, 380 
Subimmersion, 35 
Submanifold, 35 
coisotropic, 358 
criteria, 384 
embedded, 38 
equivalence, 35 
fibre-critical, 662 
figure eight, 37 
initial, 39 
intersection, 50 
isotropic, 358 
Lagrangian, 358 
of a manifold with boundary, 179 
symplectic, 358 
transversal, 48 
weakly embedded, see initial 
Submersion, 32 
existence of local sections, 34 
Subordinate covering, 20 
Superintegrable, see noncommutatively 
integrable 
Suspended Hamiltonian system, 442 
Suspension 
of a Morse family, 671 
of a vector space automorphism, 128 
Swallowtail, 691 
Swing, 481 
Symbol 
of a differential operator, 695 
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space, 722 


Symmetric Hamiltonian system, 492 


regular reduction, 510 
singular reduction, 527 


Symplectic 


basis, 317 
capacity, 451 
foliation, 368 
of the Lie-Poisson structure, 379 
form 
on a manifold, 354 
on a vector space, 317 
G-manifold, 272, 396, 492 
g-manifold, 492 
group, 15, 325, see also classical Lie 
groups 
spectral properties, 325 
immersion, 358 
leaves of a Poisson manifold, 368 
Lie algebra action, 492 
Lie group action, 272, 396, 492 
manifold, 354 
mapping 
basic properties, 324, 355 
criteria, 362, 365, 403 
of symplectic manifolds, 354 
of symplectic vector spaces, 317 
matrix, 317 
normal bundle, 359, 508 
orthogonal 
subspace, 316 
vector subbundle, 359 
potential, 356 
rank, 319 
reduction 
for a lattice gauge model, 551 
for a symmetric Hamiltonian system, 
510, 527 
for integrable systems, 588 
for noncommutatively integrable 
systems, 633 
for symplectic manifolds, 399 
for symplectic vector spaces, 322 
for the Euler top, 542 
for the geodesic flow, 533 
for the Kepler problem, 536 
for the spherical pendulum, 545 
for the translations on a Lie group, 512 
graph criterion, 401 
structure, see symplectic form 
submanifold, 358 
subspace, 319 
tubular neighbourhood, 521 
vector bundle, 354 
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Symplectic (cont.) 

vector field, 360 

vector space, 317 

vector subbundle, 359 

volume form, 355 
Symplectic Eigenvalue Theorem, 325 
Symplectic Foliation Theorem, 368 


Symplectic Tubular Neighbourhood Theorem, 


520 

infinitesimal version, 508 
Symplectization, 390 
Symplectomorphism 

generating function, 404 

graph criterion, 403 

group (of a symplectic manifold), 406 

of symplectic manifolds, 354 

of symplectic vector spaces, 317 


T 
Tangent 
bundle, 55 
mapping, 28, 56 
properties, 61 
space, 21, 22 
of a level set in a manifold, 50 
of a level set in R”, 12 
of a manifold with boundary, 179 
vector, 22 
of a curve, 30 
Taylor 
formula for manifolds, 104 
series on a Lie group, 240 
Tensor 
bundle, 74, 78 
field, 78 
on a manifold with boundary, 179 
product 


of local sections in a vector bundle, 73 


of vector bundle morphisms, 73 
of vector bundles, 73 
Theorem 
Arnold, 586 
Birkhoff normal form, 463, 467 
Birkhoff-Lewis, 469 
Carathéodory-Jacobi-Lie, 627 
Cartan-Lie, 628 
constant rank, 32 
Darboux, 356 
equivariant Darboux, 396 
Frobenius, 120 
Frobenius for Pfaffian systems, 209 
GauB, 202 
generalized Jacobi, 656 


Grobman-Hartman, 148 

Gromov Nonsqueezing, 451 
implicit mapping, 32 

inverse mapping, 31 

Jacobi, 647 

Kirillov, 377 

level sets in manifolds, 50 

level sets in R”, 11 

Liouville on integrable systems, 585 


Liouville on the phase space volume form, 
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Lyapunov 

for a critical integral curve, 151 

for a relatively critical integral curve, 

313 

Lyapunov centre, 459 
Lyapunov subcentre, 475 
Mishchenko-Fomenko, 633 
Nekhoroshey, 628 
Noether, 495 
on closed subgroups, 259 
on invariance of domain, 159 
orbit, 279 
Poincaré recurrence, 447 
Poincaré-Birkhoff, 488 
Poincaré-Cartan, 182 
reconstruction of vector bundles, 62 
Reeb, 421 
regular reduction, 509 
Sard, 34 
singular reduction, 524 
slice, see tubular neighbourhood 
splitting for Poisson manifolds, 369 
Stefan-Sussmann, 118 
Stokes, 180 
structure of strata, 296 
symplectic 

eigenvalue, 325 

foliation, 368 

tubular neighbourhood, 520 
transversal mapping, 48 
tubular neighbourhood 

for an embedded submanifold, 290 


for the orbits of a Lie group action, 287 


Weinstein, 394, 397, 407 
Thom catastrophes, 691 
Thom-Boardman singularities, 692 
Time-dependent 

canonical transformation, 443 

flow, 112 

Hamiltonian system, 441 

vector field, 111 
Toda, 576 
Toda lattice, 581 
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Top 
energy-momentum mapping, 560 
Euler, 542, 575 
action and angle variables, 608 
momentum mapping, 544 
noncommutative integrability, 637 
symplectic reduction, 542 
integrability, 572 
Kovalevskaya, 575 
Lagrange, 575 
Poisson structure, 573 
under an external force, 572 
Topological manifold, 2 
Transition 
functions 
of local trivializations, 58 
of Morse families, 686 
mappings 
of local charts, 2 
of local trivializations, 58 
Transitive Lie group action, 274 
Translations on a Lie group, 222 
momentum mapping, 501 
symplectic reduction, 512 
Transport 
equation, 693 
generalized and analytic solutions, 712 
of the Schrédinger equation, 724 
operator, 67 
Transversal 
crossing, 342 
mappings, 47 
submanifolds, 48 
Transversal Mapping Theorem, 48 
Tubular neighbourhood, 286 
symplectic, 521 
twisted product representation, 294 
Tubular Neighbourhood Theorem 
for an embedded submanifold, 290 
for the orbits of a Lie group action, 287 
Twisted product, 293 
Two-body problem, 570 
Two-centre problem, 571 
Typical fibre 
of a locally trivial fibre bundle, 294 
of a vector bundle, 58 
of an associated fibre bundle, 294 


U 
Unimodular and unitary groups, 14, 15, see 
also classical Lie groups 
Universal 
covering homomorphism, 224-226 
Maslov class, 340 
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Unstable manifold, 156 
of a linear vector field, 160 
of the modified harmonic oscillator, 161 
of the planar pendulum, 160 


Vv 
Value 
regular, singular or critical, 34 
weakly regular, 510 
Vanishing ideal, 92 
Vector bundle, 57 
morphism, 60 
symplectic, 354 
with boundary, 179 
Vector field, 65 
canonical vertical, 371 
characteristic 
of a coisotropic submanifold, 383 
of a differential form, 184 
conjugacy, 127 
contact, 388 
derivation property, 94 
equivariant, see invariant 
geodesic, 437, 533 
gradient, 418, 452 
Hamiltonian 
on a Poisson manifold, 365 
on a symplectic manifold, 360 
integral curve, 97 
invariant, 303 
Killing, 276 
left-invariant, 227 
Liouville, 371, 385 
locally Hamiltonian, 361 
on a level set in R”, 66 
on a manifold with boundary, 179 
periodically time-dependent, 114 
Poisson, 368 
Reeb, 388 
related under a mapping, 67 
right-invariant, 235 
symplectic, 360 
tangent to a submanifold, 90 
time-dependent, 111 
Vector Lie group, 220 
exponential mapping, 241 
Lie algebra, 231 
Vector subbundle, 84 
coisotropic, 359 
isotropic, 359 
Lagrangian, 359 
of a vector bundle with boundary, 179 
symplectic, 359 
symplectic orthogonal, 359 
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Vertical 
part of a vector bundle morphism, 83 
vector bundle morphism, 60 
vector subbundle, 84 
Volume form, 176 
left-invariant, 250 
of a Riemannian manifold, 196 
of a symplectic manifold, 355 
of a symplectic vector space, 318 


Ww 
Wall-Kashiwara index, 347 
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Wave equation, wave front, wave vector, 692, 


693 

Weakly embedded, see initial 
Weakly regular value, 510 
Weinstein 

conjecture, 450 

dual pair, 637 
Weinstein Theorems, 394, 397, 407 
Weinstein-Moser conjecture, 461 
Witt-Artin decomposition 

of a symplectic vector space, 323 

of the tangent space, 507 
WKB approximation, 723 


